Approaches for Parsing of Pages in Web Based Information Systems
Author(s): Yavor Tabov
Subject(s): Social Sciences, Economy, Business Economy / Management, Sociology, Evaluation research, Social Informatics, ICT Information and Communications Technologies
Published by: Университет за национално и световно стопанство (УНСС)
Keywords: data parsing; web data parsing; web scraping; regex; HTML DOM; XPath; web-based information system
Summary/Abstract: The report examines possible solutions for parsing data from web-based information systems. It introduces the concepts of data parsing, web data parsing, and web scraping, and outlines existing web scraping methods in the context of web-based information systems. It also examines the Python programming language as a tool for parsing pages in such systems and covers the different ways to implement a web scraper programmatically. Finally, it presents the conclusions of the study regarding the possibilities for parsing pages in web-based information systems.
- Page Range: 88-93
- Page Count: 6
- Publication Year: 2024
- Language: English
- Content File-PDF
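
Illustration: the keywords above name three parsing approaches (regex, HTML DOM, XPath) and the abstract names Python as the implementation language. The sketch below is not taken from the report; it only shows, under stated assumptions, what each approach might look like using the Python standard library. The sample page `PAGE`, the function names, and the `LinkCollector` class are hypothetical. The standard-library `html.parser` module is event-based, so it stands in here for tag-aware (DOM-style) parsing rather than building a full DOM tree, and `xml.etree.ElementTree` supports only a limited XPath subset and requires well-formed markup.

```python
import re
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Hypothetical, well-formed page fragment used only for illustration.
PAGE = """<html><body>
<h1>Catalogue</h1>
<ul>
<li class="item"><a href="/p/1">First</a></li>
<li class="item"><a href="/p/2">Second</a></li>
</ul>
</body></html>"""


def links_with_regex(html):
    """Regex approach: match href values directly in the raw text."""
    # Brittle by design: assumes double-quoted href as the first attribute.
    return re.findall(r'<a\s+href="([^"]*)"', html)


class LinkCollector(HTMLParser):
    """Tag-aware approach: react to each <a> start tag the parser reports."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href is not None:
                self.links.append(href)


def links_with_parser(html):
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


def links_with_xpath(html):
    """XPath approach: select anchors inside list items of class 'item'."""
    # ElementTree needs well-formed markup; real pages often are not.
    root = ET.fromstring(html)
    return [a.get("href") for a in root.findall(".//li[@class='item']/a")]


if __name__ == "__main__":
    print(links_with_regex(PAGE))
    print(links_with_parser(PAGE))
    print(links_with_xpath(PAGE))
```

On this sample all three functions return the same two links. They differ mainly in robustness: the regex depends on the exact attribute layout, the tag-aware parser tolerates attribute reordering and malformed markup, and the XPath query expresses structural conditions (here, the enclosing `li` class) most directly.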