Approaches for Parsing of Pages in Web Based Information Systems Cover Image

Approaches for Parsing of Pages in Web Based Information Systems
Approaches for Parsing of Pages in Web Based Information Systems

Author(s): Yavor Tabov
Subject(s): Social Sciences, Economy, Business Economy / Management, Sociology, Evaluation research, Social Informatics, ICT Information and Communications Technologies
Published by: Университет за национално и световно стопанство (УНСС)
Keywords: data parsing; web data parsing; web scrapping; regex; html dom; xpath; web-based information system
Summary/Abstract: The report examines possible solutions for data parsing from web-based information systems. The essence of the concepts data parsing, web data parsing and web scrapping is presented. Existing methods for web scrapping in the context of web-based information systems are indicated. The report also examines the Python programming language in the context of parsing of pages in web-based information systems and covers the different ways to implement a web scraper programmatically. Finally, conclusions summarized from the study are presented regarding the possibilities for parsing pages in web-based information systems.

Toggle Accessibility Mode