Enriching Large Document Stores with Intelligent Metadata: a Framework for Effective Knowledge Management and Applied Analytics
Enriching Large Document Stores with Intelligent Metadata: a Framework for Effective Knowledge Management and Applied Analytics
Author(s): Penko Ivanov, Elitsa PavlovaSubject(s): Social Sciences, Education
Published by: Национално издателство за образование и наука „Аз-буки“
Keywords: software engineering; AI; data science; machine learning; NLP; metadata; knowledge graphs; ontology; metadata quality; business analytics
Summary/Abstract: The current paper focuses on a framework for structuring large document stores with the help of intelligent metadata. The described landscape includes a proprietary knowledge graph which ingests millions of concepts from external, third-party data providers and accommodates internal class taxonomies; an NLP service for automated annotation of textual data; an annotations quality control mechanism; tools for knowledge graph ontology and concept management; and an extensive API layer. The authors present an approach they have tested and proved successful in one of the leading media companies in the world, whose media content is a core data asset. The proposed solutions enable content analytics in their proper context and allow explicit and implicit connections between the content and other company data – i.e., user (media content consumer) data. The latter empowers the efficient application of advanced analytical models for searches and recommendations and the implementation of accurate data-driven virtual assistants. The paper advises addressing the metadata quality concerns, which the authors’ extensive practice identifies as an essential prerequisite for applied analytics delivering significant business value.
Journal: Математика и информатика
- Issue Year: 66/2023
- Issue No: 4
- Page Range: 339-352
- Page Count: 14
- Language: English
- Content File-PDF