Converting Numeral Text in Bulgarian into Digit Number Using GATE
Converting Numeral Text in Bulgarian into Digit Number Using GATE
Author(s): Nadezhda Borisova, Elena KarashtranovaSubject(s): Education, School education, Vocational Education, Adult Education, Higher Education , History of Education, State/Government and Education, Distance learning / e-learning, Pedagogy
Published by: Национално издателство за образование и наука „Аз-буки“
Keywords: Natural language processing; Bulgarian grammar; GATE
Summary/Abstract: The Internet serves billions of users providing a variety of information resources whereby a lot of the information is presented in natural human language and needs an efficient approach to be processed. Natural language processing (NLP) refers to the ability of computers to analyze and understand the structure of human language. By utilizing NLP this linguistic knowledge is transformed into algorithms for solving specific problems. GATE is widely used, open-source software infrastructure that provides a framework and components for solving NLP tasks. The available GATE tools can be adapted to other languages and text processing tasks. This article will present an approach for converting numeric data, written as words in Bulgarian, into digit numbers. For this case, a relevant configuration file for Bulgarian has been integrated into the general tool set in the open source software for natural language processing GATE. The aim of this survey is to determine the exact numeric value of Bulgarian text numeric data, which can be used as a starting point for producing more complex annotations, such as monetary measurement units, etc.
Journal: Математика и информатика
- Issue Year: 65/2022
- Issue No: 3
- Page Range: 231-246
- Page Count: 16
- Language: English
- Content File-PDF