DIDELĮ KATEGORIJŲ KIEKĮ TURINČIŲ DRAUDIMO BENDROVĖS KLIENTŲ UŽKLAUSŲ, GAUTŲ ELEKTRONINIAIS LAIŠKAIS, LIETUVIŠKO TEKSTO KLASIFIKAVIMAS
CLASSIFICATION OF THE LITHUANIAN TEXT OF EMAIL ENQUIERIES OF AN INSURANCE COMPANY WITH A BIG NUMBER OF CUSTOMER CATEGORIES
Author(s): Karolis Kiaunė, Simona RamanauskaitėSubject(s): Computational linguistics, Baltic Languages, ICT Information and Communications Technologies
Published by: Vilniaus Universiteto Leidykla
Keywords: NLP; text classification; emails; text processing;
Summary/Abstract: Natural language processing and classification have been widely used in English-speaking countries. However, analysis and classification of a Lithuanian text is a complex issue and has not been fully implemented. This is due to complexity and peculiarities of the Lithuanian language, so methods appropriate for other languages, are not always appropriate for the Lithuanian language. Three selected word processing options and their various combinations were used and it was assessed how different and consistent text classification methods are able to classify insurance company customers‘ enquiries sent by email. This study is unique because a great number of methods were used and classification accuracy of a Lithuanian text in a large number of categories (33) was further assessed. Natural language processing problems, analogous studies of Lithuanian text classification were analyzed, research methodology was proposed and research findings were discussed in the paper.
Journal: Jaunųjų mokslininkų darbai
- Issue Year: 2019
- Issue No: 2 (49)
- Page Range: 52-59
- Page Count: 8
- Language: Lithuanian