Sentiment Classification of Bank Clients’ Reviews Written in the Polish Language
Sentiment Classification of Bank Clients’ Reviews Written in the Polish Language
Author(s): Adam IdczakSubject(s): Economy
Published by: Wydawnictwo Uniwersytetu Łódzkiego
Keywords: sentiment analysis; opinion mining; text classification; text mining; logistic regression; naive Bayes classifier
Summary/Abstract: It is estimated that approximately 80% of all data gathered by companies are text documents. This article is devoted to one of the most common problems in text mining, i.e. text classification in sentiment analysis, which focuses on determining the sentiment of a document. A lack of defined structure of the text makes this problem more challenging. This has led to the development of various techniques used in determining the sentiment of a document. In this paper, a comparative analysis of two methods in sentiment classification, a naive Bayes classifier and logistic regression, was conducted. Analysed texts are written in the Polish language and come from banks. The classification was conducted by means of a bag‑of‑n‑grams approach, where a text document is presented as a set of terms and each term consists of n words. The results show that logistic regression performed better.
Journal: Acta Universitatis Lodziensis. Folia Oeconomica
- Issue Year: 2/2021
- Issue No: 353
- Page Range: 43-56
- Page Count: 14
- Language: English