Tweets as a Challenge for the Automatic Linguistic Processing Cover Image

Туитовете като предизвикателство пред автоматичната лингвистична обработка
Tweets as a Challenge for the Automatic Linguistic Processing

Author(s): Petya Osenova
Subject(s): Language studies, Language and Literature Studies, Eastern Slavic Languages, Philology
Published by: Великотърновски университет „Св. св. Кирил и Методий”
Keywords: Tweet; Bulgarian language; automatic linguistic processing; written colloquial speech

Summary/Abstract: The paper focuses on the specificities of the written colloquial speech in tweets as a challenge for the automatic linguistic analysis. Such an analysis includes: text segmentation into words; morphological analysis in parts-of-speech and related grammatical characteristics; dependency syntactic analysis; named entity recognition of people, locations and organizations; handling abbreviations. The problems are of the following kinds: out-of-vocabulary words; word blending; colloquial variants that have not been normalized, etc. The survey explores 630 tweets that discuss the crisis of two banks in Bulgaria in 2014.

  • Issue Year: 11/2018
  • Issue No: 1
  • Page Range: 205-216
  • Page Count: 12
  • Language: Bulgarian
Toggle Accessibility Mode