A Morpho-syntactic Analysis of Human-moderated Hate Speech Samples from Wykop.pl WebService

Author(s): Inez Okulska, Anna Kołos
Subject(s): Language and Literature Studies, Theoretical Linguistics, Applied Linguistics, Communication studies, Pragmatics, Stylistics
Published by: Krakowskie Towarzystwo Popularyzowania Wiedzy o Komunikacji Językowej Tertium
Keywords: cyberbullying; hate speech; user-generated online content; automated detection; stylometry

Summary/Abstract: The dynamic increase in user-generated content on the web presents significant challenges in protecting Internet users from exposure to offensive material, such as cyberbullying and hate speech, while also minimizing the spread of wrongful conduct. However, designing automated detection models for such offensive content remains complex, particularly in languages with limited publicly available data. To address this issue, our research collaborates with the Wykop.pl web service to fine-tune a model using genuine content that has been banned by professional moderators. In this paper, we focus on the Polish language and discuss the notion of datasets and annotation frameworks, presenting our stylometric analysis of Wykop.pl content to identify morpho-syntactic structures that are commonly applied in cyberbullying and hate speech. By doing so, we contribute to the ongoing discussion on offensive language and hate speech in sociolinguistic studies, emphasizing the need to consider user-generated online content.

  • Issue Year: 8/2023
  • Issue No: 2
  • Page Range: 54-71
  • Page Count: 18
  • Language: English