Sourcing Data from Wikipedia for the Study of Language Contact: the csbwiki Cover Image

Sourcing Data from Wikipedia for the Study of Language Contact: the csbwiki
Sourcing Data from Wikipedia for the Study of Language Contact: the csbwiki

Author(s): Robert Borges
Subject(s): Computational linguistics, ICT Information and Communications Technologies
Published by: Komisja Nauk Filologicznych Oddziału Polskiej Akademii Nauk we Wrocławiu
Keywords: contact-induced language change; wiki data; Kashubian; corpus linguistics; vowel alternation;

Summary/Abstract: Contact-induced language change is pervasive in contexts involving historically minoritized languages, where social contexts are not particularly conducive to equitable intergroup relations. Empirically driven studies involving these language contexts allow us to more thoroughly understand the social and cognitive processes that lead to language change. Paradoxically, empirical data on minoritized languages is relatively scarce and expensive to generate. But in the digital age we have the ability to look beyond the traditional data types used in language studies, like spoken data gathered under fieldwork conditions, literature, etc. In this paper, I will explore the potential utility of user-created wiki data in investigating Polish influence on the Kashubian language.

  • Issue Year: 2022
  • Issue No: 18
  • Page Range: 7-22
  • Page Count: 16
  • Language: English