Sourcing Data from Wikipedia for the Study of Language Contact: the csbwiki
Sourcing Data from Wikipedia for the Study of Language Contact: the csbwiki
Author(s): Robert BorgesSubject(s): Computational linguistics, ICT Information and Communications Technologies
Published by: Komisja Nauk Filologicznych Oddziału Polskiej Akademii Nauk we Wrocławiu
Keywords: contact-induced language change; wiki data; Kashubian; corpus linguistics; vowel alternation;
Summary/Abstract: Contact-induced language change is pervasive in contexts involving historically minoritized languages, where social contexts are not particularly conducive to equitable intergroup relations. Empirically driven studies involving these language contexts allow us to more thoroughly understand the social and cognitive processes that lead to language change. Paradoxically, empirical data on minoritized languages is relatively scarce and expensive to generate. But in the digital age we have the ability to look beyond the traditional data types used in language studies, like spoken data gathered under fieldwork conditions, literature, etc. In this paper, I will explore the potential utility of user-created wiki data in investigating Polish influence on the Kashubian language.
Journal: Academic Journal of Modern Philology
- Issue Year: 2022
- Issue No: 18
- Page Range: 7-22
- Page Count: 16
- Language: English