Historický korpus slovenčiny: hist-6.0
Corpus of Historical Slovak: hist-6.0

Author(s): Katarína Rausová
Subject(s): Language studies, Language and Literature Studies, Theoretical Linguistics, Applied Linguistics, Historical Linguistics, Western Slavic Languages
Published by: Jazykovedný ústav Ľudovíta Štúra Slovenskej akadémie vied
Keywords: corpus; Slovak language; diachronic; transliteration rules; tagging

Summary/Abstract: The article presents the sixth version of the Corpus of Historical Slovak, designated hist-6.0. The Corpus of Historical Slovak is a diachronic corpus of Slovak texts from the pre-codification period. It contains both texts that the project transliterated from photocopies of the originals and printed texts that preserve the original orthography. Preparation of the current version began in autumn 2020. The article describes three areas of conceptual development in this version of the corpus: 1. the addition of new texts, 2. the unification of the rules for text transliteration, and 3. the unification of the existing tagging.

  • Issue Year: 89/2024
  • Issue No: 1
  • Page Range: 156-163
  • Page Count: 8
  • Language: Slovak