Sõnaliik kui rakenduslik ja lingvistiline probleem: sõnaliikide märgendamine vana kirjakeele korpuses
Parts of speech as a functional and linguistic problem: annotation of parts of speech in the corpus of Old Written Estonian
Author(s): Külli Habicht, Külli Prillop, Pille Penjam
Subject(s): Language and Literature Studies
Published by: Eesti Rakenduslingvistika Ühing (ERÜ)
Keywords: morphosyntactical tagging; corpus linguistics; old literary Estonian; Estonian
Summary/Abstract: The article provides an overview of parts of speech annotation adjusted for the corpus of Old Written Estonian, against the background of parts of speech typology on the one hand and the previous treatments of parts of speech on the other. The corpus user is introduced to the principles of annotation and previous treatments of parts of speech. The core of the paper is constituted by practical issues of annotation concerning the boundaries of different parts of speech. Examples are drawn from the University of Tartu Corpus of Old Literary Estonian.
Journal: Eesti Rakenduslingvistika Ühingu aastaraamat
- Issue Year: 2011
- Issue No: 7
- Page Range: 19-41
- Page Count: 23
- Language: Estonian