Česká deverbální příjmení a problém jejich homonymie v elektronických korpusech
Czech Deverbal Surnames and the Problem of Their Homonymy in Electronic Corpora
Author(s): Markus Giger, Pavel ŠtěpánSubject(s): Language and Literature Studies
Published by: AV ČR - Akademie věd České republiky - Ústav pro jazyk český
Keywords: onomastics; anthroponyms; surnames; corpora
Summary/Abstract: The article discusses the problems connected with the homonymy of Czech deverbal surnames which must be solved during automatic annotation in electronic corpora. Solutions for improvement of these procedures, based on orthographic and punctuation criteria, on the list of corresponding surnames, and especially on syntactic features of surnames, are suggested in the paper. The degree of success of the steps proposed for disambiguation of the surnames will be very high; it can be estimated around 95 – 99 %.
Journal: Acta Onomastica
- Issue Year: XLVII/2006
- Issue No: 47
- Page Range: 185-196
- Page Count: 12
- Language: Czech