Czech Deverbal Surnames and the Problem of Their Homonymy in Electronic Corpora Cover Image

Česká deverbální příjmení a problém jejich homonymie v elektronických korpusech
Czech Deverbal Surnames and the Problem of Their Homonymy in Electronic Corpora

Author(s): Markus Giger, Pavel Štěpán
Subject(s): Language and Literature Studies
Published by: AV ČR - Akademie věd České republiky - Ústav pro jazyk český
Keywords: onomastics; anthroponyms; surnames; corpora

Summary/Abstract: The article discusses the problems connected with the homonymy of Czech deverbal surnames which must be solved during automatic annotation in electronic corpora. Solutions for improvement of these procedures, based on orthographic and punctuation criteria, on the list of corresponding surnames, and especially on syntactic features of surnames, are suggested in the paper. The degree of success of the steps proposed for disambiguation of the surnames will be very high; it can be estimated around 95 – 99 %.

  • Issue Year: XLVII/2006
  • Issue No: 47
  • Page Range: 185-196
  • Page Count: 12
  • Language: Czech