Konzistence morfologického slovníku MorfFlex
Consistency of morphological dictionary MorfFlex

Author(s): Jaroslava Hlaváčová, Marie Mikulová, Barbora Štěpánková
Subject(s): Language and Literature Studies, Theoretical Linguistics, Applied Linguistics, Morphology
Published by: Jazykovedný ústav Ľudovíta Štúra Slovenskej akadémie vied
Keywords: morphological dictionary; morphological analysis; language corpus; the Czech language

Summary/Abstract: In addition to the texts themselves, language corpora usually contain various types of annotation. The most common is morphological annotation, which consists of assigning a lemma and a morphological tag to each word form. Morphological tagging traditionally relies on morphological dictionaries. Our paper presents a new version of MorfFlex, the so-called "Prague" morphological dictionary, which is used for tagging many Czech corpora (in particular the Prague Dependency Treebanks, the corpora published by the Institute of the Czech National Corpus in Prague, and the large Czech web corpora of the Aranea series). Three basic principles guided the update of the dictionary: the Golden Rule of Morphology, the Principle of Paradigm Unity, and the Principle of Paradigm Uniqueness.

  • Issue Year: 72/2021
  • Issue No: 4
  • Page Range: 855-861
  • Page Count: 7
  • Language: Czech