Parsing and beyond. Tools and resources for Estonian
Author(s): Kadri Muischnek, Kaili Müürisep, Tiina Puolakainen
Subject(s): Morphology, Computational linguistics, Finno-Ugrian studies, ICT Information and Communications Technologies
Published by: Akadémiai Kiadó
Keywords: morphological disambiguation; dependency parsing; treebank; Constraint Grammar; Universal Dependencies; Estonian
Summary/Abstract: This article gives an overview of the state of the art in tools and resources for the syntactic analysis of Estonian. A morphosyntactic disambiguator, a surface-syntactic analyzer, and a dependency parser are all based on the Constraint Grammar formalism. As for language resources, a 400,000-word manually annotated dependency treebank has been created; its annotation scheme is compatible with the output of the Constraint Grammar dependency parser. Part of the treebank has been converted to the Universal Dependencies annotation scheme. Our tools have also been tested in large-scale corpus annotation.
- Issue Year: 64/2017
- Issue No: 3
- Page Range: 347-367
- Page Count: 21
- Language: English