Some Current Problems of Corpus and Computational Linguistics or Fifteen Commandments and General Truths
Author(s): František Čermák
Subject(s): Language and Literature Studies, Applied Linguistics, Computational linguistics
Published by: AV ČR - Akademie věd České republiky - Ústav pro jazyk český
Keywords: corpus; corpus linguistics; computational linguistics; methodology; type of data; type of information; representativeness of corpora; systems of tagging; lemmatizers; ir/regularity in language
Summary/Abstract: In a brief, succinct and almost aphoristic way, this contribution critically presents a number of problems of today’s corpus and computational linguistics, together with their unsatisfactory solutions, while also trying to do away with a number of myths and simplified opinions in the field.
Journal: Korpus - gramatika - axiologie
- Issue Year: 2011
- Issue No: 3
- Page Range: 33-44
- Page Count: 12
- Language: English