Some Current Problems of Corpus and Computational Linguistics or Fifteen Commandments and General Truths Cover Image

Some Current Problems of Corpus and Computational Linguistics or Fifteen Commandments and General Truths
Some Current Problems of Corpus and Computational Linguistics or Fifteen Commandments and General Truths

Author(s): František Čermák
Subject(s): Language and Literature Studies, Applied Linguistics, Computational linguistics
Published by: AV ČR - Akademie věd České republiky - Ústav pro jazyk český
Keywords: corpus; corpus lingustics; computational linguistics; methodology; type of data; type of information; representativeness of corpora; systems of tagging; lemmatizers; ir/regularity in language

Summary/Abstract: This contribution, which in a brief, succint and almost aphoristic way, critic-ally brings forward to the reader a number of problems of today’s corpus and computational lingu-istics as well as their unsatisfactory solutions, is trying, at the same time, to do away with a number of myths and simplified opinions in the field.

  • Issue Year: 2011
  • Issue No: 3
  • Page Range: 33-44
  • Page Count: 12
  • Language: English