Corpora of spoken Lithuanian
Corpora of spoken Lithuanian
Author(s): Laura Kamandulytė-Merfeldienė, Ineta DabašinkieneSubject(s): Language and Literature Studies
Published by: Eesti Rakenduslingvistika Ühing (ERÜ)
Keywords: corpus of spoken language; grammatical annotation; grammatical disambiguation; lexicon; adult-directed speech (ADS); child-directed speech (CDS); child speech (CS); Lithuanian
Summary/Abstract: The paper discusses the development of spoken Lithuanian corpora. In the analytical part longitudinal child language data as well as adult conversations are discussed in view of the issues that occurred during the period of data collection, transcription and coding. The data are transcribed and coded according to the requirements of CHILDES. The second part of the paper presents a corpus based analysis and provides preliminary results. The data of adult-directed speech, child-directed speech and child speech are analysed to reveal the frequency distribution of parts of speech. Spoken language is compared to written language in order to observe the tendencies of usage. The main differences and similarities within the spoken language registers are discussed as well.
Journal: Eesti Rakenduslingvistika Ühingu aastaraamat
- Issue Year: 2009
- Issue No: 5
- Page Range: 067-077
- Page Count: 11
- Language: English