Data Collection for Studying Language Acquisition
Author(s): Velina Slavova
Subject(s): Education, ICT Information and Communications Technologies
Published by: Нов български университет
Keywords: Corpus linguistics; Databases; Language acquisition; Cognitive development; Statistical analysis
Summary/Abstract: Human language processing is an important problem in AI. The present paper offers a methodology of data collection designed for studying child language acquisition and aspects of language such as its structural and semantic features. The data come from 42 corpora extracted from CHILDES (Child Language Data Exchange System) and consist of transcribed dialogues of child speech in American English and French, annotated with parts of speech; they were used to construct a relational database. The main purpose of this data collection is to enable statistical studies of child language acquisition from the perspectives of cognitive science, applied linguistics, and AI. The paper presents the data sources, the resulting relational database, and some technical details of labelling, annotation, and storage.
Journal: Computer Science and Education in Computer Science
- Issue Year: 12/2016
- Issue No: 1
- Page Range: 119-127
- Page Count: 9
- Language: English