O nowych ręcznie zrównoleglonych i znakowanych dwujęzycznych korpusach równoległych oraz ich zastosowaniach
On New Manually Aligned and Tagged Bilingual Parallel Corpora and Their Applications
Author(s): Roman RoszkoSubject(s): Language studies, Western Slavic Languages, Eastern Slavic Languages, South Slavic Languages, Baltic Languages, ICT Information and Communications Technologies
Published by: Instytut Slawistyki Polskiej Akademii Nauk
Keywords: manually aligned and tagged bilingual parallel corpora; Lithuanian language; Slavic languages; language tools; CLARIN-PL;
Summary/Abstract: This article is devoted to the manually aligned and tagged bilingual parallel CLARIN-PL-BIZ corpora of the Baltic and Slavic languages which are currently being developed. The study discusses the essential features of these corpora that make their applications go far beyond typical corpus analysis. Applications of these corpora include the design of cross-language models for the development of machine translation and artificial intelligence. The article also draws attention to the high potential of these resources as a model training base for testing natural language processing tools.
Journal: Acta Baltico Slavica
- Issue Year: 2021
- Issue No: 45
- Page Range: 1-20
- Page Count: 20
- Language: Polish