Building of Broadcast News Database for Evaluation of the Automated Subtitling Service
Building of Broadcast News Database for Evaluation of the Automated Subtitling Service
Author(s): Matúš Pleva, Jozef JuharSubject(s): Media studies, ICT Information and Communications Technologies
Published by: Žilinská univerzita v Žilině
Keywords: broadcast news; segmentation; speech recognition; transcriber;
Summary/Abstract: This paper describes the process of recording, annotation, correction and evaluation of the new Broadcast News (BN) speech database named KEMT-BN2, as an extension for our older KEMT-BN1 and COST-278 databases used for automatic Slovak continuous speech recognition development. The database utilisation and statistics are presented. This database was prepared for evaluation of the automated BN transcription system, developed in our laboratory, which is mainly used for subtitle generation for recorded BN shows. The speech database is the key part of the acoustic models training for specific domains and also for speaker and anchor adapted models creation.
Journal: Komunikácie - vedecké listy Žilinskej univerzity v Žiline
- Issue Year: 15/2013
- Issue No: 2A
- Page Range: 124-128
- Page Count: 5
- Language: English