Building of Broadcast News Database for Evaluation of the Automated Subtitling Service

Matus Pleva, Jozef Juhar
2013 Communications - Scientific Letters of the University of Zilina  
This paper describes the recording, annotation, correction and evaluation of a new Broadcast News (BN) speech database named KEMT-BN2, an extension of our older KEMT-BN1 and COST-278 databases used for developing automatic continuous speech recognition for Slovak. The utilisation and statistics of the database are presented. The database was prepared for evaluating the automated BN transcription system developed in our laboratory, which is mainly used for subtitle generation for
... rded BN shows. The speech database is a key component of acoustic model training for specific domains and of creating speaker- and anchor-adapted models.
doi:10.26552/com.c.2013.2a.124-128 fatcat:66ul7p3kq5acpoksgwyg2annbe