Development of a Large Spontaneous Speech Database of Agglutinative Hungarian Language [chapter]

Tilda Neuberger, Dorottya Gyarmathy, Tekla Etelka Gráczi, Viktória Horváth, Mária Gósy, András Beke
2014 Lecture Notes in Computer Science  
In this paper, a large Hungarian spoken language database is introduced. This phonetically-based multi-purpose database contains various types of spontaneous and read speech from 333 monolingual speakers (about 50 minutes of speech sample per speaker). This study presents the background and motivation of the development of the BEA Hungarian database, describes its protocol and the transcription procedure, and also presents existing and proposed research using this database. Due to its recording
more » ... protocol and the transcription it provides a challenging material for various comparisons of segmental structures of speech also across languages.
doi:10.1007/978-3-319-10816-2_51 fatcat:4ucbmrlewvfwtbkfcjyp54nxbe