Phonetic pronunciations for arabic speech-to-text systems

F. Diehl, M.J.F. Gales, M. Tomalin, P.C. Woodland
2008 Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing  
In this paper two aspects of generating and using phonetic Arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of Arabic large vocabulary Automatic Speech Recognition (ASR) is investigated. These have been found to be useful for English ASR systems, when combined with standard multiple pronunciation systems. The second area examined is automatically deriving phonetic "pronunciations" for words that standard approaches, such as the Buckwalter
more » ... orphological Analyzer, cannot handle. Without pronunciations for these words the OOV rates for various Arabic tasks significantly increase. Here, pronunciations are automatically found by first deriving grapheme-to-phone rules, and associated rule probabilities. These are then used to produce the most likely pronunciation, or pronunciations, for any word. These approaches are evaluated on a large vocabulary Arabic Broadcast News and Broadcast Conversation transcription task. Both schemes are found to yield gains with a multi-pass/combination framework.
doi:10.1109/icassp.2008.4517924 dblp:conf/icassp/DiehlGTW08 fatcat:vsgzbm32b5eobb7cfennrshun4