A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017.
The file type is application/pdf.
Automatic pronunciation prediction for text-to-speech synthesis of dialectal Arabic in a speech-to-speech translation system
2012
2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Text-to-speech synthesis (TTS) is the final stage in the speech-to-speech (S2S) translation pipeline, producing an audible rendition of translated text in the target language. TTS systems typically rely on a lexicon to look up pronunciations for each word in the input text. This is problematic when the target language is dialectal Arabic, because the statistical machine translation (SMT) system usually produces undiacritized text output. Many words in the latter possess multiple pronunciations;
doi:10.1109/icassp.2012.6289032
dblp:conf/icassp/AnanthakrishnanTPNV12
fatcat:n443k2xijzfxtezn6gkkox7hse