Towards Machine Speech-to-speech Translation

Satoshi Nakamura, Katsuhito Sudoh, Sakriani Sakti
2020 Tradumàtica tecnologies de la traducció  
There has been a good deal of research on machine speechto-speech translation (S2ST) in Japan, and this article presents these and our own recent research on automatic simultaneous speech translation. The S2ST system is basically composed of three modules: large vocabulary continuous automatic speech recognition (ASR), machine text-to-text translation (MT) and textto-speech synthesis (TTS). All these modules need to be multilingual in nature and thus require multilingual speech and corpora for
more » ... raining models. S2ST performance is drastically improved by deep learning and large training corpora, but many issues still still remain such as simultaneity, paralinguistics, context and situation dependency, intention and cultural dependency. This article presents current on-going research and discusses issues with a view to next-generation speech-tospeech translation.
doi:10.5565/rev/tradumatica.238 fatcat:n54mfznx5vfxnemrxjpoomhnaq