Development of an Amharic text-to-speech system using cepstral method

Tadesse Anberbir, Tomio Takara
2009 Proceedings of the First Workshop on Language Technologies for African Languages - AfLaT '09   unpublished
This paper presents a speech synthesis system for Amharic language and describes and how the important prosodic features of the language were modeled in the system. The developed Amharic Text-to-Speech system (AmhTTS) is parametric and rule-based that employs a cepstral method. The system uses a source filter model for speech production and a Log Magnitude Approximation (LMA) filter as the vocal tract filter. The intelligibility and naturalness of the system was evaluated by word and sentence
more » ... stening tests respectively and we achieved 98% correct-rates for words and an average Mean Opinion Score (MOS) of 3.2 (which is categorized as good) for sentences listening tests. The synthesized speech has high intelligibility and moderate naturalness. Comparing with previous similar study, our system produced considerably similar quality speech with a fairly good prosody. In particular our system is mainly suitable for building new languages with little modification.
doi:10.3115/1564508.1564517 fatcat:surr6fifn5c4jgzv2hdxnl5gr4