Creation of Language Resources for the Development of a Medical Speech Recognition System for Latvian

Roberts Dargis, Normunds Gruzitis, Ilze Auzina, Kaspars Stepanovs
2020 Human Language Technology - The Baltic Perspectiv  
This paper describes an ongoing work on the creation of Latvian language resources for the medical domain focusing on digital imaging to develop a medical speech recognition system for Latvian. The language resources include a pronunciation lexicon, a text corpus for language modelling, and an orthographically transcribed speech corpus for the (i) adaptation of the acoustic model, (ii) evaluation of the speech recognition accuracy, (iii) development and testing of rewrite rules for automatic
more » ... t conversion to the spoken form and back to the written form. This work is part of a larger industry-driven research project which aims at the development of specific Latvian speech recognition systems for the medical domain.
doi:10.3233/faia200615 dblp:conf/hlt/DargisGAS20 fatcat:w32xnll3jvfn3kjjjktc4niac4