Comparison of ALBAYZIN query-by-example spoken term detection 2012 and 2014 evaluations

Javier Tejedor, Doroteo T. Toledano, Paula Lopez-Otero, Laura Docio-Fernandez, Carmen Garcia-Mateo
2016 EURASIP Journal on Audio, Speech, and Music Processing  
Despite the significant improvement obtained by the systems submitted to this second evaluation compared to those of the first evaluation, results still show the difficulty of this task and indicate that  ...  Four different research groups took part in the evaluations held in 2012 and 2014. In 2014, new multi-word and foreign queries were added to the single-word and in-language queries used in 2012.  ...  contract GRC2014/024 (Modalidade: Grupos de Referencia Competitiva 2014) and 'AtlantTIC Project' CN2012/160, and also by the Spanish Government and the European Regional Development Fund (ERDF) under  ... 
doi:10.1186/s13636-016-0080-2 fatcat:les5r3cipzcsnhrxrv6lgiuedu

Challenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak) [chapter]

Jan Nouza, Jindrich Zdansky, Petr Cerva, Jan Silovsky
2010 Lecture Notes in Computer Science  
All the presented systems operate in real time with very large vocabularies (350K words in Czech, 170K words in Slovak) and some of them have been already deployed in practice.  ...  This has a large impact on the size of lexical inventories in these languages, and significantly complicates the design of text-to-speech and, in particular, speech-to-text systems.  ...  Acknowledgement This work was supported by grants no. 102/08/0707 by the Grant Agency of the Czech Republic and no.  ... 
doi:10.1007/978-3-642-12397-9_19 fatcat:h3tegqw37zhnrm6vk53nnc6ewa

Computational intelligence in processing of speech acoustics: a survey

Amitoj Singh, Navkiran Kaur, Vinay Kukreja, Virender Kadyan, Munish Kumar
2022 Complex & Intelligent Systems  
This paper presents a comprehensive survey on the speech recognition techniques for non-Indian and Indian languages, and compiled some of the computational models used for processing speech acoustics.  ...  It has also been observed that the research in this field is quite thin and still more research needs to be carried out, particularly in the case of minority languages.  ...  [143] build the acoustic model with the combination of knowledge-based and statistical approaches to create several different phoneme sets.  ... 
doi:10.1007/s40747-022-00665-1 fatcat:6pu2xccbq5as7bn2y2tav2fdwa

Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation

Javier Tejedor, Doroteo T. Toledano, Paula Lopez-Otero, Laura Docio-Fernandez, Mikel Peñagarikano, Luis Javier Rodriguez-Fuentes, Antonio Moreno-Sandoval
2019 EURASIP Journal on Audio, Speech, and Music Processing  
The results suggest that the QbE STD task is still in progress, and the performance of these systems is highly sensitive to changes in the data domain.  ...  The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority area nowadays.  ...  derived from a Gaussian mixture model (GMM) tokenizer, phoneme recognition, and acoustic segment modeling are used in [45].  ... 
doi:10.1186/s13636-019-0156-x fatcat:y3rcaw35gvhthnwptttw5kscfi

The Multi-Domain International Search on Speech 2020 ALBAYZIN Evaluation: Overview, Systems, Results, Discussion and Post-Evaluation Analyses

Javier Tejedor, Doroteo T. Toledano, Jose M. Ramirez, Ana R. Montalvo, Juan Ignacio Alvarez-Trejos
2021 Applied Sciences  
The most novel features of the submitted systems are a data augmentation technique for the STD task and an end-to-end system for the QbE STD task.  ...  The obtained results suggest that there is clearly room for improvement in the SoS task and that performance is highly sensitive to changes in the data domain.  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/app11188519 fatcat:rut6jfdlrbb2bcycey42dsll5e

Acoustic Modelling for Under-Resourced Languages

Sebastian Stüker
2009
In this thesis we examine methods to rapidly create acoustic models in new, possibly under-resourced languages, in a time and cost effective manner.  ...  For this we examine the use of multilingual models, the application of articulatory features across languages, and the automatic discovery of word-like units in unwritten languages.  ...  We thus substituted all nouns in the unsegmented phoneme string of the target language by the correct word and then repeated our word discovery algorithm.  ... 
doi:10.5445/ir/1000014983 fatcat:7pfagjzw3jbeffmrd6qbnllkym