Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA)

Luis Javier Rodríguez-Fuentes, Amparo Varona, Mikel Peñagarikano, Mireia Díez, Germán Bordel
2011 Revista de Procesamiento de Lenguaje Natural (SEPLN)  
the NIST 2011 Language Recognition Evaluation.  ...  suitable for developing and evaluating new language verification methods; and (2) preparing a competitive language verification system for telephone signals, with the aim of submitting it to  ...  Developing a competitive language recognition system for conversational telephone speech, which will eventually be presented to the NIST 2011 Language Recognition Evaluation. 3.  ... 
dblp:journals/pdln/Rodriguez-FuentesVPDB11 fatcat:7djrhpmecjgbxmlkkwqkjhktum

On the Use of Dot Scoring for Speaker Diarization [chapter]

Mireia Diez, Mikel Penagarikano, Amparo Varona, Luis Javier Rodriguez-Fuentes, German Bordel
2011 Lecture Notes in Computer Science  
This diarization system was developed for the Albayzin 2010 Speaker Diarization Evaluation on broadcast news.  ...  Results show that the lowest error rate that the clustering algorithm could attain for the evaluation set was around 20% and that over-segmentation was the main source of degradation, due to the lack of  ...  Although the evaluation was limited to Catalan TV speech, in order to increase the speaker variability, TV broadcast speech in Spanish, Catalan, Galician and Basque, taken from the Kalaka database [7]  ... 
doi:10.1007/978-3-642-21257-4_76 fatcat:mwm3ecwfn5am5oscqqaftpfvxu

Language Recognition on Albayzin 2010 LRE using PLLR features

Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel
2013 Revista de Procesamiento de Lenguaje Natural (SEPLN)  
Abstract: The so-called Phone Log-Likelihood Ratios (PLLR) have been introduced as alternative features to MFCC-SDC for iVector-based Language Recognition (LR) systems  ...  iVector systems trained on PLLRs achieve significant relative improvements over phonotactic systems and iVector systems trained on MFCC-SDC features, both under conditions  ...  Diez is supported by a research fellowship from the Department of Education, Universities and Research of the Basque Country Government.  ... 
dblp:journals/pdln/DiezVPRB13 fatcat:qe4gevb2ojaftc3ozxybz2gzq4

Scaling and universality in the human voice

J. Luque, B. Luque, L. Lacasa
2015 Journal of the Royal Society Interface  
Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.  ...  In order to understand the physics underlying speech production, in this work, we empirically analyse the statistics of large human speech datasets ranging several languages.  ...  After submission of this manuscript we learned of a recent publication [36] where the authors explore scaling and complexity matching in conversational speech, finding similar power-law distribution  ... 
doi:10.1098/rsif.2014.1344 pmid:25694542 pmcid:PMC4387524 fatcat:5p5d6ltv2fc67npun6c2r6227q

Speech earthquakes: scaling and universality in human voice [article]

Jordi Luque, Bartolo Luque, Lucas Lacasa
2014 arXiv   pre-print
Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.  ...  Speech is a distinctive complex feature of human capabilities.  ...  The authors would like to thank Luis Javier Rodríguez-Fuentes and Mikel Peñagarikano for recording and hand-labeling the speech corpus.  ... 
arXiv:1408.0985v1 fatcat:q5vk3hma3zejvftufmbhs2jxh4

Toward a Web-based Speech Corpus for Algerian Dialectal Arabic Varieties

Soumia Bougrine, Aicha Chorana, Abdallah Lakhdari, Hadda Cherroun
2017 Proceedings of the Third Arabic Natural Language Processing Workshop  
The success of machine learning for automatic speech processing has raised the need for large-scale datasets.  ...  In this paper, we devise a recipe for building large-scale Speech Corpora by harnessing Web resources, namely YouTube, other Social Media, Online Radio and TV.  ...  This is a speech database specifically designed for Spoken Language Recognition. The dataset provides TV broadcast speech for training, and audio data extracted from YouTube videos for testing.  ... 
doi:10.18653/v1/w17-1317 dblp:conf/wanlp/BougrineCLC17 fatcat:ggvdemzuvjgi7f7qqbqekfcada

Emergence of linguistic laws in human voice [article]

Ivan Gonzalez Torre, Bartolo Luque, Lucas Lacasa, Jordi Luque and Antoni Hernandez-Fernandez
2016 arXiv   pre-print
complexity and criticality in a biological system.  ...  These methods further pave the way for new comparative studies in animal communication or the analysis of signals of unknown code.  ...  BL acknowledges the hospitality and support of Queen Mary University of London, where part of this research was developed, and a Salvador de Madariaga fellowship.  ... 
arXiv:1610.02736v1 fatcat:dnhxt2x4vnavrmelb33sg7dxoa

The Albayzin 2010 language recognition evaluation

Luis Javier Rodriguez-Fuentes, Mikel Penagarikano, Amparo Varona, Mireia Diez, Germán Bordel
2011 Interspeech 2011   unpublished
A speech database was created for system development and evaluation. Speech signals were recorded from TV broadcasts, including clean and noisy speech.  ...  This paper presents the main features of the evaluation, analyses system performance on different conditions, including the confusion among languages, and gives hints for future evaluations.  ...  Acknowledgements We thank all the members of the Organizing Committee of FALA 2010 for their help and support. We also thank all the participants for their work and feedback.  ... 
doi:10.21437/interspeech.2011-322 fatcat:vvu46tpn3bbdvjm4ho7qa74hcu

I3A language recognition system for Albayzin 2010 LRE

David Martínez, Jesús Villalba, Antonio Miguel, Alfonso Ortega, Eduardo Lleida
2011 Interspeech 2011   unpublished
State-of-the-art methods for Language Recognition are adapted to and investigated in the KALAKA-2 database. Our primary system was ranked in the first position of the evaluation.  ...  This paper describes the two systems submitted to the Albayzin 2010 Language Recognition Evaluation by I3A.  ...  Acknowledgements We would like to thank GTTS for its great work organizing Albayzin 2010 LRE, and also the organization of Fala 2010 for supporting this evaluation.  ... 
doi:10.21437/interspeech.2011-326 fatcat:e5dua3d5jbhcziyvcekvi2kbzu

Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition

Mireia Diez, Amparo Varona, Mikel Penagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel
2013 Interspeech 2013   unpublished
In a previous work, we introduced the use of log-likelihood ratios of phone posterior probabilities, called Phone Log-Likelihood Ratios (PLLR) as features for language recognition under an iVector-based  ...  Finally, Principal Component Analysis (PCA) is also applied to the original PLLR vector as a feature projection method for comparison purposes.  ...  Mireia Diez is supported by a 4-year research fellowship from the Department of Education, University and Research of the Basque Country.  ... 
doi:10.21437/interspeech.2013-39 fatcat:ucm7dmkwengtxdf6ldne52mtde

The Albayzin 2012 language recognition evaluation

Luis Javier Rodríguez-Fuentes, Niko Brümmer, Mikel Penagarikano, Amparo Varona, Germán Bordel, Mireia Diez
2013 Interspeech 2013   unpublished
The Albayzin 2012 Language Recognition Evaluation (LRE), carried out from June to October 2012, was the third effort made by the Spanish/Portuguese community for benchmarking language recognition technology  ...  This paper presents the main features of the evaluation and analyses the performance of the submitted systems on the different conditions, including the confusion among target languages.  ...  Acknowledgements We thank all the members of the Organizing Committee of Iberspeech 2012 for their help and support. We also thank all the participants for their work and feedback.  ... 
doi:10.21437/interspeech.2013-387 fatcat:52lsgqegovetdpa2p2zwibz5kq