A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Filters
Spoken language recognition in conversational telephone speech and TV broadcast news (GLOSA)
2011
Revista de Procesamiento de Lenguaje Natural (SEPLN)
la NIST 2011 Language Recognition Evaluation. ...
adecuada para desarrollar y evaluar nuevos métodos de verificación de la lengua; y (2) preparar un sistema competitivo de verificación de la lengua para señales telefónicas con objeto de presentarlo a ...
Developing a competitive language recognition system for conversational telephone speech, which will be eventually presented to the NIST 2011 Language Recognition Evaluation. 3. ...
dblp:journals/pdln/Rodriguez-FuentesVPDB11
fatcat:7djrhpmecjgbxmlkkwqkjhktum
On the Use of Dot Scoring for Speaker Diarization
[chapter]
2011
Lecture Notes in Computer Science
This diarization system was developed for the Albayzin 2010 Speaker Diarization Evaluation on broadcast news. ...
Results show that the lowest error rate that the clustering algorithm could attain for the evaluation set was around 20% and that over-segmentation was the main source of degradation, due to the lack of ...
Although the evaluation was limited to Catalan TV speech, in order to increase the speaker variability, TV broadcast speech in Spanish, Catalan, Galician and Basque, taken from the Kalaka database [7] ...
doi:10.1007/978-3-642-21257-4_76
fatcat:mwm3ecwfn5am5oscqqaftpfvxu
Language Recognition on Albayzin 2010 LRE using PLLR features
2013
Revista de Procesamiento de Lenguaje Natural (SEPLN)
Resumen: Los así denominados Phone Log-Likelihood Ratios (PLLR), han sido introducidos como características alternativas a los MFCC-SDC para sistemas de Reconocimiento de la Lengua (RL) mediante iVectors ...
Los sistemas de iVectors entrenados con PLLRs obtienen mejoras relativas significativas respecto a los sistemas fonotácticos y sistemas de iVectors entrenados con características MFCC-SDC, tanto en condiciones ...
Diez is supported by a research fellowship from the Department of Education, Universities and Research of the Basque Country Government. ...
dblp:journals/pdln/DiezVPRB13
fatcat:qe4gevb2ojaftc3ozxybz2gzq4
Scaling and universality in the human voice
2015
Journal of the Royal Society Interface
Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech. ...
In order to understand the physics underlying speech production, in this work, we empirically analyse the statistics of large human speech datasets ranging several languages. ...
After submission of this manuscript we learned of a recent publication [36] where the authors explore scaling and complexity matching in conversational speech, finding similar power-law distribution ...
doi:10.1098/rsif.2014.1344
pmid:25694542
pmcid:PMC4387524
fatcat:5p5d6ltv2fc67npun6c2r6227q
Speech earthquakes: scaling and universality in human voice
[article]
2014
arXiv
pre-print
Results are robust and independent of the communication language or the number of speakers, pointing towards an universal pattern and yet another hint of complexity in human speech. ...
Speech is a distinctive complex feature of human capabilities. ...
The authors would like to thank Luis Javier Rodríguez-Fuentes and Mikel Peñagarikano for recording and hand-labeling the speech corpus. ...
arXiv:1408.0985v1
fatcat:q5vk3hma3zejvftufmbhs2jxh4
Toward a Web-based Speech Corpus for Algerian Dialectal Arabic Varieties
2017
Proceedings of the Third Arabic Natural Language Processing Workshop
The success of machine learning for automatic speech processing has raised the need for large scale datasets. ...
In this paper, we devise a recipe for building largescale Speech Corpora by harnessing Web resources namely YouTube, other Social Media, Online Radio and TV. ...
This is a speech database specifically designed for Spoken Language Recognition. The dataset provides TV broadcast speech for training, and audio data extracted from YouTube videos for testing. ...
doi:10.18653/v1/w17-1317
dblp:conf/wanlp/BougrineCLC17
fatcat:ggvdemzuvjgi7f7qqbqekfcada
Emergence of linguistic laws in human voice
[article]
2016
arXiv
pre-print
complexity and criticality in a biological system. ...
These methods further pave the way for new comparative studies in animal communication or the analysis of signals of unknown code. ...
BL acknowledges the hospitality and support of Queen Mary University of London, where part of this research was developed, and a Salvador de Madariaga fellowship. ...
arXiv:1610.02736v1
fatcat:dnhxt2x4vnavrmelb33sg7dxoa
The Albayzin 2010 language recognition evaluation
2011
Interspeech 2011
unpublished
A speech database was created for system development and evaluation. Speech signals were recorded from TV broadcasts, including clean and noisy speech. ...
This paper presents the main features of the evaluation, analyses system performance on different conditions, including the confusion among languages, and gives hints for future evaluations. ...
Acknowledgements We thank all the members of the Organizing Committee of FALA 2010 for their help and support. We also thank all the participants for their work and feedback. ...
doi:10.21437/interspeech.2011-322
fatcat:vvu46tpn3bbdvjm4ho7qa74hcu
I3a language recognition system for albayzin 2010 LRE
2011
Interspeech 2011
unpublished
State-of-the art methods for Language Recognition are adapted to and investigated in the KALAKA-2 database. Our primary system was ranked in the first position of the evaluation. ...
This paper describes the two systems submitted to the Albayzin 2010 Language Recognition Evaluation by I3A. ...
Acknowledgements We would like to thank GTTS for his big work organizing Albayzin 2010 LRE, and also the organization of Fala 2010 for supporting this evaluation. ...
doi:10.21437/interspeech.2011-326
fatcat:e5dua3d5jbhcziyvcekvi2kbzu
Dimensionality reduction of phone log-likelihood ratio features for spoken language recognition
2013
Interspeech 2013
unpublished
In a previous work, we introduced the use of log-likelihood ratios of phone posterior probabilities, called Phone Log-Likelihood Ratios (PLLR) as features for language recognition under an iVector-based ...
Finally, Principal Component Analysis (PCA) is also applied to the original PLLR vector as a feature projection method for comparison purposes. ...
Mireia Diez is supported by a 4-year research fellowship from the Department of Education, University and Research of the Basque Country. ...
doi:10.21437/interspeech.2013-39
fatcat:ucm7dmkwengtxdf6ldne52mtde
The albayzin 2012 language recognition evaluation
2013
Interspeech 2013
unpublished
The Albayzin 2012 Language Recognition Evaluation (LRE), carried out from June to October 2012, was the third effort made by the Spanish/Portuguese community for benchmarking language recognition technology ...
This paper presents the main features of the evaluation and analyses the performance of the submitted systems on the different conditions, including the confusion among target languages. ...
Acknowledgements We thank all the members of the Organizing Committee of Iberspeech 2012 for their help and support. We also thank all the participants for their work and feedback. ...
doi:10.21437/interspeech.2013-387
fatcat:52lsgqegovetdpa2p2zwibz5kq