Filters








8 Hits in 3.6 sec

Speaker Diarization with Enhancing Speech for the First DIHARD Challenge

Lei Sun, Jun Du, Chao Jiang, Xueyang Zhang, Shan He, Bing Yin, Chin-Hui Lee
2018 Interspeech 2018  
The enhanced speech can boost the performance for the subsequent SAD, segmentation and clustering.  ...  To the best of our knowledge, this is the first time we show significant improvements of deep learning based single-channel speech enhancement over state-of-the-art diarization systems in highly mismatch  ...  While state-of-the-art diarization systems perform remarkably well for some domains (e.g., conversational telephone speech such as CallHome), as was discovered at the 2017 JSALT Summer Workshop at CMU  ... 
doi:10.21437/interspeech.2018-1742 dblp:conf/interspeech/SunDJZHYL18 fatcat:vrnnolsq6vhmvidwetpk73dti4

The Second DIHARD Diarization Challenge: Dataset, Task, and Baselines

Neville Ryant, Kenneth Church, Christopher Cieri, Alejandrina Cristia, Jun Du, Sriram Ganapathy, Mark Liberman
2019 Interspeech 2019  
We describe the task and metrics, challenge design, datasets, and baseline systems for speech enhancement, speech activity detection, and diarization.  ...  , noise conditions, and conversational domain.  ...  It is against this backdrop that the JSALT-2017 workshop [12] and DIHARD challenges 2 emerged.  ... 
doi:10.21437/interspeech.2019-1268 dblp:conf/interspeech/RyantCCCDGL19 fatcat:cxmnf46l6vc4bdr4ujm22n2bje

DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team [article]

Qingjian Lin, Weicheng Cai, Lin Yang, Junjie Wang, Jun Zhang, Ming Li
2020 arXiv   pre-print
For each module, we explore different techniques to enhance performance.  ...  In this paper, we present the submitted system for the second DIHARD Speech Diarization Challenge from the DKULENOVO team.  ...  However, the 2017 JSALT Summer Workshop at CMU found it hard to migrate the success to more challenging corpora including web videos, speech in the wild, child language recordings, etc [26] .  ... 
arXiv:2002.12761v2 fatcat:5te7enwf7fbkvhy5xjiyojmyje

The Nunavut Hansard Inuktitut-English Parallel Corpus 3.0 with Preliminary Machine Translation Results

Eric Joanis, Rebecca Knowles, Roland Kuhn, Samuel Larkin, Patrick Littell, Chi-kiu Lo, Darlene A. Stewart, Jeffrey Micher
2020 International Conference on Language Resources and Evaluation  
This paper describes a newly released sentence-aligned Inuktitut-English corpus based on the proceedings of the Legislative Assembly of Nunavut, covering sessions from April 1999 to June 2017.  ...  It is an official language of two territories, Nunavut and the Northwest Territories, and has recognition in additional regions.  ...  We would also like to thank the 2019 Annual Jelinek Memorial Workshop on Speech and Language Technology (JSALT) for providing a venue for experiments and feedback on the beta version of this corpus.  ... 
dblp:conf/lrec/JoanisKKLLLSM20 fatcat:nyepziu5hrgntawpgebzvreplu

ALICE: An open-source tool for automatic measurement of phoneme, syllable, and word counts from child-centered daylong recordings

Okko Räsänen, Shreyas Seshadri, Marvin Lavechin, Alejandrina Cristia, Marisa Casillas
2020 Behavior Research Methods  
A key measure to quantify from such data is the amount of speech present in children's home environments.  ...  We discuss the advantages and disadvantages of measuring different units from theoretical and technical points of view.  ...  Some of the daylong recordings in BER, and all recordings in TSE, YEL, MCD, and WAR, are available from HomeBank repository (VanDam et al., 2016) .  ... 
doi:10.3758/s13428-020-01460-x pmid:32875399 fatcat:qbjvaxg4xba3vg5u55cihazpg4

DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team

Qingjian Lin, Weicheng Cai, Lin Yang, Junjie Wang, Jun Zhang, Ming Li
2020 Odyssey 2020 The Speaker and Language Recognition Workshop   unpublished
For each module, we explore different techniques to enhance performance.  ...  In this paper, we present the submitted system for the second DIHARD Speech Diarization Challenge from the DKU-LENOVO team.  ...  However, the 2017 JSALT Summer Workshop at CMU found it hard to migrate the success to more challenging corpora including web videos, speech in the wild, child language recordings, etc [26] .  ... 
doi:10.21437/odyssey.2020-15 fatcat:bpci45neejftppr6z5d7atfy2i

Benchmarking: Past, Present and Future

Kenneth Church, Mark Liberman, Valia Kordoni
2021 Proceedings of the 1st Workshop on Benchmarking: Past, Present and Future   unpublished
Where have we been, and where are we going? It is easier to talk about the past than the future. These days, benchmarks evolve more bottom up (such as papers with code)  ...  , speech coding, speech recognition, speech enhancement, artificial neural networks, human-machine interaction using voice, optical character recognition, machine translation, and cross-lingual information  ...  Bio John Makhoul is a Chief Scientist at Raytheon BBN Technologies, Cambridge, MA, where he has been working on various aspects of speech and language processing, including speech analysis and synthesis  ... 
doi:10.18653/v1/2021.bppf-1.1 fatcat:ipnmbjgvqndjlhawhiximarfvy

Learning Representations of Social Media Users [article]

Adrian Benton
2018 arXiv   pre-print
We apply several extensions of generalized canonical correlation analysis to learn these representations and evaluate them at three tasks: predicting future hashtag mentions, friending behavior, and demographic  ...  and complicated as social media?  ...  , and Pan, 2017) , speech and cognitive impairment features 4 , as well as to learn multimodal representations of video (Tsai and Kender, 2017) .  ... 
arXiv:1812.00436v1 fatcat:qp2hf6f6nfe7djyjakkns36epq