Filters








3,825 Hits in 7.8 sec

An Efficient Feature Selection Method for Speaker Recognition

Hanwu Sun, Bin Ma, Haizhou Li
2008 2008 6th International Symposium on Chinese Spoken Language Processing  
It demonstrates that this approach can provide an efficient way to select high quality speech frames in the noisy environment for speaker recognition.  ...  In this paper, a new feature selection method for speaker recognition is proposed to keep the high quality speech frames for speaker modelling and to remove noisy and corrupted speech frames.  ...  SPECTRAL SUBTRACTION In order to make an efficient and robust feature selection for speaker recognition in noise environment, we adopt the spectral subtraction process to assistant the voice activity A  ... 
doi:10.1109/chinsl.2008.ecp.57 dblp:conf/iscslp/SunML08a fatcat:5zpui2zcwfgjbfih2uqdwqg33i

COSINE - A corpus of multi-party COnversational Speech In Noisy Environments

Alex Stupakov, Evan Hanusa, Jeff Bilmes, Dieter Fox
2009 2009 IEEE International Conference on Acoustics, Speech and Signal Processing  
We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (CO-SINE) corpus.  ...  The corpus is a set of multi-party conversations recorded in real world environments with background noise that can be used to train noise-robust speech recognition systems.  ...  effect of the noisy environment on the nature of the speech (Lombard effect).  ... 
doi:10.1109/icassp.2009.4960543 dblp:conf/icassp/StupakovHBF09 fatcat:lmpfzd2twvdmjg2ifugohbaym4

The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments

Alex Stupakov, Evan Hanusa, Deepak Vijaywargi, Dieter Fox, Jeff Bilmes
2012 Computer Speech and Language  
We present an overview of the data collection and transcription efforts for the COnversational Speech In Noisy Environments (COSINE) corpus.  ...  Several in-car corpora have been created, with multi-microphone recordings of limited-vocabulary speech in noisy environments.  ...  Acknowledgements This material is based upon work supported in part by DARPA's ASSIST Program (contract number NBCH-C-05-0137) and an ONR MURI grant (No. N000140510388).  ... 
doi:10.1016/j.csl.2010.12.003 fatcat:wm5xuivsfvejbcymqshtrybbj4

A Real-Time Speech Separation Method Based on Camera and Microphone Array Sensors Fusion Approach

Ching-Feng Liu, Wei-Siang Ciou, Peng-Ting Chen, Yi-Chun Du
2020 Sensors  
In the context of assisted human, identifying and enhancing non-stationary speech targets speech in various noise environments, such as a cocktail party, is an important issue for real-time speech separation  ...  In this study, we propose a real-time speech separation method based on an approach that combines an optical camera and a microphone array. The method was divided into two stages.  ...  in a noisy environment.  ... 
doi:10.3390/s20123527 pmid:32580328 fatcat:xfonjvw2izgp3c6qxztlal6do4

Speech based emotion classification framework for driver assistance system

Ashish Tawari, Mohan Trivedi
2010 2010 IEEE Intelligent Vehicles Symposium  
In this paper, we utilize contextual information of the outside environment as well as inside car user to improve the emotion recognition accuracy.  ...  Ever changing environment while driving poses a serious challenge to existing techniques for speech emotion recognition.  ...  We are thankful to our colleagues at CVRR lab for useful discussions and assistance.  ... 
doi:10.1109/ivs.2010.5547956 dblp:conf/ivs/TawariT10 fatcat:2hjvzsxc6nap7oflewrbfbt7gy

Benefits of incorporating the adaptive dynamic range optimization amplification scheme into an assistive listening device for people with mild or moderate hearing loss

Hung-Yue Chang, Ching-Hsing Luo, Tun-Shin Lo, Hsiao-Chuan Chen, Kuo-You Huang, Wen-Huei Liao, Mao-Chang Su, Shu-Yu Liu, Nan-Mai Wang
2017 Assistive technology  
The benchmark results showed that the ADRO scheme is effectively beneficial to people who experience mild or moderate hearing loss in noisy environments.  ...  satisfied with ADRO than they with the linear scheme in noisy environments.  ...  Acknowledgments The authors thank Merry Electronics, Inc. for providing research funding and equipment, Dynamic Hearing Pty Ltd for providing technical support, and audiologist Yung-Yu Chiang for assisting  ... 
doi:10.1080/10400435.2017.1317674 pmid:28846498 fatcat:is2oyxiewbdp5bunx44zuengd4

Construct Validity of the Ecological Momentary Assessment in Audiology Research

Yu-Hsiang Wu, Elizabeth Stangl, Xuyang Zhang, Ruth A. Bentler
2015 Journal of american academy of audiology  
In the field testing the participants reported the listening context (e.g., noisiness level) of several semicontrolled real-world conversations.  ...  In Experiment 2, participants repeatedly reported the degree of speech understanding, hearing aid use, and listening context using paper-and-pencil journals in their natural environments for 1 week.  ...  In each survey, the participants were allowed to select only one activity and one environmental category.  ... 
doi:10.3766/jaaa.15034 pmid:26554491 pmcid:PMC4732705 fatcat:gl6uxx5rcjaydhopc5r7r7gvaa

Characteristics of Real-World Signal to Noise Ratios and Speech Listening Situations of Older Adults With Mild to Moderate Hearing Loss

Yu-Hsiang Wu, Elizabeth Stangl, Octav Chipara, Syed Shabih Hasan, Anne Welhaven, Jacob Oleson
2018 Ear and Hearing  
procedures to estimate real-world speech communicative functions for older adults with hearing loss. activity categories (e.g., conversation in a group more than three people) and five environmental categories  ...  The mean speech level of the PLSs that described quiet situations was 62.8 dBA, and the mean SNR of the PLSs that represented noisy environments was 7.4 dB (speech = 67.9 dBA).  ...  The contents of this paper do not necessarily represent the policy of National Institute on Disability, Independent Living, and Rehabilitation Research, Administration for Community Living, and Department  ... 
doi:10.1097/aud.0000000000000486 pmid:29466265 pmcid:PMC5824438 fatcat:dkunxtt7kza3nofardewqvkwku

Benefits in Speech Recognition in Noise with Remote Wireless Microphones in Group Settings

Linda M Thibodeau
2019 Journal of american academy of audiology  
Although hearing aids (HAs) and cochlear implants (CIs) can provide significant benefits to persons with hearing loss, users frequently report difficulty hearing in noisy environments, particularly when  ...  The results suggest that the Roger Select microphone can provide significant benefits in speech recognition in noise over the use of HA/CI alone (61%) and also significant benefits over the use of a Roger  ...  Acknowledgments The author thanks Phonak for providing the wireless technology and the participants in the Summer Intensive Auditory Revitalization Conference for their involvement.  ... 
doi:10.3766/jaaa.19060 pmid:31758679 pmcid:PMC7353922 fatcat:3ezcpsddv5cvfk3pq6cupog6je

EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments [article]

Jacob Donley, Vladimir Tourbabin, Jung-Suk Lee, Mark Broyles, Hao Jiang, Jie Shen, Maja Pantic, Vamsi Krishna Ithapu, Ravish Mehra
2021 arXiv   pre-print
noisy environment.  ...  In this work, we describe, evaluate and release a dataset that contains over 5 hours of multi-modal data useful for training and testing algorithms for the application of improving conversations for an  ...  ACKNOWLEDGMENTS We would like to thank the research assistant for their excellent work helping collect this dataset (whose name is anonymous for privacy reasons).  ... 
arXiv:2107.04174v2 fatcat:owdguaovsnd67n57vm6l253jn4

Coordination of Speech Recognition Devices in Intelligent Environments with Multiple Responsive Devices

Benítez-Guijarro, Callejas, Noguera, Benghazi
2019 Proceedings (MDPI)  
In order to validate this approach, we have performed an evaluation using a corpus of 4608 audios recorded in a two-room intelligent environment with 24 microphones.  ...  input, both in clear and noisy settings.  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/proceedings2019031054 fatcat:igy7fbxmivfkpg66wacnojrfca

Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments [article]

Alejandro Mottini, Jaime Lorenzo-Trueba, Sri Vishnu Kumar Karlapati, Thomas Drugman
2021 arXiv   pre-print
Experimental results show that Voicy outperforms other tested VC techniques in terms of naturalness and target speaker similarity in noisy reverberant environments.  ...  However, many acoustic environments are noisy and reverberant, severely restricting the applicability of popular VC methods to such scenarios.  ...  This is a shortcoming, since most real acoustic environments are noisy and reverberant. Second, not all methods can perform nonparallel zero-shot conversion [8, 9] .  ... 
arXiv:2106.08873v1 fatcat:2hfsxmpb3bhalm5uyisch3agbi

Multimodal voice conversion based on non-negative matrix factorization

Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki
2015 EURASIP Journal on Audio, Speech, and Music Processing  
A multimodal voice conversion (VC) method for noisy environments is proposed.  ...  The effectiveness of the proposed method is confirmed by comparing its effectiveness with that of a conventional audio-input NMF-based method and a Gaussian mixture model-based method.  ...  We also investigated the effectiveness of the proposed method in various noisy environments.  ... 
doi:10.1186/s13636-015-0067-4 fatcat:o3fxru6ku5duvaw3hxkkkizwla

Introduction for Speech and language for interactive robots

Heriberto Cuayáhuitl, Kazunori Komatani, Gabriel Skantze
2015 Computer Speech and Language  
Environments are typically unstructured, dynamic and noisy and therefore challenging for robots.  ...  However, the effectiveness of interactive robots needs to be demonstrated in real (or at least realistic) environments. This special issue presents some case studies.  ...  Finally, we thank the anonymous reviewers for their support in maintaining the high standard of this special issue.  ... 
doi:10.1016/j.csl.2015.05.006 fatcat:eiqdafexljgzzkwjfpsxcmj46i

Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments [article]

Kouhei Sekiguchi, Aditya Arie Nugraha, Yicheng Du, Yoshiaki Bando, Mathieu Fontaine, Kazuyoshi Yoshii
2022 arXiv   pre-print
This method is used with a blind dereverberation method called weighted prediction error (WPE) for transcribing the noisy reverberant speech of a speaker, which can be detected from video or selected by  ...  This paper describes the practical response- and performance-aware development of online speech enhancement for an augmented reality (AR) headset that helps a user understand conversations made in real  ...  We thus integrate a blind dereverberation method called weighted prediction error (WPE) [22] , [23] , which has been shown to be effective for improving the ASR performance in diverse reverberant environments  ... 
arXiv:2207.07296v1 fatcat:65la6axo5rds5e6rll2ri2xuii
« Previous Showing results 1 — 15 out of 3,825 results