Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors [article]

Henri Gode, Marvin Tammen, Simon Doclo
2021 arXiv   pre-print
Recently, the convolutional weighted power minimization distortionless response (WPD) beamformer was proposed, which unifies multi-channel weighted prediction error dereverberation and minimum power distortionless  ...  response beamforming.  ...  Second, to achieve dereverberation, the so-called weighted prediction error (WPE) technique is commonly applied in the short-time Fourier transform (STFT) domain [12], [13], [14].  ... 
arXiv:2106.01902v1 fatcat:vellxiusmzblpmiiryetjdfsbq

Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments

Kyoungjin Noh, Joon-Hyuk Chang
2020 Sensors  
Next, the STFT coefficients of the dereverberated multi-channel audio signals are conveyed to the DNN-supported minimum variance distortionless response (MVDR) beamformer in which DNN-supported MVDR beamforming  ...  prediction error (WPE) dereverberation with the estimated masks.  ...  WPE, weighted prediction error; MVDR, minimum variance distortionless response; SED, sound event detection.  ... 
doi:10.3390/s20071883 pmid:32231161 fatcat:pxjtoka2y5hcjh2fdbwfeqawyq

Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator [article]

Dushyant Sharma and Rong Gong and James Fosburgh and Stanislav Yu. Kruchinin and Patrick A. Naylor and Ljubomir Milanovic
2022 arXiv   pre-print
We present a novel multi-channel front-end based on channel shortening with the Weighted Prediction Error (WPE) method followed by a fixed MVDR beamformer used in combination with a recently proposed self-attention-based  ...  channel combination (SACC) scheme, for tackling the distant ASR problem.  ...  combination.  ... 
arXiv:2203.13919v1 fatcat:4wqksvsgj5anrgdybqtpfm2dji

End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming [article]

Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Shinji Watanabe, Yanmin Qian
2020 arXiv   pre-print
First, a multi-source mask-based weighted prediction error (WPE) module is incorporated in the frontend for dereverberation.  ...  Second, another novel frontend architecture is proposed, which extends the weighted power minimization distortionless response (WPD) convolutional beamformer to perform simultaneous separation and dereverberation  ...  variance distortionless response (MVDR) and minimum power distortionless response (MPDR) beamforming [12, 13], etc.  ... 
arXiv:2005.10479v2 fatcat:yxjujwvewfdhnl3mwlf4w6zavy

End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming

Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Shinji Watanabe, Yanmin Qian
2020 Interspeech 2020  
First, a multisource mask-based weighted prediction error (WPE) module is incorporated in the frontend for dereverberation.  ...  Second, another novel frontend architecture is proposed, which extends the weighted power minimization distortionless response (WPD) convolutional beamformer to perform simultaneous separation and dereverberation  ...  variance distortionless response (MVDR) and minimum power distortionless response (MPDR) beamforming [12, 13], multi-frame beamforming  ...  († Shinji Watanabe and Yanmin Qian are the corresponding authors.)
doi:10.21437/interspeech.2020-2432 dblp:conf/interspeech/ZhangSC0Q20 fatcat:bnylkmimlfe4fngtgepqjf7eym

Jointly optimal denoising, dereverberation, and source separation [article]

Tomohiro Nakatani, Christoph Boeddeker, Keisuke Kinoshita, Rintaro Ikeshita, Marc Delcroix, Reinhold Haeb-Umbach
2020 arXiv   pre-print
Conventionally, cascade configuration composed of a Weighted Prediction Error minimization (WPE) dereverberation filter followed by a Minimum Variance Distortionless Response beamformer has been used as  ...  This paper refers to a CBF optimized by this objective function as a weighted Minimum-Power Distortionless Response (wMPDR) CBF.  ...  Conventionally, cascade configuration composed of a Weighted Prediction Error minimization (WPE) dereverberation filter followed by a Minimum Variance Distortionless Response (MVDR) beamformer has been  ... 
arXiv:2005.09843v2 fatcat:bdhveyic2jbnhdyuuzxyano4oa

Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation

Lukas Drude, Christoph Boeddeker, Jahn Heymann, Reinhold Haeb-Umbach, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani
2018 Interspeech 2018  
The weighted prediction error (WPE) algorithm has proven to be a very successful dereverberation method for the REVERB challenge.  ...  For these integrated variants we identify a consistent word error rate (WER) reduction on two distinct databases.  ...  Delcroix et al. compare consecutive execution of either minimum variance distortionless response (MVDR) beamforming followed by WPE or vice versa [15] on REVERB challenge data [16] but do not consider  ... 
doi:10.21437/interspeech.2018-2196 dblp:conf/interspeech/DrudeBHHKDN18 fatcat:dzqaxz2wvjaltoy6tbjyc5in3q

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend [article]

Wangyou Zhang, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian
2021 arXiv   pre-print
In this work, we focus on the multichannel multi-speaker reverberant condition, and propose to extend our previous framework for end-to-end dereverberation, beamforming, and speech recognition with improved  ...  speech dereverberation and separation performance (SDR=12.5 dB) in the reverberant multi-speaker condition while trained only with the ASR criterion.  ...  In addition, we mainly support two alternative beamformer types, respectively, based on 1) minimum variance distortionless response (MVDR) [28] and 2) wMPDR.  ... 
arXiv:2102.11525v1 fatcat:krumqa2svbegrpbqcsa6svh4ea

An Effective Dereverberation Algorithm by Fusing MVDR and MCLP [article]

Fengqi Tan, Changchun Bao
2022 arXiv   pre-print
In this paper, minimum variance distortionless response (MVDR) beamformer and MCLP are effectively fused in the dereverberation, where the PSD of target speech used for Kalman filter is modified in the  ...  In order to solve this problem, many methods for the dereverberation have emerged.  ...  Minimum variance distortionless response (MVDR) beamforming [8] requires the estimation of covariance matrix of the noise and the steering vector of target signal relative to the reference microphone  ... 
arXiv:2203.14561v1 fatcat:y6xggffbkrah3mosgeyxr2gyfa

Deep Learning Applied to Dereverberation and Sound Event Classification in Reverberant Environments

Mingsian R. Bai, Wen-Chuan Chen
2019 Proceedings of the ICA congress  
filter (MWF), and the variance-normalized delayed linear prediction (NDLP).  ...  A room response simulator based on the image source method is employed to create reverberant signals for numerous RT60 conditions in the training phase.  ...  Alternatively, the Multichannel Wiener Filter (MWF) (6) which consists of a Minimum Variance Distortionless Response (MVDR) cascaded with a single channel post filter can be used.  ... 
doi:10.18154/rwth-conv-238800 fatcat:sgkqh55sfrc53mjfwl7b2va3je

Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence [article]

Scott Wisdom, Thomas Powers, Les Atlas, James Pitton
2015 arXiv   pre-print
In the case of multiple microphones, we preprocess the data with either a minimum variance distortionless response (MVDR) beamformer, or a delay-and-sum beamformer (DSB).  ...  We evaluate our algorithm on both speech enhancement and recognition tasks for the REVERB challenge dataset.  ...  Acknowledgments We wish to thank Derek Huang for his help with the Kaldi tools. This work is funded by ONR contract N00014-12-G-0078, delivery order 0013, and ARO grant number W911NF1210277.  ... 
arXiv:1509.00533v1 fatcat:bg2w4swpunettbf677xcldo7qm

New Insights Into the MVDR Beamformer in Room Acoustics

E. Habets, J. Benesty, I. Cohen, S. Gannot, J. Dmochowski
2010 IEEE Transactions on Audio, Speech, and Language Processing  
The minimum variance distortionless response (MVDR) beamformer, also known as Capon's beamformer, is widely studied in the area of speech enhancement.  ...  The MVDR beamformer can be used for both speech dereverberation and noise reduction. This paper provides new insights into the MVDR beamformer.  ...  MINIMUM VARIANCE DISTORTIONLESS RESPONSE BEAMFORMER We now derive the celebrated MVDR beamformer proposed by Capon [3] in the context of room acoustics.  ... 
doi:10.1109/tasl.2009.2024731 fatcat:vl7ek4ezwra6df2dhzlnjwnfgy

Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness [article]

Dianwen Ng, Jin Hui Pang, Yang Xiao, Biao Tian, Qiang Fu, Eng Siong Chng
2022 arXiv   pre-print
In this paper, we present a multi-channel ConvMixer for speech command recognitions.  ...  In addition, a far-field and noisy environment with multiple signals interference aggravates the problem causing the accuracy to degrade significantly.  ...  Acknowledgements This work was supported by Alibaba Group through Alibaba Innovative Research (AIR) Program and Alibaba-NTU Singapore Joint Research Institute (JRI), Nanyang Technological University, Singapore  ... 
arXiv:2204.05445v1 fatcat:2a2eygx3kvbslawgryp3zw2mfe

An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions [article]

Aswin Shanmugam Subramanian, Xiaofei Wang, Shinji Watanabe, Toru Taniguchi, Dung Tran, Yuya Fujita
2019 arXiv   pre-print
This report uses a recently developed architecture for far-field ASR by composing neural extensions of dereverberation and beamforming modules with the S2S ASR module as a single differentiable neural  ...  It is clear from both recent challenge outcomes and successful products that far-field systems would be incomplete without solving both denoising and dereverberation simultaneously.  ...  Weighted prediction error (WPE) [24, 25] is a technique based on variance normalized long term linear prediction popularly used for dereverberation of wet (reverberant) signals.  ... 
arXiv:1904.09049v3 fatcat:qut2jhht7bafpkfbuo5iwhzjde

A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions

Sunit Sivasankaran, Emmanuel Vincent, Irina Illina
2017 Computer Speech and Language  
Our results indicate that performing weighted prediction error (WPE) dereverberation on a reverberated test speech utterance and decoding using a deep neural network (DNN) acoustic model trained with  ...  A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions. Computer Speech and Language, Elsevier, 2017, 46, pp.  ...  Delcroix et al. (2015) detailed different techniques to counter reverberation using the REVERB dataset such as dereverberation using weighted prediction error (WPE) and minimum variance distortionless  ... 
doi:10.1016/j.csl.2017.02.003 fatcat:jktivy64ondo5nmf3wfkdptj4q
Showing results 1 to 15 out of 136 results