Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors
[article]
2021
arXiv
pre-print
Recently, the convolutional weighted power minimization distortionless response (WPD) beamformer was proposed, which unifies multi-channel weighted prediction error dereverberation and minimum power distortionless ...
response beamforming. ...
Second, to achieve dereverberation, the so-called weighted prediction error (WPE) technique is commonly applied in the short-time Fourier transform (STFT) domain [12] [13] [14]. ...
arXiv:2106.01902v1
fatcat:vellxiusmzblpmiiryetjdfsbq
Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments
2020
Sensors
Next, the STFT coefficients of the dereverberated multi-channel audio signals are conveyed to the DNN-supported minimum variance distortionless response (MVDR) beamformer in which DNN-supported MVDR beamforming ...
prediction error (WPE) dereverberation with the estimated masks. ...
WPE, weighted prediction error; MVDR, minimum variance distortionless response; SED, sound event detection.
doi:10.3390/s20071883
pmid:32231161
fatcat:pxjtoka2y5hcjh2fdbwfeqawyq
Spatial Processing Front-End For Distant ASR Exploiting Self-Attention Channel Combinator
[article]
2022
arXiv
pre-print
We present a novel multi-channel front-end based on channel shortening with the Weighted Prediction Error (WPE) method followed by a fixed MVDR beamformer used in combination with a recently proposed self-attention-based ...
channel combination (SACC) scheme, for tackling the distant ASR problem. ...
combination. ...
arXiv:2203.13919v1
fatcat:4wqksvsgj5anrgdybqtpfm2dji
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
[article]
2020
arXiv
pre-print
First, a multi-source mask-based weighted prediction error (WPE) module is incorporated in the frontend for dereverberation. ...
Second, another novel frontend architecture is proposed, which extends the weighted power minimization distortionless response (WPD) convolutional beamformer to perform simultaneous separation and dereverberation ...
variance distortionless response (MVDR) and minimum power distortionless response (MPDR) beamforming [12, 13], etc. ...
arXiv:2005.10479v2
fatcat:yxjujwvewfdhnl3mwlf4w6zavy
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming
2020
Interspeech 2020
First, a multisource mask-based weighted prediction error (WPE) module is incorporated in the frontend for dereverberation. ...
Second, another novel frontend architecture is proposed, which extends the weighted power minimization distortionless response (WPD) convolutional beamformer to perform simultaneous separation and dereverberation ...
variance distortionless response (MVDR) and minimum power distortionless response (MPDR) beamforming [12, 13], multi-frame beamforming ...
† Shinji Watanabe and Yanmin Qian are the corresponding authors.
doi:10.21437/interspeech.2020-2432
dblp:conf/interspeech/ZhangSC0Q20
fatcat:bnylkmimlfe4fngtgepqjf7eym
Jointly optimal denoising, dereverberation, and source separation
[article]
2020
arXiv
pre-print
Conventionally, cascade configuration composed of a Weighted Prediction Error minimization (WPE) dereverberation filter followed by a Minimum Variance Distortionless Response beamformer has been used as ...
This paper refers to a CBF optimized by this objective function as a weighted Minimum-Power Distortionless Response (wMPDR) CBF. ...
Conventionally, cascade configuration composed of a Weighted Prediction Error minimization (WPE) dereverberation filter followed by a Minimum Variance Distortionless Response (MVDR) beamformer has been ...
arXiv:2005.09843v2
fatcat:bdhveyic2jbnhdyuuzxyano4oa
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation
2018
Interspeech 2018
The weighted prediction error (WPE) algorithm has proven to be a very successful dereverberation method for the REVERB challenge. ...
For these integrated variants we identify a consistent word error rate (WER) reduction on two distinct databases. ...
Delcroix et al. compare consecutive execution of either minimum variance distortionless response (MVDR) beamforming followed by WPE or vice versa [15] on REVERB challenge data [16] but do not consider ...
doi:10.21437/interspeech.2018-2196
dblp:conf/interspeech/DrudeBHHKDN18
fatcat:dzqaxz2wvjaltoy6tbjyc5in3q
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
[article]
2021
arXiv
pre-print
In this work, we focus on the multichannel multi-speaker reverberant condition, and propose to extend our previous framework for end-to-end dereverberation, beamforming, and speech recognition with improved ...
speech dereverberation and separation performance (SDR=12.5 dB) in the reverberant multi-speaker condition while trained only with the ASR criterion. ...
In addition, we mainly support two alternative beamformer types, respectively, based on 1) minimum variance distortionless response (MVDR) [28] and 2) wMPDR. ...
arXiv:2102.11525v1
fatcat:krumqa2svbegrpbqcsa6svh4ea
An Effective Dereverberation Algorithm by Fusing MVDR and MCLP
[article]
2022
arXiv
pre-print
In this paper, minimum variance distortionless response (MVDR) beamformer and MCLP are effectively fused in the dereverberation, where the PSD of target speech used for Kalman filter is modified in the ...
In order to solve this problem, many methods for the dereverberation have emerged. ...
Minimum variance distortionless response (MVDR) beamforming [8] requires the estimation of covariance matrix of the noise and the steering vector of target signal relative to the reference microphone ...
arXiv:2203.14561v1
fatcat:y6xggffbkrah3mosgeyxr2gyfa
Deep Learning Applied to Dereverberation and Sound Event Classification in Reverberant Environments
2019
Proceedings of the ICA congress
filter (MWF), and the variance-normalized delayed linear prediction (NDLP). ...
A room response simulator based on the image source method is employed to create reverberant signals for numerous RT60 conditions in the training phase. ...
Alternatively, the Multichannel Wiener Filter (MWF) (6) which consists of a Minimum Variance Distortionless Response (MVDR) cascaded with a single channel post filter can be used. ...
doi:10.18154/rwth-conv-238800
fatcat:sgkqh55sfrc53mjfwl7b2va3je
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence
[article]
2015
arXiv
pre-print
In the case of multiple microphones, we preprocess the data with either a minimum variance distortionless response (MVDR) beamformer, or a delay-and-sum beamformer (DSB). ...
We evaluate our algorithm on both speech enhancement and recognition tasks for the REVERB challenge dataset. ...
Acknowledgments We wish to thank Derek Huang for his help with the Kaldi tools. This work is funded by ONR contract N00014-12-G-0078, delivery order 0013, and ARO grant number W911NF1210277. ...
arXiv:1509.00533v1
fatcat:bg2w4swpunettbf677xcldo7qm
New Insights Into the MVDR Beamformer in Room Acoustics
2010
IEEE Transactions on Audio, Speech, and Language Processing
The minimum variance distortionless response (MVDR) beamformer, also known as Capon's beamformer, is widely studied in the area of speech enhancement. ...
The MVDR beamformer can be used for both speech dereverberation and noise reduction. This paper provides new insights into the MVDR beamformer. ...
MINIMUM VARIANCE DISTORTIONLESS RESPONSE BEAMFORMER We now derive the celebrated MVDR beamformer proposed by Capon [3] in the context of room acoustics. ...
doi:10.1109/tasl.2009.2024731
fatcat:vl7ek4ezwra6df2dhzlnjwnfgy
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness
[article]
2022
arXiv
pre-print
In this paper, we present a multi-channel ConvMixer for speech command recognitions. ...
In addition, a far-field and noisy environment with multiple signals interference aggravates the problem causing the accuracy to degrade significantly. ...
Acknowledgements This work was supported by Alibaba Group through Alibaba Innovative Research (AIR) Program and Alibaba-NTU Singapore Joint Research Institute (JRI), Nanyang Technological University, Singapore ...
arXiv:2204.05445v1
fatcat:2a2eygx3kvbslawgryp3zw2mfe
An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions
[article]
2019
arXiv
pre-print
This report uses a recently developed architecture for far-field ASR by composing neural extensions of dereverberation and beamforming modules with the S2S ASR module as a single differentiable neural ...
It is clear from both recent challenge outcomes and successful products that far-field systems would be incomplete without solving both denoising and dereverberation simultaneously. ...
Weighted prediction error (WPE) [24, 25] is a technique based on variance normalized long term linear prediction popularly used for dereverberation of wet (reverberant) signals. ...
arXiv:1904.09049v3
fatcat:qut2jhht7bafpkfbuo5iwhzjde
A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions
2017
Computer Speech and Language
Our results indicate that performing weighted prediction error (WPE) dereverberation on a reverberated test speech utterance and decoding using a deep neural network (DNN) acoustic model trained with ...
A combined evaluation of established and new approaches for speech recognition in varied reverberation conditions. Computer Speech and Language, Elsevier, 2017, 46, pp. ...
Delcroix et al. (2015) detailed different techniques to counter reverberation using the REVERB dataset such as dereverberation using weighted prediction error (WPE) and minimum variance distortionless ...
doi:10.1016/j.csl.2017.02.003
fatcat:jktivy64ondo5nmf3wfkdptj4q