A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Neural Network Based Time-Frequency Masking and Steering Vector Estimation for Two-Channel Mvdr Beamforming
2018
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
We present a neural network based approach to two-channel beamforming. First, single-and cross-channel spectral features are extracted to form a feature map for each utterance. A large neural network that is the concatenation of a convolution neural network (CNN), long short-term memory recurrent neural network (LSTM-RNN) and deep neural network (DNN) is then employed to estimate frame-level speech and noise masks. Later, these predicted masks are used to compute cross-power spectral density
doi:10.1109/icassp.2018.8462069
dblp:conf/icassp/LiuGKK18
fatcat:5c4mqnymuncgtcbvyrtk4qdhii