A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
VoiceFilter-Lite: Streaming Targeted Voice Separation for On-Device Speech Recognition
2020
Interspeech 2020
We introduce VoiceFilter-Lite, a single-channel source separation model that runs on the device to preserve only the speech signals from a target user, as part of a streaming speech recognition system. Delivering such a model presents numerous challenges: It should improve the performance when the input signal consists of overlapped speech, and must not hurt the speech recognition performance under all other acoustic conditions. Besides, this model must be tiny, fast, and perform inference in a
doi:10.21437/interspeech.2020-1193
dblp:conf/interspeech/WangLSWCLHLPNG20
fatcat:7bi4ldkrujg4pekpqllu4x6fpi