A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
CNN Based Query by Example Spoken Term Detection
2018
Interspeech 2018
In this work, we address the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State of the art solutions usually rely on dynamic time warping (DTW) based template matching. In contrast, we propose here to tackle the problem as binary classification of images. Similar to the DTW approach, we rely on deep neural network (DNN) based posterior probabilities as feature vectors. The posteriors from a spoken query and a test utterance are used to compute
doi:10.21437/interspeech.2018-1722
dblp:conf/interspeech/RamWB18
fatcat:trvuoqytivg3tp2viisnll3f2y