A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Audio-visual Keyword Spotting for Mandarin Based on Discriminative Local Spatial-Temporal Descriptors
2014
2014 22nd International Conference on Pattern Recognition
Although keyword spotting (KWS) technologies have been successfully applied to some applications, most KWS systems have a common problem of noise-robustness when applied to real-world environments. Audio-visual keyword spotting (AVKWS) using both acoustic and visual information is a solution to complementarily solve the problem. Most existing audio-visual speech recognition (AVSR) systems extract geometric features as visual features, which heavily rely on accurate and reliable detection and
doi:10.1109/icpr.2014.145
dblp:conf/icpr/LiuFW14
fatcat:a57xfmsmobchjf6264pe7ocv5m