A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Local spatiotemporal descriptors for visual recognition of spoken phrases
2007
Proceedings of the international workshop on Human-centered multimedia - HCM '07
Visual speech information plays an important role in speech recognition under noisy conditions or for listeners with hearing impairment. In this paper, we propose local spatiotemporal descriptors to represent and recognize spoken isolated phrases based solely on visual input. Positions of the eyes determined by a robust face and eye detector are used for localizing the mouth regions in face images. Spatiotemporal local binary patterns extracted from these regions are used for describing phrase
doi:10.1145/1290128.1290138
dblp:conf/mm/ZhaoPH07
fatcat:uooot3dn35a7faffo2e34betii