A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Speaker Detection and Applications to Cross-Modal Analysis of Planning Meetings
2009
2009 11th IEEE International Symposium on Multimedia
Detection of meeting events is one of the most important tasks in multimodal analysis of planning meetings. Speaker detection is a key step for extraction of most meaningful meeting events. In this paper, we present an approach of speaker localization using combination of visual and audio information in multimodal meeting analysis. When talking, people make a speech accompanying mouth movements and hand gestures. By computing correlation of audio signals, mouth movements, and hand motion, we
doi:10.1109/ism.2009.66
dblp:conf/ism/FangXQ09
fatcat:dhk4spssorf2xcbdq7gbosxhzy