A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
The gesturer is the speaker
2013
2013 IEEE International Conference on Acoustics, Speech and Signal Processing
We present and solve the speaker diarization problem in a novel way. We hypothesize that the gesturer is the speaker and that identifying the gesturer can be taken as identifying the active speaker. We provide evidence in support of the hy pothesis from gesture literature and audio-visual synchrony studies. We also present a vision-only diarization algorithm that relies on gestures (i.e. upper body movements). Experi ments carried out on 8.9 hours of a publicly available dataset (the AMI
doi:10.1109/icassp.2013.6638359
dblp:conf/icassp/GebreWH13
fatcat:jrzm3jlqeferrok434co3h4kfa