Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Group Conversations

Oya Aran, Daniel Gatica-Perez
2010 2010 20th International Conference on Pattern Recognition  
This paper addresses the multimodal nature of social dominance and presents multimodal fusion techniques to combine audio and visual nonverbal cues for dominance estimation in small group conversations. We combine the two modalities both at the feature extraction level and at the classifier level via score and rank level fusion. The classification is done by a simple rulebased estimator. We perform experiments on a new 10hour dataset derived from the popular AMI meeting corpus. We objectively
more » ... aluate the performance of each modality and each cue alone and in combination. Our results show that the combination of audio and visual cues is necessary to achieve the best performance.
doi:10.1109/icpr.2010.898 dblp:conf/icpr/AranG10 fatcat:hoghxsdyk5hexorurqnsyiwz2a