An audio-video front-end for multimedia applications

D. Zotkin, R. Duraiswami, L. Davis, I. Haritaoglu
SMC 2000 Conference Proceedings. 2000 IEEE International Conference on Systems, Man and Cybernetics. 'Cybernetics Evolving to Systems, Humans, Organizations, and their Complex Interactions' (Cat. No.00CH37166)  
Applications such as video gaming, virtual reality, multimodal user interfaces, and video conferencing, require systems that can locate and track persons in a room through a combination of visual and audio cues, enhance the sound that they produce, and perform identi¿cation. We describe the development of a particular multimodal sensor fusion system that is portable, runs in real time and achieves these objectives. The system employs novel algorithms for acoustical source location, video-based
more » ... erson tracking, and overall system control, which are also described.
doi:10.1109/icsmc.2000.885945 dblp:conf/smc/ZotkinDDH00 fatcat:rywlmoibtbh6zevv22ormuzscq