A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2013; you can also visit the original URL.
The file type is application/pdf
.
Blind Audiovisual Source Separation Based on Sparse Redundant Representations
2010
IEEE transactions on multimedia
In this paper we propose a novel method which is able to detect and separate audio-visual sources present in a scene. Our method exploits the correlation between the video signal captured with a camera and a synchronously recorded one-microphone audio track. In a first stage, audio and video modalities are decomposed into relevant basic structures using redundant representations. Next, synchrony between relevant events in audio and video modalities is quantified. Based on this co-occurrence
doi:10.1109/tmm.2010.2050650
fatcat:rusd73kyvjfbre6lgup4i366xe