A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
An integrated framework for multi-channel multi-source localization and voice activity detection
2011
2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays
Two of the major challenges in microphone array based adaptive beamforming, speech enhancement and distant speech recognition, are robust and accurate source localization and voice activity detection. This paper introduces a spatial gradient steered response power using the phase transform (SRP-PHAT) method which is capable of localization of competing speakers in overlapping conditions. We further investigate the behavior of the SRP function and characterize theoretically a fixed point in its
doi:10.1109/hscma.2011.5942417
fatcat:74cbsxriozgr7hb7js4dxdj3gq