A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017. The file type is application/pdf.
Lost in segmentation: Three approaches for speech/non-speech detection in consumer-produced videos
2013
2013 IEEE International Conference on Multimedia and Expo (ICME)
Traditional speech/non-speech segmentation systems have been designed for specific acoustic conditions, such as broadcast news or meetings. However, little research has been done on consumer-produced audio. This type of media is constantly growing and has complex characteristics such as low-quality recordings, environmental noise, and overlapping sounds. This paper discusses an evaluation of three different approaches for speech/non-speech detection on consumer-produced audio. The approaches are
doi:10.1109/icme.2013.6607486
dblp:conf/icmcs/ElizaldeF13
fatcat:wiretpir7re77jxwkhrj4yejna