OCR-aided person annotation and label propagation for speaker modeling in TV shows
2016
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
In this paper, we present an approach for minimizing human effort in manual speaker annotation. Label propagation is used at each iteration of an active learning cycle. More precisely, a selection strategy for choosing the most suitable speech track to be labeled is proposed. Four different selection strategies are evaluated and all the tracks in a corresponding cluster are gathered using agglomerative clustering in order to propagate human annotations. To further reduce the manual labor
doi:10.1109/icassp.2016.7472743
dblp:conf/icassp/BudnikBKD16
fatcat:k6a7xeihzfflvg4xkfa52svuri
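
The loop described in the abstract (pick a speech track, ask a human for its label, propagate that label to the rest of its agglomerative cluster) can be illustrated with a minimal sketch. This is not the authors' implementation. The one-dimensional track features, the single-linkage merge rule, the distance threshold, the "largest cluster first" selection strategy, and the names agglomerative_clusters, annotate_with_propagation, and oracle are all assumptions made for illustration only.

```python
from itertools import combinations


def agglomerative_clusters(points, threshold):
    """Single-linkage agglomerative clustering over 1-D features (assumed setup).

    Starts with one cluster per track and repeatedly merges the two closest
    clusters until the smallest inter-cluster distance exceeds `threshold`.
    """
    clusters = [[i] for i in range(len(points))]

    def dist(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(abs(points[i] - points[j]) for i in a for j in b)

    while len(clusters) > 1:
        (x, y), d = min(
            (((a, b), dist(clusters[a], clusters[b]))
             for a, b in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if d > threshold:
            break
        # combinations() guarantees x < y, so deleting y keeps x valid.
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters


def annotate_with_propagation(points, oracle, threshold, budget):
    """Query the (hypothetical) human oracle at most `budget` times.

    Assumed selection strategy: visit clusters from largest to smallest,
    ask for the label of one track, and copy it to every track in the
    same cluster.
    """
    labels = {}
    queries = 0
    for cluster in sorted(agglomerative_clusters(points, threshold),
                          key=len, reverse=True):
        if queries == budget:
            break
        label = oracle(cluster[0])  # one manual annotation
        queries += 1
        for track in cluster:       # label propagation within the cluster
            labels[track] = label
    return labels, queries
```

With six tracks and a budget of two queries, for example, the sketch would label the two largest clusters and leave the remaining track unlabeled; swapping the sort key is one way to try a different selection strategy.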