A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
An Automatic Speech Recognition solution with speaker identification support
2014
2014 10th International Conference on Communications (COMM)
Automatic Speech Recognition may suffer in terms of intelligibility if the audio recording contains heterogeneous regions of multiple speakers, music or noise. Diarization is the process of segmenting an audio file into homogeneous regions and when used in conjunction with an Automatic Speech Recognition system, it filters out the non-speech audio regions and significantly improves the intelligibility of the recognition output. In this paper, we present an integrated diarization and
doi:10.1109/iccomm.2014.6866674
fatcat:asvf7zuahvbjlkvsqvd4hugisu