Audio segmentation-by-classification approach based on factor analysis in broadcast news domain

Diego Castán, Alfonso Ortega, Antonio Miguel, Eduardo Lleida
2014 EURASIP Journal on Audio, Speech, and Music Processing  
This paper studies a novel audio segmentation-by-classification approach based on factor analysis. The proposed technique compensates the within-class variability by using class-dependent factor loading matrices and obtains the scores by computing the log-likelihood ratio for the class model to a non-class model over fixed-length windows. Afterwards, these scores are smoothed to yield longer contiguous segments of the same class by means of different back-end systems. Unlike previous solutions,
more » ... previous solutions, our proposal does not make use of specific acoustic features and does not need a hierarchical structure. The proposed method is applied to segment and classify audios coming from TV shows into five different acoustic classes: speech, music, speech with music, speech with noise, and others. The technique is compared to a hierarchical system with specific acoustic features achieving a significant error reduction.
doi:10.1186/preaccept-1330210582123399 fatcat:i7orhxn4rjcv7iyee2r4yfuyou