A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Histogram of gradients of Time-Frequency Representations for Audio Scene Detection
2014
IEEE/ACM Transactions on Audio Speech and Language Processing
This paper addresses the problem of audio scenes classification and contributes to the state of the art by proposing a novel feature. We build this feature by considering histogram of gradients (HOG) of an audio scene time-frequency representation. Contrarily to classical audio features like MFCC, we make the hypothesis that histograms of gradients are able to encode some relevant informations in a time-frequency representation: namely, the local direction of variation (in time and frequency)
doi:10.1109/taslp.2014.2375575
fatcat:zru7ljznkjelfi7h7biaen4xve