A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Audio Signal Representations for Indexing in the Transform Domain
2010
IEEE Transactions on Audio, Speech, and Language Processing
Indexing audio signals directly in the transform domain can potentially save a significant amount of computation when working on a large database of signals stored in a lossy compression format, without having to fully decode the signals. Here, we show that the representations used in standard transform-based audio codecs (e.g. MDCT for AAC, or hybrid PQF/MDCT for MP3) have a sufficient time resolution for some rhythmic features, but a poor frequency resolution, which prevents their use in
doi:10.1109/tasl.2009.2025099
fatcat:56n3nzdzjzaqjjfisnckmzrw3y