A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Enhancing Music Information Retrieval by Incorporating Image-Based Local Features
[chapter]
2012
Lecture Notes in Computer Science
This paper presents a novel approach to music genre classification. Having represented music tracks in the form of two dimensional images, we apply the "bag of visual words" method from visual IR in order to classify the songs into 19 genres. By switching to visual domain, we can abstract from musical concepts such as melody, timbre and rhythm. We obtained classification accuracy of 46% (with 5% theoretical baseline for random classification) which is comparable with existing state-of-the-art
doi:10.1007/978-3-642-35341-3_19
fatcat:54efxhkxqzcy5iob6fy5uq6nbm