Recognizing Music Mood and Theme Using Convolutional Neural Networks and Attention

Alish Dipani, Gaurav Iyer, Veeky Baths
2020 MediaEval Benchmarking Initiative for Multimedia Evaluation  
We present the UAI-CNRL submission to MediaEval 2020 task on Emotion and Theme Recognition in Music. We make use of the ResNet34 architecture, coupled with a self-attention module to detect moods/themes in music tracks. The autotagging-moodtheme subset of the MTG-Jamendo dataset was used to train the model. We show that the proposed model outperforms the provided VGG-ish and popularity baselines.
dblp:conf/mediaeval/DipaniIB20 fatcat:g22qnc5zvzbdvb3ycyotccczfu