Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling
2015
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval - ICMR '15
This paper presents advances in analyzing audio content information to detect events in videos, such as a parade or a birthday party. We developed a set of tools for audio processing within the predominantly vision-focused deep neural network (DNN) framework Caffe. Using these tools, we show, for the first time, the potential of using only a DNN for audio-based multimedia event detection. Training DNNs for event detection using the entire audio track from each video causes a computational
doi:10.1145/2671188.2749396
dblp:conf/mir/AshrafEIMBFK15
fatcat:tjfd4zwz5zb5dk3m24lmlvmmq4