A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2016; you can also visit the original URL.
The file type is application/pdf
.
VLAD3: Encoding Dynamics of Deep Features for Action Recognition
2016
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Previous approaches to action recognition with deep features tend to process video frames only within a small temporal region, and do not model long-range dynamic information explicitly. However, such information is important for the accurate recognition of actions, especially for the discrimination of complex activities that share sub-actions, and when dealing with untrimmed videos. Here, we propose a representation, VLAD for Deep Dynamics (VLAD 3 ), that accounts for different levels of video
doi:10.1109/cvpr.2016.215
dblp:conf/cvpr/LiLMV16
fatcat:25edszc4gbcy3pddlnmvcwf4la