A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Deep Learning for Video Captioning: A Review
2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence
Deep learning has achieved great successes in solving specific artificial intelligence problems recently. Substantial progresses are made on Computer Vision (CV) and Natural Language Processing (NLP). As a connection between the two worlds of vision and language, video captioning is the task of producing a natural-language utterance (usually a sentence) that describes the visual content of a video. The task is naturally decomposed into two sub-tasks. One is to encode a video via a thorough
doi:10.24963/ijcai.2019/877
dblp:conf/ijcai/ChenYJ19
fatcat:3xxssrzqjjd5jbvtgkkp5lw7xa