A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning
2020
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Cross-modal retrieval between videos and texts has attracted growing attentions due to the rapid emergence of videos on the web. The current dominant approach is to learn a joint embedding space to measure cross-modal similarities. However, simple embeddings are insufficient to represent complicated visual and textual details, such as scenes, objects, actions and their compositions. To improve fine-grained video-text retrieval, we propose a Hierarchical Graph Reasoning (HGR) model, which
doi:10.1109/cvpr42600.2020.01065
dblp:conf/cvpr/ChenZJW20
fatcat:brlrtsp7lre7bne7cm37esr5wu