A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Webly Supervised Knowledge Embedding Model for Visual Reasoning
2020
2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Visual reasoning between visual image and natural language description is a long-standing challenge in computer vision. While recent approaches offer a great promise by compositionality or relational computing, most of them are oppressed by the challenge of training with datasets containing only a limited number of images with ground-truth texts. Besides, it is extremely time-consuming and difficult to build a larger dataset by annotating millions of images with text descriptions that may very
doi:10.1109/cvpr42600.2020.01246
dblp:conf/cvpr/ZhengYG020
fatcat:bh3fesjp4vg43ewkoiaomehutq