GroupCap: Group-Based Image Captioning with Structured Relevance and Diversity Constraints
2018
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Most image captioning models focus on one-line (single-image) captioning, in which correlations such as relevance and diversity among group images (e.g., within the same album or event) are simply neglected, resulting in less accurate and less diverse captions. Recent works mainly impose diversity only during online inference, neglecting the correlation among visual structures in offline training. In this paper, we propose a novel group-based image captioning scheme (termed
doi:10.1109/cvpr.2018.00146
dblp:conf/cvpr/ChenJSWS18
fatcat:soi74rhcgrdrlk3wmzrunyuxl4