1 Hit in 2.3 sec

Egoshots, an ego-vision life-logging dataset and semantic fidelity metric to evaluate diversity in image captioning models [article]

Pranav Agarwal, Alejandro Betancourt, Vana Panagiotou, Natalia Díaz-Rodríguez
2020 arXiv   pre-print
Furthermore, in order to evaluate the quality of the generated captions, we propose a new image captioning metric, object based Semantic Fidelity (SF).  ...  In this paper, we attempt to show the biased nature of the currently existing image captioning models and present a new image captioning dataset, Egoshots, consisting of 978 real life images with no captions  ...  We thank the site QuickTurtles 8 for sharing fun images worth feeding to a neural network to stress-test them.  ... 
arXiv:2003.11743v2 fatcat:hcudv5byyzgjjlnnipnwyp74sa