Location Sensitive Image Retrieval and Tagging [article]

Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas
2020 arXiv pre-print
People from different parts of the globe describe objects and concepts in distinct manners. Visual appearance can thus vary across different geographic locations, which makes location relevant contextual information when analysing visual data. In this work, we address the task of image retrieval related to a given tag conditioned on a certain location on Earth. We present LocSens, a model that learns to rank triplets of images, tags and coordinates by plausibility, and two training strategies to balance the location influence in the final ranking. LocSens learns to fuse textual and location information of multimodal queries to retrieve related images at different levels of location granularity, and successfully utilizes location information to improve image tagging.
arXiv:2007.03375v1 fatcat:5yd2fzjflzbrdnkd4ew2f3pqdi