Large Scale Near-Duplicate Image Retrieval via Patch Embedding

Shangpeng Yan, Xiaoyun Zhang, Li Chen, Wenbo Bao, Zhiyong Gao
2019 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)  
Large scale near-duplicate image retrieval (NDIR) relies on the Bag-of-Words methodology which quantizes local features into visual words. However the direct match of these visual words typically leads to unpleasant mismatches due to quantization errors. To enhance the discriminability of the matching process, existing methods usually exploit hand-crafted contextual information, which have limited performance in complicated real-world scenarios. In contrast, we in this paper propose a trainable
more » ... lightweight embedding network to extract local binary features. The network takes image patches as inputs and generates the binary code that can be efficiently stored in the inverted indexing file and helps discard mismatches immediately during the retrieval process. We improve the discriminability of the code by elaborately composing the training patches for network optimization, which consists of a proper interclass (non-duplicate) patches selection and a rich intraclass (near-duplicate) patches generation. We evaluate our approach on the open NDIR dataset, INRIA CopyDays, and the experimental results show that our method performs favorably against the state-of-the-art algorithms. Furthermore, with a relatively short code length, our approach achieves higher query speed and lower storage occupation.
doi:10.1109/iccvw.2019.00359 dblp:conf/iccvw/YanZCBG19 fatcat:hxz6tzvh6bbolfukecclzmw4p4