Sparse similarity matrix learning for visual object retrieval

Zhicheng Yan, Yizhou Yu
2013 The 2013 International Joint Conference on Neural Networks (IJCNN)  
Tf-idf weighting scheme is adopted by state-ofthe-art object retrieval systems to reflect the difference in discriminability between visual words. However, we argue it is only suboptimal by noting that tf-idf weighting scheme does not take quantization error into account and exploit word correlation. We view tf-idf weights as an example of diagonal Mahalanobis-type similarity matrix and generalize it into a sparse one by selectively activating off-diagonal elements. Our goal is to separate
more » ... arity of relevant images from that of irrelevant ones by a safe margin. We satisfy such similarity constraints by learning an optimal similarity metric from labeled data. An effective scheme is developed to collect training data with an emphasis on cases where the tf-idf weights violates the relative relevance constraints. Experimental results on benchmark datasets indicate the learnt similarity metric consistently and significantly outperforms the tf-idf weighting scheme.
doi:10.1109/ijcnn.2013.6707063 dblp:conf/ijcnn/YanY13 fatcat:uxfjeirhpva2reedkbi7bdhxca