Fast Similarity Search for Learned Metrics

B. Kulis, P. Jain, K. Grauman
2009 IEEE Transactions on Pattern Analysis and Machine Intelligence  
We propose a method to efficiently index into a large database of examples according to a learned metric. Given a collection of examples, we learn a Mahalanobis distance using an information-theoretic metric learning technique that adapts prior knowledge about pairwise distances to incorporate similarity and dissimilarity constraints. To enable sub-linear time similarity search under the learned metric, we show how to encode a learned Mahalanobis parameterization into randomized
more » ... ve hash functions. We further formulate an indirect solution that enables metric learning and hashing for sparse input vector spaces whose high dimensionality make it infeasible to learn an explicit weighting over the feature dimensions. We demonstrate the approach applied to systems and image datasets, and show that our learned metrics improve accuracy relative to commonly-used metric baselines, while our hashing construction permits efficient indexing with a learned distance and very large databases.
doi:10.1109/tpami.2009.151 pmid:19834137 fatcat:hxm7popjxzfj3gpukzn43jufau