Learning for efficient retrieval of structured data with noisy queries

Charles Parker, Alan Fern, Prasad Tadepalli
2007 Proceedings of the 24th international conference on Machine learning - ICML '07  
Increasingly large collections of structured data necessitate the development of efficient, noise-tolerant retrieval tools. In this work, we consider this issue and describe an approach to learn a similarity function that is not only accurate, but that also increases the effectiveness of retrieval data structures. We present an algorithm that uses functional gradient boosting to maximize both retrieval accuracy and the retrieval efficiency of vantage point trees. We demonstrate the
more » ... of our approach on two datasets, including a moderately sized real-world dataset of folk music.
doi:10.1145/1273496.1273588 dblp:conf/icml/ParkerFT07 fatcat:ljf6h5dz6ncnvpfgmu3fhx43ky