The scalability, as well as the effectiveness, of the different Content-based Image Retrieval (CBIR) approaches proposed in the literature is today an important research issue. Given the wealth of images on the Web, CBIR systems must in fact leap towards Web-scale datasets. In this paper, we report on our experience in building a test collection of 100 million images, with the corresponding descriptive features, to be used in experimenting with new scalable techniques for similarity searching, and
arXiv:0905.4627v2
fatcat:h6myccbt6vh2rasga7lmmqvzt4