A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2007; you can also visit the original URL.
The file type is
Near-duplicate images introduce problems of redundancy and copyright infringement in large image collections. The problem is acute on the web, where appropriation of images without acknowledgment of source is prevalent. In this paper, we present an effective clustering approach for nearduplicate images, using a combination of techniques from invariant image local descriptors and an adaptation of nearduplicate text-document clustering techniques; we extend our earlier approach of near-duplicatedoi:10.1145/1290082.1290089 dblp:conf/mir/FooZS07 fatcat:4wr6hqbxrfdcxjulnpcffy2pxm