A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2010; you can also visit the original URL.
The file type is application/pdf
.
Harvesting Image Databases from the Web
2011
IEEE Transactions on Pattern Analysis and Machine Intelligence
The objective of this work 1 is to automatically generate a large number of images for a specified object class (for example, penguin). A multi-modal approach employing both text, meta data and visual features is used to gather many, high-quality images from the web. Candidate images are obtained by a text based web search querying on the object identifier (the word penguin). The web pages and the images they contain are downloaded. The task is then to remove irrelevant images and re-rank the
doi:10.1109/tpami.2010.133
pmid:21330688
fatcat:5bsag2wfmbhgrlilytd5zqxjbq