Exploiting Multiple Web Resources towards Collecting Positive Training Samples for Visual Concept Learning

Olga Papadopoulou, Vasileios Mezaris
2015 Proceedings of the 5th ACM on International Conference on Multimedia Retrieval - ICMR '15  
The number of images uploaded to the web is enormous and is rapidly increasing. The purpose of our work is to use these for acquiring positive training data for visual concept learning. Manually creating training data for visual concept classifiers is an expensive and time consuming task. We propose an approach which automatically collects positive training samples from the Web by constructing a multitude of text queries and retaining for each query only very few top-ranked images returned by
more » ... ch one of the different web image search engines (Google, Flickr and Bing). In this way, we sift the burden of false positive rejection to the Web search engines and directly assemble a rich set of highquality positive training samples. Experiments on forty concepts, evaluated on the ImageNet dataset, show the merit of the proposed approach.
doi:10.1145/2671188.2749338 dblp:conf/mir/PapadopoulouM15 fatcat:wiyknuhhfnenvdb4kay5t4vf3m