Putting active learning into multimedia applications

Ming-yu Chen, Michael Christel, Alexander Hauptmann, Howard Wactlar
2005 Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA '05  
The authors developed an extensible system for video exploitation that puts the user in control to better accommodate novel situations and source material. Visually dense displays of thumbnail imagery in storyboard views are used for shot-based video exploration and retrieval. The user can identify a need for a class of audiovisual detection, adeptly and fluently supply training material for that class, and iteratively evaluate and improve the resulting automatic classification produced via
more » ... iple modality active learning and SVM. By iteratively reviewing the output of the classifier and updating the positive and negative training samples with less effort than typical for relevance feedback systems, the user can play an active role in directing the classification process while still needing to truth only a very small percentage of the multimedia data set. Examples are given illustrating the iterative creation of a classifier for a concept of interest to be included in subsequent investigations, and for a concept typically deemed irrelevant to be weeded out in follow-up queries. Filtering and browsing tools making use of existing and iteratively added concepts put the user further in control of the multimedia browsing and retrieval process.
doi:10.1145/1101149.1101342 dblp:conf/mm/ChenCHW05 fatcat:yiqc6355azatpfgte2fsuvdnpi