Incorporating Concept Ontology for Hierarchical Video Classification, Annotation, and Visualization

Jianping Fan, Hangzai Luo, Yuli Gao, R. Jain
2007 IEEE transactions on multimedia  
Most existing content-based video retrieval (CBVR) systems are now amenable to support automatic low-level feature extraction, but they still have limited effectiveness from a user's perspective because of the semantic gap. Automatic video concept detection via semantic classification is one promising solution to bridge the semantic gap. To speed up SVM video classifier training in high-dimensional heterogeneous feature space, a novel multimodal boosting algorithm is proposed by incorporating
more » ... ature hierarchy and boosting to reduce both the training cost and the size of training samples significantly. To avoid the inter-level error transmission problem, a novel hierarchical boosting scheme is proposed by incorporating concept ontology and multitask learning to boost hierarchical video classifier training through exploiting the strong correlations between the video concepts. To bridge the semantic gap between the available video concepts and the users' real needs, a novel hyperbolic visualization framework is seamlessly incorporated to enable intuitive query specification and evaluation by acquainting the users with a good global view of large-scale video collections. Our experiments in one specific domain of surgery education videos have also provided very convincing results. Index Terms-Concept ontology, hierarchical boosting, hyperbolic visualization, multimodal boosting, multitask learning, semantic gap, video classification and annotation.
doi:10.1109/tmm.2007.900143 fatcat:ts2cgdhakreanildysxdmcbncm