A general framework for temporal video scene segmentation

Yun Zhai, M. Shah
2005 Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1  
Videos are composed of many shots caused by different camera operations, e.g., on/off operations and switching between cameras. One important goal in video analysis is to group the shots into temporal scenes, such that all the shots in a single scene are related to a particular physical setting, an on-going action or a theme. In this paper, we present a general framework for temporal scene segmentation for various video types. The proposed method is formulated in a statistical fashion and uses
more » ... he Markov chain Monte Carlo (MCMC) technique to determine the boundaries between video scenes. In this approach, an arbitrary number of scene boundaries are randomly initialized and automatically updated using two types of updates: diffuse and jumps. The posterior probability on the number of scenes and their boundary locations is computed based on the model priors and the data likelihood. The updates of the model parameters are controlled by the hypothesis ratio test in the MCMC process. The proposed framework has been experimented on two types of videos, home videos and feature films, and accurate results have been obtained.
doi:10.1109/iccv.2005.6 dblp:conf/iccv/ZhaiS05 fatcat:bd5kf6uwovf5jk2sofrawxu6iu