Video shot boundary detection: Seven years of TRECVid activity

Alan F. Smeaton, Paul Over, Aiden R. Doherty
2010 Computer Vision and Image Understanding  
In this paper we present an overview of the TRECVid shot boundary detection task, a high-level overview of the most significant of the approaches taken, and a comparison of performances, focussing on one  ...  It is a problem which has attracted much attention since video became available in digital form as it is an essential pre-processing step to almost all video analysis, indexing, summarisation, search,  ...  AS and AD are sponsored by Science Foundation Ireland under grant numbers 03/IN.3/I361 and 07/CE/I1147.  ... 
doi:10.1016/j.cviu.2009.03.011 fatcat:w5kluejxi5bfrfrwq7jku7jgda

TechWare: Video-Based Human Action Detection Resources [Best of the Web]

Junsong Yuan, Zicheng Liu
2010 IEEE Signal Processing Magazine  
However, the detected interest points are usually quite sparse, and it is time consuming to extract STIP features for high-resolution videos.  ...  of patients and senior people, medical diagnosis and training, video content analysis and search, and intelligent human computer interaction.  ...  Lyon (dicklyon@ieee.org) is a research scientist at Google, Inc., and a Fellow of the IEEE.  ... 
doi:10.1109/msp.2010.937496 fatcat:56rypucynneohale67doue3avy

Scalable Visual Instance Mining with Threads of Features

Wei Zhang, Hongzhi Li, Chong-Wah Ngo, Shih-Fu Chang
2014 Proceedings of the ACM International Conference on Multimedia - MM '14  
To demonstrate, the datasets are constructed by crawling Youtube videos with keyword  ...  Search task, which includes 30 query instances defined by TRECVID, and 470k reference video clips  ...  ., APriori [1], Eclat [25], and FP-growth [7], have been developed for this purpose.  ...  a “high-level feature for understanding the dataset. People  ... 
doi:10.1145/2647868.2654942 dblp:conf/mm/ZhangLNC14 fatcat:xnqs5uudbffnjltcrm6eiareyq

Classification of Cinematographic Shots Using Lie Algebra and its Application to Complex Event Recognition

Subhabrata Bhattacharya, Ramin Mehran, Rahul Sukthankar, Mubarak Shah
2014 IEEE transactions on multimedia  
In this paper, we propose a discriminative representation of a video shot based on its camera motion and demonstrate how the representation can be used for high level multimedia tasks like complex event  ...  To extract meaningful features from time-series, we propose an efficient linear dynamical system based technique.  ...  Detection of such useful concepts can be used by current video search engines at a later stage to perform high-level content analysis such as detection of events from videos.  ... 
doi:10.1109/tmm.2014.2300833 fatcat:qqqmd5ovhfh35gobqaovuxnwfa

From survey to representation. Operation guidelines [chapter]

2012 Computational Modelling of Objects Represented in Images III  
Skorton, and S. Fleagle (1995). Methods of graph searching for border detection in image sequences with applications to cardiac magnetic resonance imaging.  ...  NHK STRL at TRECVid 2008: High-level feature extraction and surveillance event detection. In Surveillance Event Detection Pilot. http://www-nlpir.nist.gov/projects/trecvid/. Lienhart, R. and J.  ...  University of Central Florida at TRECVid 2008: Content based copy detection and surveillance event detection. In Surveillance Event Detection Pilot. http://www-nlpir.nist.gov/projects/trecvid/.  ... 
doi:10.1201/b12753-91 fatcat:nh3phzgj2rcr3hwvqajsligzoq

Video Description: A Survey of Methods, Datasets and Evaluation Metrics [article]

Nayyer Aafaq, Ajmal Mian, Wei Liu, Syed Zulqarnain Gilani, Mubarak Shah
2019 arXiv pre-print
Analysis of video description models is challenging because it is difficult to ascertain the contributions, towards accuracy or errors, of the visual features and the adopted language model in the final  ...  Video description is the automatic generation of natural language sentences that describe the contents of a given video.  ...  The research was supported by ARC Discovery Grant DP160101458 and DP150102405.  ... 
arXiv:1806.00186v3 fatcat:elxztcpzizhr7clugnbjvvrpte

Visual Tracking: An Experimental Survey

2014 IEEE Transactions on Pattern Analysis and Machine Intelligence  
A good tracker should perform well in a large number of videos involving illumination changes, occlusion, clutter, camera motion, low contrast, specularities and at least six more aspects.  ...  In this paper, we aim to evaluate trackers systematically and experimentally on 315 video fragments covering above aspects.  ...  Mubarak Shah Agere Chair Professor of Computer Science, is the founding director of the Computer Vision Lab at the University of Central Florida.  ... 
doi:10.1109/tpami.2013.230 pmid:26353314 fatcat:mw4q3amsfzdtjgzpooq2sjubpu

Human-centered computing

Alejandro Jaimes, Nicu Sebe, Daniel Gatica-Perez
2006 Proceedings of the 14th annual ACM international conference on Multimedia - MULTIMEDIA '06  
In addition, we identify the core characteristics of HCM, describe example applications, and propose a research agenda for HCM.  ...  We describe what we consider to be the three main areas of Human-Centered Multimedia (HCM): media production, analysis, and interaction.  ...  The work of Nicu Sebe was partially supported by the Muscle NoE and MIAUCE projects. D. Gatica-Perez acknowledges support by the EU AMI and Swiss IM2 projects.  ... 
doi:10.1145/1180639.1180829 dblp:conf/mm/JaimesSG06 fatcat:pyxuhio3cnawxji66vkkzxkm5e