Massive-scale multimedia semantic modeling

John R. Smith, Liangliang Cao
2013 Proceedings of the 21st ACM international conference on Multimedia - MM '13  
Visual data is exploding! 500 billion consumer photos are taken each year worldwide, and 633 million photos are taken per year in NYC alone. 120 new video-hours are uploaded to YouTube per minute. The explosion of digital multimedia data is creating a valuable open source for insights. However, the unconstrained nature of "image/video in the wild" makes automated computer-based analysis very challenging. Furthermore, the most interesting content in the multimedia files is often complex in nature,
reflecting a diversity of human behaviors, scenes, activities and events. To address these challenges, this tutorial will provide a unified overview of two emerging techniques, semantic modeling and massive-scale visual recognition, with the goal of both introducing people from different backgrounds to this exciting field and reviewing state-of-the-art research in the new computational era.
doi:10.1145/2502081.2502235 dblp:conf/mm/SmithC13a fatcat:gvhxnztvnvakpl4pmizqwox7di