Discovery and fusion of salient multimodal features toward news story segmentation

Winston Hsu, Shih-Fu Chang, Chih-Wei Huang, Lyndon Kennedy, Ching-Yung Lin, Giridharan Iyengar, Minerva M. Yeung, Rainer W. Lienhart, Chung-Sheng Li
2003 Storage and Retrieval Methods and Applications for Multimedia 2004  
In this paper, we present our new results in news video story segmentation and classification in the context of TRECVID video retrieval benchmarking event 2003.  ...  The statistical fusion model is used to automatically discover relevant features contributing to the detection of story boundaries.  ...  We believe multimodality fusion through effective statistical modelling and feature selection are keys to the solutions.  ... 
doi:10.1117/12.533037 dblp:conf/spieSR/HsuCHKLI04 fatcat:7tjh3ixd35gjdacfgipajwggzq

IBM Research TRECVID 2004 Video Retrieval System

Arnon Amir, Janne Argillander, Marco Berg, Shih-Fu Chang, Martin Franz, Winston H. Hsu, Giridharan Iyengar, John R. Kender, Lyndon S. Kennedy, Ching-Yung Lin, Milind R. Naphade, Apostol Natsev (+5 others)
2004 TREC Video Retrieval Evaluation  
We participated in four tasks of the benchmark including shot boundary detection, high-level feature detection, story segmentation, and search.  ...  We describe the different runs we submitted for each track and provide a preliminary analysis of our performance.  ...  of news segmentation and retrieval [25][26] .  ... 
dblp:conf/trecvid/AmirABCFHIKKLNN04 fatcat:hpkfbxgi2vg73nrfwfkvv44k6a

Reranking Methods for Visual Search

Winston H. Hsu, Lyndon S. Kennedy, Shih-Fu Chang
2007 IEEE Multimedia  
Here the authors introduce two reranking processes for image and video search that automatically reorder results from initial text-only searches based on visual features and content similarity.  ...  Most semantic video search methods use text-keyword queries or example video clips and images. But such methods have limitations.  ...  Any opinions, findings, and conclusions or recommendations expressed in this material are ours and don't necessarily reflect the views of the US government.  ... 
doi:10.1109/mmul.2007.61 fatcat:2x7a5gdmhzds3gpdatl2t57umm

Table of Contents

2021 IEEE transactions on multimedia  
Liu and G. Zhao Multimodal Perception, Integration, and Multisensory Fusion Shared Low-Rank Correlation Embedding for Multiple Feature Fusion, by Z. Wang, L. Wang, J. Wan, and H.  ...  Zhu Multimodal Perception, Integration, and Multisensory Fusion Learning Feature Representation and Partial Correlation for Multimodal Multi-Label Data  ... 
doi:10.1109/tmm.2021.3132246 fatcat:el7u2udtybddrpbl5gxkvfricy

2021 Index IEEE Transactions on Multimedia Vol. 23

2021 IEEE transactions on multimedia  
The primary entry includes the coauthors' names, the title of the paper or other item, and its location, specified by the publication abbreviation, year, month, and inclusive pagination.  ...  The Subject Index contains entries describing the item under all appropriate subject headings, plus the first author's name, the publication abbreviation, month, and year, and inclusive pages.  ...  ., +, TMM 2021 3540-3550 Orthogonalization-Guided Feature Fusion Network for Multimodal 2D+3D Facial Expression Recognition.  ... 
doi:10.1109/tmm.2022.3141947 fatcat:lil2nf3vd5ehbfgtslulu7y3lq

Exploring Large-Scale Video News via Interactive Visualization

Hangzai Luo, Jianping Fan, Jing Yang, William Ribarsky, Shin'ichi Satoh
2006 2006 IEEE Symposium On Visual Analytics And Technology  
video clips and visually represented according to their interestingness measurement to help audiences find news stories of interest at first glance.  ...  Our news video visualization system is very useful for security applications and for general audiences to quickly find news topics of interest from among many channels.  ...  Department of Homeland Security Program, under the auspices of the Southeastern Regional Visualization and Analytics Center.  ... 
doi:10.1109/vast.2006.261433 dblp:conf/ieeevast/LuoFYRS06 fatcat:db5ciyaq7zbbnfk5hvzhefr6q4

A Natural and Immersive Virtual Interface for the Surgical Safety Checklist Training

Andrea Ferracani, Daniele Pezzatini, Alberto Del Bimbo
2014 Proceedings of the 2014 ACM International Workshop on Serious Games - SeriousGames '14  
By leveraging big data from billions of search queries, billions of images on the web and from the social networks, and billions of user clicks, we have designed massive machine learning systems to continuously  ...  Since the launch of Bing (www.bing.com) in June 2009, we have seen Bing web search market share in the US more than doubled and Bing image search query share quadrupled.  ...  University of Technology, Austria Location: Palm 5 Local Selection of Features for Image Search and Annotation Jichao Sun Panelist: Arnold Smeulders Segmentation and Indexing of Endoscopic Videos  ... 
doi:10.1145/2656719.2656725 dblp:conf/mm/FerracaniPB14a fatcat:obsb2i4iybhu3dq77hujvjtbze

Image retrieval

Ritendra Datta, Dhiraj Joshi, Jia Li, James Z. Wang
2008 ACM Computing Surveys  
While the last decade laid foundation to such promise, it also paved the way for a large number of new techniques and systems, got many new people involved, and triggered stronger association of weakly  ...  In this paper, we survey almost 300 key theoretical and empirical contributions in the current decade related to image retrieval and automatic image annotation, and discuss the spawning of related sub-fields  ...  Fusion approaches have been found to be beneficial for important video applications such as detection of documentary scene changes [Velivelli et al. 2004 ] and story segmentation [Zhai et al. 2005] .  ... 
doi:10.1145/1348246.1348248 fatcat:5jbcrsxkkbac5cya3zb7eb22ea

Event Mining in Multimedia Streams

Lexing Xie, H. Sundaram, M. Campbell
2008 Proceedings of the IEEE  
The review includes detection of events and actions in one or more continuous sequences, events in edited video streams, unsupervised event discovery, events in a collection of media objects, and a discussion  ...  These problems span a wide range of multimedia domains such as surveillance, meetings, broadcast news, sports, documentary, and films, as well as personal and online media collections.  ...  Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the U.S. Government.  ... 
doi:10.1109/jproc.2008.916362 fatcat:b3utldtbwvehjo4brlnteetdbq

A Survey on Visual Content-Based Video Indexing and Retrieval

Weiming Hu, Nianhua Xie, Li Li, Xianglin Zeng, S. Maybank
2011 IEEE Transactions on Systems Man and Cybernetics Part C (Applications and Reviews)  
boundary detection, key frame extraction and scene segmentation, extraction of features including static key frame features, object features and motion features, video data mining, video annotation, video  ...  Video indexing and retrieval have a wide spectrum of promising applications, motivating the interest of researchers worldwide.  ...  Scene Segmentation Scene segmentation is also known as story unit segmentation. In general, a scene is a group of contiguous shots that are coherent with a certain subject or theme.  ... 
doi:10.1109/tsmcc.2011.2109710 fatcat:qtenus4htffcfbyuiwidgjojku

Front Matter: Volume 11018

Lynne L. Grewe, Erik P. Blasch, Ivan Kadar
2019 Signal Processing, Sensor/Information Fusion, and Target Recognition XXVIII  
Utilization of CIDs allows articles to be fully citable as soon as they are published online, and connects the same identifier to all online and print versions of the publication.  ...  Publication of record for individual papers is online in the SPIE Digital Library. SPIEDigitalLibrary.org Paper Numbering: Proceedings of SPIE follow an e-First publication model.  ...  Acknowledgements The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the  ... 
doi:10.1117/12.2536301 fatcat:njnuw2reafbppn2oj4krhtncx4

TRECVID 2005 - An Overview

Paul Over, Tzveta Ianeva, Wessel Kraaij, Alan F. Smeaton
2005 TREC Video Retrieval Evaluation  
CMU once again donated a set of features for use by other participants. Columbia University donated story boundaries.  ...  Timo Volkmer and others at IBM created and supported the use of a new web-based system for collaborative annotation. CMU made their annotation system available.  ...  For some regional features a new generalized multiple instance learning algorithm was used. Results indicated both hierarchical feature fusion and fusion across approaches are effective techniques.  ... 
dblp:conf/trecvid/OverIKS05 fatcat:czlfoelhrnaxnnhwr74antamqi

Event Detection and Retrieval on Social Media [article]

Manos Schinas, Symeon Papadopoulos, Yiannis Kompatsiaris, Pericles Mitkas
2018 arXiv   pre-print
In the recent years, we have witnessed the rapid adoption of social media platforms, such as Twitter, Facebook and YouTube, and their use as part of the everyday life of billions of people worldwide.  ...  Given the key role of events in our life, the task of annotating and organizing social media content around them is of crucial importance for ensuring real-time and future access to multimedia content  ...  is proposed to supervise the multimodal fusion and clustering procedure.  ... 
arXiv:1807.03675v1 fatcat:x3dgrhj6k5ewbb5gwm3vmi7khm

Women in poetry and comics: multimodal dialogue between John Keats and Edna St. Vincent Millay

Ana Abril Hernández
2020 Cuadernos del Centro de Estudios en Diseño y Comunicación. Ensayos  
The comic form of art has witnessed a dramatic increase in the number of readers who have chosen this medium to deepen into meaning-making processes in multimodal texts.  ...  ' own point of view and also from their corresponding graphic artists' in order to have a look at the changes in the depiction of women in poetry from the Romantic image of women to the view of women in  ...  Hence, the study of the systems of signs in the new revisions in rapport with the original works draw on semiotics to disentangle the multimodal meaning of a work of literature.  ... 
doi:10.18682/cdc.vi123.4403 fatcat:5pvtqsha7fcmrk7beqphvwqphy

Video Skimming

Vivekraj V. K., Debashis Sen, Balasubramanian Raman
2019 ACM Computing Surveys  
We present a taxonomy of video skimming approaches, and discuss their evolution highlighting key advances.  ...  Skimming can be achieved by identifying significant components either in uni-modal or multi-modal features extracted from the video.  ...  The authors would like to thank all the reviewers for their insightful comments through which the quality of this work has been enhanced.  ... 
doi:10.1145/3347712 fatcat:h4zbzmdfx5c2rm3dm4cmmzrsoa