SAVE: A framework for semantic annotation of visual events

Mun Wai Lee, Asaad Hakeem, Niels Haering, Song-Chun Zhu
2008 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops  
The method involves identifying objects in the scene, describing their inter-relations, detecting events of interest, and representing them semantically in a human readable and query-able format.  ...  , primitives, parts, objects and scenes, and specify their spatio-temporal or compositional relations; and a bottom-up top-down strategy is used for inference.  ...  Attributes such as object class (e.g. car), scene context (e.g. traffic intersection), speed, and time, provide important semantic and contextual information for accurate retrieval and data mining.  ... 
doi:10.1109/cvprw.2008.4562954 dblp:conf/cvpr/LeeHHZ08 fatcat:63q64oofnnggtbvxlgmu6eutum

Geotagging in multimedia and computer vision—a survey

Jiebo Luo, Dhiraj Joshi, Jie Yu, Andrew Gallagher
2010 Multimedia tools and applications  
can benefit from the use of geographical information, and 3) The interplay between modalities and applications.  ...  The presence of geographically relevant metadata with images and videos has opened up interesting research avenues within the multimedia and computer vision domains.  ...  Geotagging Driven Applications in Multimedia and Vision Research Semantic Multimedia Understanding -Events, Scenes, and Objects Annotation, Organization & Retrieval Semantic multimedia understanding  ... 
doi:10.1007/s11042-010-0623-y fatcat:esd7subpbjhntpes6quvngtwti

Semantics and Agents for Intelligent Simulation and Collaboration in the 3D Internet

Matthias Klusch, Xiaoqi Cao, Patrick Kapahnke, Stefan Warwas
2011 2011 Seventh International Conference on Semantics, Knowledge and Grids  
For this purpose, the platform integrates semantic Web technologies, semantic services, intelligent agents, verification and web-based 3D graphics.  ...  In addition, we describe our vision of a scalable web-based multiuser 3DI platform for intelligent semantic-enabled collaboration between multiple users and outline selected research challenges of realizing  ...  ACKNOWLEDGMENT The reported work on the ISReal platform has been funded by the German Ministry for Education and Research (BMBF) under the project grant 01IWO8005.  ... 
doi:10.1109/skg.2011.40 dblp:conf/skg/KluschCKW11 fatcat:yywywr545bgqhbl4ywajuxtrsu

A Review on Intelligent Object Perception Methods Combining Knowledge-based Reasoning and Machine Learning [article]

Filippos Gouidis, Alexandros Vassiliades, Theodore Patkos, Antonis Argyros, Nick Bassiliades, Dimitris Plexousakis
2020 arXiv   pre-print
Object perception is a fundamental sub-field of Computer Vision, covering a multitude of individual areas and having contributed high-impact results.  ...  , their properties and their relations with their environment.  ...  labels and semantic entities (objects and scenes) (Wu et al. 2016b).  ... 
arXiv:1912.11861v2 fatcat:dhjvffblprbonn4xtssj3jzb6q

Visualizing Natural Language Descriptions

Kaveh Hassani, Won-Sook Lee
2016 ACM Computing Surveys  
One of the promising applications of such interfaces is generating visual interpretations of semantic content of a given natural language that can be then visualized either as a static scene or a dynamic  ...  This survey discusses requirements and challenges of developing such systems and reports 26 graphical systems that exploit natural language interfaces and addresses both artificial intelligence and visualization  ...  This system is a multi-agent and data-driven system that utilizes statistical Web content mining techniques for extracting the attribute values of objects such as relative sizes and velocities.  ... 
doi:10.1145/2932710 fatcat:soyygmiqfvb4rlry6qaeqyvm6e

[Invited Paper] A Review of Web Image Mining

Keiji Yanai
2015 ITE Transactions on Media Technology and Applications  
of visual concept database for image/video recognition, (2) Web image application for visual concept analysis and data-driven computer graphics, and (3) real-world sensing through Web images to detect  ...  In this paper, we review works related to big visual data on the Web in the literature of computer vision and multimedia research regarding the following points: (1) Web image acquisition for construction  ...  Do Hang Nga and anonymous reviewers.  ... 
doi:10.3169/mta.3.156 fatcat:gduk25dp7nedvm65xurbwzdu3y

Semantic web-mining and deep vision for lifelong object discovery

Jay Young, Lars Kunze, Valerio Basile, Elena Cabrio, Nick Hawes, Barbara Caputo
2017 2017 IEEE International Conference on Robotics and Automation (ICRA)  
In this work we investigate the problem of unknown object hypotheses generation, and employ a semantic web-mining framework along with deep-learning-based object detectors.  ...  Autonomous robots that are to assist humans in their daily lives must recognize and understand the meaning of objects in their environment.  ...  SEMANTIC WEB-MINING In previous work we developed a Semantic Web-Mining component for robot systems [18].  ... 
doi:10.1109/icra.2017.7989323 dblp:conf/icra/YoungKBCHC17 fatcat:fhm2blocq5eubgt5pw2au3xfoa

Text-to-picture tools, systems, and approaches: a survey

Jezia Zakraoui, Moutaz Saleh, Jihad Al Ja'am
2019 Multimedia tools and applications  
Utkus has been enhanced to operate with ontology to allow loose coupling of the system's components, unifying the interacting objects' representation and behavior, and making possible verification of system  ...  Our survey showed that currently emerging techniques in natural language processing tools and computer vision have made promising advances in analyzing general text and understanding images and videos.  ...  reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.  ... 
doi:10.1007/s11042-019-7541-4 fatcat:7n7gix2qdfgo7jllick5mb6via

Interact as You Intend: Intention-Driven Human-Object Interaction Detection [article]

Bingjie Xu, Junnan Li, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao
2019 arXiv   pre-print
In this work, we focus on detecting human-object interactions (HOIs) in social scene images, which is demanding in terms of research and increasingly useful for practical applications.  ...  Specifically, the proposed human intention-driven HOI detection (iHOI) framework models human pose with the relative distances from body joints to the object instances.  ...  Enabling such applications requires deeper understanding of the scene semantics beyond instance-level understanding.  ... 
arXiv:1808.09796v2 fatcat:rae6hhktzrfjbcj5d4nkeguxem

Cross-media analysis and reasoning: advances and directions

Yu-xin Peng, Wen-wu Zhu, Yao Zhao, Chang-sheng Xu, Qing-ming Huang, Han-qing Lu, Qing-hua Zheng, Tie-jun Huang, Wen Gao
2017 Frontiers of Information Technology & Electronic Engineering  
To address these issues, we provide an overview as follows: (1) theory and model for cross-media uniform representation; (2) cross-media correlation understanding and deep mining; (3) cross-media knowledge  ...  graph construction and learning methodologies; (4) cross-media knowledge evolution and reasoning; (5) cross-media description and generation; (6) cross-media intelligent engines; and (7) cross-media intelligent  ...  Acknowledgements The authors would like to thank Peng CUI, Shi-kui WEI, Ji-tao SANG, Shu-hui WANG, Jing LIU, and Bu-yue QIAN for their valuable discussions and assistance.  ... 
doi:10.1631/fitee.1601787 fatcat:dqnizhdlbfhpvodzkhv5nlarxq

NEIL: Extracting Visual Knowledge from Web Data

Xinlei Chen, Abhinav Shrivastava, Abhinav Gupta
2013 2013 IEEE International Conference on Computer Vision  
As of 10th October 2013, NEIL has been continuously running for 2.5 months on 200 core cluster (more than 350K CPU hours) and has an ontology of 1152 object categories, 1034 scene categories and 87 attributes  ...  During this period, NEIL has discovered more than 1700 relationships and has labeled more than 400K visual instances.  ...  Acknowledgements: This research was supported by ONR MURI N000141010934 and a gift from Google. The authors would like to thank Tom Mitchell and David Fouhey for insightful discussions.  ... 
doi:10.1109/iccv.2013.178 dblp:conf/iccv/ChenSG13 fatcat:qks2a3nkanf5vabuqxehikj7ee

A review of EO image information mining [article]

Marco Quartulli, Igor G. Olaizola
2012 arXiv   pre-print
The solutions envisaged for the issues related to feature simplification and synthesis, indexing, semantic labeling are reviewed. The methodologies for query specification and execution are analyzed.  ...  It builds upon semantic web technologies and combines them with pattern recognition and machine learning techniques to develop a framework for semantics driven retrieval of knowledge from EO data archives  ...  Furthermore, for objectives related to rapid mapping and large-scale scene understanding in connection with large product archives, more effective tools for multi-sensor and multi-temporal EO mining and  ... 
arXiv:1203.0747v2 fatcat:nwiylcsdrnhthi753xcxwxgo7e

I2T: Image Parsing to Text Description

Benjamin Z Yao, Xiong Yang, Liang Lin, Mun Wai Lee, Song-Chun Zhu
2010 Proceedings of the IEEE  
It entails vocabularies of visual elements including pixels, primitives, parts, objects and scenes and a stochastic image grammar specifying compositional, spatial, temporal and functional relations between  ...  representation, in a spirit similar to parsing sentences in speech and natural language. 2) The parse graphs are converted into semantic representation using the Web Ontology Language (OWL) format, which  ...  With this framework, image and video content can be published on the Semantic Web and allowing various semantic mining and inference tools to retrieve, process and analyze video content.  ... 
doi:10.1109/jproc.2010.2050411 fatcat:efostazxbrhghmtctxr6uoqdfu

Data-Driven Shape Analysis and Processing

Kai Xu, Vladimir G. Kim, Qixing Huang, Evangelos Kalogerakis
2016 Computer graphics forum (Print)  
Data-driven methods serve an increasingly important role in discovering geometric, structural and semantic relationships between shapes.  ...  , modelling and exploration, as well as scene analysis and synthesis.  ...  Kai Xu is supported by NSFC (61572507, 61202333 and 61532003).  ... 
doi:10.1111/cgf.12790 fatcat:q76sq2syjvce5fjjjb7yoafww4


F. Poux, P. Hallot, R. Neuville, R. Billen
2016 ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences  
understanding.  ...  A review of feature detection, machine learning frameworks and database systems indexed both for mining queries and data visualisation is studied.  ...  The evolution and expansion to a wider audience is driven mainly through scene understanding where tasks like navigation, grasping or scene manipulation are essential to its applications.  ... 
doi:10.5194/isprs-annals-iv-2-w1-119-2016 fatcat:prtjp6m6gfbs7dfvvot7lx62mu