
A Multimodal Information Collector for Content-Based Image Retrieval System [chapter]

He Zhang, Mats Sjöberg, Jorma Laaksonen, Erkki Oja
2011 Lecture Notes in Computer Science  
Explicit relevance feedback requires the user to explicitly refine the search queries for content-based image retrieval.  ...  We present a multimodal information collector that can unobtrusively record and asynchronously transmit the user's implicit relevance feedback on a displayed image to the remote CBIR server for assisting  ...  Gaze-Enhanced Content-Based Image Retrieval PicSOM CBIR System We have integrated the multimodal information collector with an existing CBIR server named PicSOM 3 [10], which is a content-based image  ... 
doi:10.1007/978-3-642-24965-5_83 fatcat:uuzgpuwg5rfyflsdgenchxpj3a

Enabling multimodal interaction in web-based personal digital photo browsing

N.A. Ismail, E. A. O'Brien
2008 2008 International Conference on Computer and Communication Engineering  
In this paper, we describe an interactive web-based photo retrieval system that enables personal digital photo users to accomplish photo browsing by using multimodal interaction.  ...  Our approach also consists of human-computer speech dialogue based on photo browsing of image content by four main categories (Who? What? When? and Where?).  ...  Some of these systems provide browsing, free text searching and even a range of limited visual content based retrieval.  ... 
doi:10.1109/iccce.2008.4580737 fatcat:psid6eeulrfhldcjum6b4ounla

Understanding and Creating Art with AI: Review and Outlook [article]

Eva Cetinic, James She
2021 arXiv   pre-print
, similarity retrieval, multimodal representations, computational aesthetics, etc.  ...  In the context of AI-related research for art understanding, we present a comprehensive overview of artwork datasets and recent works that address a variety of tasks such as classification, object detection  ...  Many of the existing retrieval systems rely on retrieving images based on their corresponding metadata and textual descriptions.  ... 
arXiv:2102.09109v1 fatcat:ztuhedmgqrbdziwogxjq46spzq

The VoiceApp System: Speech Technologies to Access the Semantic Web [chapter]

David Griol, José Manuel Molina, Víctor Corrales
2011 Lecture Notes in Computer Science  
whereas Voice Browser provides a fast and effective multimodal interface to the Google web search engine.  ...  In this paper we present the VoiceApp multimodal dialog system, which enables to access and browse Internet by means of speech.  ...  This approach may acquire additional complexity in the case of Information Retrieval and Question Answering systems, such as in [9].  ... 
doi:10.1007/978-3-642-25274-7_40 fatcat:hv3t6sghyff67ghg4ba32bakri

The participation payoff

Naeem Ramzan, Martha Larson, Frédéric Dufaux, Kai Clüver
2010 Proceedings of the international conference on Multimedia information retrieval - MIR '10  
This paper provides a survey of techniques that make use of a combination of three information sources: community-contributed information (e.g., tags and ratings), network structure and techniques for multimedia  ...  We focus our survey on three areas important for multimedia access: annotation, distribution and retrieval.  ...  Peekaboom is a game for object recognition [29] and Phetch is for image retrieval [30].  ... 
doi:10.1145/1743384.1743470 dblp:conf/mir/RamzanLDC10 fatcat:nhc6hjxcjvg5pidz3fufw556vm

Putting active learning into multimedia applications

Ming-yu Chen, Michael Christel, Alexander Hauptmann, Howard Wactlar
2005 Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA '05  
Visually dense displays of thumbnail imagery in storyboard views are used for shot-based video exploration and retrieval.  ...  Examples are given illustrating the iterative creation of a classifier for a concept of interest to be included in subsequent investigations, and for a concept typically deemed irrelevant to be weeded  ...  plan to assess the benefits of ENVIE and its active learning component for interactive video information retrieval, with ENVIE's development driven by the goal of providing efficient, effective access  ... 
doi:10.1145/1101149.1101342 dblp:conf/mm/ChenCHW05 fatcat:yiqc6355azatpfgte2fsuvdnpi

Ethnographic Observations Of Musicologists At The British Library: Implications For Music Information Retrieval

Mathieu Barthet, Simon Dixon
2011 Zenodo  
This study was conducted as part of the RCUK Digital Economy project EP/I001832/1, Musicology for the Masses 4 .  ...  ACKNOWLEDGMENTS The authors wish to thank the Edison Fellows and the British Library for their kind participation and help during this study.  ...  notes, or accompanying manuscript documents (e.g. a paper card system that an original collector had kept).  ... 
doi:10.5281/zenodo.1415669 fatcat:mcybxadyyzgrzeaxcimvrxpj5y

Research on Multimodal Music Emotion Recognition Method Based on Image Sequence

Zhao Yu, Bai Yuan Ding
2021 Scientific Programming  
Therefore, once the identification error occurs, it will not be able to create a good stage effect. Therefore, a multimodal music emotion recognition method based on image sequence is studied.  ...  The work of music performance system is to control the light change by identifying the emotional elements of music.  ...  Multimodal Music Emotion Recognition and Classification Based on Image Sequence In addition to the necessary music itself, a perfect music performance is a complementary live atmosphere.  ... 
doi:10.1155/2021/7087588 fatcat:5zbcovzo3veclggqszsqb62kte

Video Segmentation Using Hidden Markov Model with Multimodal Features [chapter]

Tae Meon Bae, Sung Ho Jin, Yong Man Ro
2004 Lecture Notes in Computer Science  
In this paper, a video segmentation algorithm based on Hidden Markov Model classifier with multimodal feature is proposed.  ...  Scalable color histogram is popularly used to measure the similarity between images in image retrieval [2] .  ...  Because video segmentation is basic operation in authoring and retrieving video contents, it is important to detect precise shot boundaries and segment a video into semantically homogeneous units for high-level  ... 
doi:10.1007/978-3-540-27814-6_48 fatcat:l6mtkvtxzfbrrkeuo6pougd4ha

Image Retrieval Technology of Smart Archives from the Perspective of National Reading

Qu Danqiu, Ren Hao, Zhiguo Qu
2022 Wireless Communications and Mobile Computing  
retrieval model for users of smart archives is constructed by using a standardized structure and mixed structure.  ...  In order to improve the image retrieval effect of smart archives, reduce retrieval time, and enhance retrieval performance, this paper proposes an image retrieval technology for smart archives from the  ...  on Countermeasures for Improving Emergency Management Ability of Jilin Universities Smart Library" (Project No: WK2020C176).  ... 
doi:10.1155/2022/9592726 fatcat:5klzbphwbbaazmj5oq7tral4v4

Multimodal Learning with Vision and Language

2019 2019 Ninth International Conference on Image Processing Theory, Tools and Applications (IPTA)  
Referring expressions), image question answering, one-shot novel concept captioning, multimodal word embedding, and multi-label classification.  ...  We first proposed an effective RNN-CNN framework (Recurrent Neural Network-Convolutional Neural Network) to address the task of image captioning (i.e. describing an image with a sentence).  ...  , such as image annotation, multimodal image search system, and visual question answering systems for the visually impaired person.  ... 
doi:10.1109/ipta.2019.8936104 fatcat:2nwkscl4sncwpbhvaat4zn5k2q

PaTac: Urban, Ubiquitous, Personalized Services for Citizens and Tourists

Luigi Ceccaroni, Victor Codina, Manel Palau, Marc Pous
2009 2009 Third International Conference on Digital Society  
The goal is to create a platform able to provide personalized services based on recommendation algorithms, and users' location, profile and preferences.  ...  This paper presents the general design of an architecture, based on software agents and oriented to the semantic Web, for the development and deployment of urban, ubiquitous services for citizens and tourist  ...  References [1] Dey, A. and Abowd, G., "Towards a Better Understanding of Context and Context-Awareness", 1999.  ... 
doi:10.1109/icds.2009.25 dblp:conf/icds/CeccaroniCPP09 fatcat:kdv5kwf7izhl7mtlay5rolrstq

A Multimodal Analytics Platform for Journalists Analyzing Large-Scale, Heterogeneous Multilingual, and Multimedia Content

Stefanos Vrochidis, Anastasia Moumtzidou, Ilias Gialampoukidis, Dimitris Liparas, Gerard Casamayor, Leo Wanner, Nicolaus Heise, Tilman Wagner, Andriy Bilous, Emmanuel Jamin, Boyan Simeonov, Vladimir Alexiev (+3 others)
2018 Frontiers in Robotics and AI  
High-level information is extracted from both textual and multimedia content for fast inspection using concept clouds.  ...  The textual and multimedia content is semantically integrated and indexed using a common representation, to be accessible through a web-based search engine.  ...  public and open images from Bing, Flickr, and Google, using their corresponding official APIs, and a manual annotation stage follows for verification of the retrieved images.  ... 
doi:10.3389/frobt.2018.00123 pmid:33501002 pmcid:PMC7805659 fatcat:lw73va4vrbaq5ir5ztc5caujnu

Personalization and Context Management

Andreas Zimmermann, Marcus Specht, Andreas Lorenz
2005 User modeling and user-adapted interaction  
The paper will introduce a base framework and tools for designing context management applications and decompose the underlying framework into its foundational components.  ...  Context management is a new approach to the design of context-aware systems in ubiquitous computing that combines personalization and contextualization.  ...  The value vectors stored with the content blocks are the basis for a filtering process that retrieves content in a specific context later on: The retrieval procedure compares the stored context snapshots  ... 
doi:10.1007/s11257-005-1092-2 fatcat:47fucuglp5cqxmdoz6xwiq3oga

The Functions of Visual Representational Meaning in Supporting the Ideational Meaning in Cambridge Guess What Pupil's Book

Syarifatusnain Maulida Wahyu Rabbani, Mursid Saleh, Djoko Sutopo
2021 English Education Journal  
Sometimes, the combination between image, color, sound, and action symbol have been considered as paralanguage no longer play a subordinate role in modern communication.  ...  In a learning context, students are usually faced with images and texts, especially in textbooks they carry around with them. Nowadays, meaning-making rarely depends on language alone.  ...  Practically, the findings could be used for anyone to provide figures with the proper participant-process agreement with the text.  ... 
doi:10.15294/eej.v11i1.45329 fatcat:kyavqmo675b6fgrizvbprauu4m