Filters








13,286 Hits in 4.7 sec

An Overview on Perceptually Motivated Audio Indexing and Classification

Gael Richard, Shiva Sundaram, Shrikanth Narayanan
2013 Proceedings of the IEEE  
In particular, we discuss several different strategies to integrate human perception including 1) the use of generic audition models, 2) the use of perceptually-relevant features for the analysis stage  ...  In the paper, we also illustrate some of the recent trends in semantic audio retrieval that approximate higher level perceptual processing and cognitive aspects of human audio recognition capabilities  ...  Then, to illustrate the relevance of perception for audio indexing and classification, we will focus on two applications namely audio sound categories recognition and music timbre recognition.  ... 
doi:10.1109/jproc.2013.2251591 fatcat:myywr5bztzeezi7mity4gwnpha

Spatial Context in Recognition

Moshe Bar, Shimon Ullman
1996 Perception  
The implications of these ndings to the organization of recognition memory are discussed, and a framework for a model for using spatial context, which u s e s the psychophysical ndings, is proposed.  ...  In recognizing objects and scenes, partial recognition of objects or their parts could be used to guide the recognition of other objects.  ...  Edelman for fruitful discussions and assistance with the experiments and E. Schechtman for help in the statistical analysis.  ... 
doi:10.1068/p250343 pmid:8804097 fatcat:5rquxjcpgzgnngprgxfajev4ci

Industrial Scene Text Detection with Refined Feature-attentive Network [article]

Tongkun Guan, Chaochen Gu, Changsheng Lu, Jingzheng Tu, Qi Feng, Kaijie Wu, Xinping Guan
2021 arXiv   pre-print
Specifically, we design a parallel feature integration mechanism to construct an adaptive feature representation from multi-resolution features, which enhances the perception of multi-scale texts at each  ...  Moreover, we construct two industrial scene text datasets, including a total of 102156 images and 1948809 text instances with various character structures and metal parts.  ...  The SFF module adaptively integrates multi-resolution features to enhance the perception of multi-scale text features at each scale-specific layer.  ... 
arXiv:2110.12663v1 fatcat:kfskmludrrccphhigsc7fepwki

Multi-Level Context Pyramid Network for Visual Sentiment Analysis

Haochun Ou, Chunmei Qing, Xiangmin Xu, Jianxiu Jin
2021 Sensors  
Next, the multi-scale adaptive context modules (MACM) are proposed to learn the sentiment correlation degree of different regions for different scale in the image, and to extract the multi-scale context  ...  In this paper, based on the alterable scale and multi-level local regional emotional affinity analysis under the global perspective, we propose a multi-level context pyramid network (MCPNet) for visual  ...  Finally, the multi-scale context feature Z l i for X l i will be obtained.  ... 
doi:10.3390/s21062136 pmid:33803744 fatcat:ddwzqg5yinfk3g2nqkf256bhwi

Author Index

2010 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition  
Appearance Sharing: Recursive Compositional Models for Multi-View Multi-Object Detection SUN Database: Large Scale Scene Recognition from Abbey to Zoo Torresani, Lorenzo Simultaneous Point Matching and  ...  On Growth and Formlets: Sparse Multi-Scale Coding of Planar Shape Oliva, Aude SUN Database: Large Scale Scene Recognition from Abbey to Zoo Oliveira, Francisco Workshop: Using a Vision Based Tracking  ... 
doi:10.1109/cvpr.2010.5539913 fatcat:y6m5knstrzfyfin6jzusc42p54

Similarity Matching in Computer Vision and Multimedia

Nicu Sebe, Qi Tian, Michael S. Lew, Thomas S. Huang
2008 Computer Vision and Image Understanding  
Therefore, ideas from statistical clustering, multi-dimensional indexing, and dimensionality reduction are extremely useful in this area.  ...  The task of finding point correspondences between two images of the same scene or object is part of many computer vision applications. In this context, Bay et al.  ... 
doi:10.1016/j.cviu.2008.04.001 fatcat:pquvskbt3falreu3vbh25j4brq

2020 Index IEEE Transactions on Image Processing Vol. 29

2020 IEEE Transactions on Image Processing  
., +, TIP 2020 5396-5407 Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition.  ...  Wang, Q., +, TIP 2020 7549-7564 Multi-Scale Multi-View Deep Feature Aggregation for Food Recognition.  ... 
doi:10.1109/tip.2020.3046056 fatcat:24m6k2elprf2nfmucbjzhvzk3m

Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids [article]

Saad Bin Ahmed, Saeeda Naz, Muhammad Imran Razzak, Rubiyah Yusof
2018 arXiv   pre-print
The numerous approaches are designed in recent years for scene text extraction and recognition and the efforts are underway to improve the accuracy.  ...  The performance was investigated by considering Arabic text on each image pyramid of EASTR-42k dataset. The error rate of 0.17% was reported on Arabic scene text recognition.  ...  Acknowledgement The authors would like to thank Ministry of Education Malaysia and Universiti Teknologi Malaysia for funding this research project.  ... 
arXiv:1809.10792v1 fatcat:cbui2kztlnarvafb746sbvk35q

3D Terrestrial lidar data classification of complex natural scenes using a multi-scale dimensionality criterion: applications in geomorphology [article]

Nicolas Brodu, Dimitri Lague
2012 arXiv   pre-print
We have thus defined a multi-scale measure of the point cloud dimensionality around each point, which characterizes the local 3D organization.  ...  Comparison with a single scale approach shows the superiority of the multi-scale analysis in enhancing class separability and spatial resolution.  ...  This allows for example recognition of the vegetation on complex scenes with very high accuracy (i.e. ≈ 99.6% in a context such as fig. 1 ).  ... 
arXiv:1107.0550v3 fatcat:l7chazuetrgbpfktxtyj6cltnu

Spatial pyramid local keypoints quantization for bag of visual patches image representation

Yousef Alqasrawi, Daniel Neagu, Peter Cowling
2010 2010 10th International Conference on Intelligent Systems Design and Applications  
Bag of visual patches (BOP) image representation has been the main research topic in computer vision literature for scene and object recognition tasks.  ...  We show, with experiments on multi-class classification task using 700 natural scene images, that the spatial pyramid vocabulary model is suitable and discriminative for bag-of-visual patches semantic  ...  We use oneagainst-one multi-classification approach that results in M(M-1)/2 two-class SVMs for M scene classes.  ... 
doi:10.1109/isda.2010.5687083 dblp:conf/isda/AlqasrawiNC10 fatcat:o376oiiqwvelppxmbtnayos2p4

3D terrestrial lidar data classification of complex natural scenes using a multi-scale dimensionality criterion: Applications in geomorphology

N. Brodu, D. Lague
2012 ISPRS journal of photogrammetry and remote sensing (Print)  
We have thus defined a multi-scale measure of the point cloud dimensionality around each point.  ...  Comparison with a single scale approach shows the superiority of the multi-scale analysis in enhancing class separability and spatial resolution of the classification.  ...  This allows for example recognition of the vegetation on complex scenes with very high accuracy (i.e. ≈ 99.6% in a context such as fig. 1 ).  ... 
doi:10.1016/j.isprsjprs.2012.01.006 fatcat:hk4vvoos5bg6fnemgzn45acli4

2021 Index IEEE Transactions on Image Processing Vol. 30

2021 IEEE Transactions on Image Processing  
The Author Index contains the primary entry for each item, listed under the first author's name.  ...  ., +, TIP 2021 2207-2219 SLOAN: Scale-Adaptive Orientation Attention Network for Scene Text Recognition.  ...  ., +, TIP 2021 3167-3178 F Face recognition 2D-LCoLBP: A Learning Two-Dimensional Co-Occurrence Local Binary Pattern for Image Recognition.  ... 
doi:10.1109/tip.2022.3142569 fatcat:z26yhwuecbgrnb2czhwjlf73qu

Text Recognition in the Wild: A Survey [article]

Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu, Canjie Luo, Tianwei Wang
2020 arXiv   pre-print
Therefore, text recognition in natural scenes has been an active research field in computer vision and pattern recognition.  ...  Related resources are available at our Github repository: https://github.com/HCIILAB/Scene-Text-Recognition.  ...  [31] comprehensively compared these two prediction approaches on large-scale real-world scene text sentence recognition tasks.  ... 
arXiv:2005.03492v3 fatcat:rmzmavxylnf6rbp52lje2mrgiy

Language and perception: introduction to the special issue speakers and listeners in the visual world

Mila Vulchanova, Valentin Vulchanov, Isabella Fritz, Evelyn A. Milburn
2019 Journal of Cultural Cognitive Science  
Language and perception are two central cognitive systems.  ...  In this editorial, we examine the link between language and perception across three domains.  ...  Introduction Language and perception are two central cognitive systems.  ... 
doi:10.1007/s41809-019-00047-z fatcat:p5z46i6drfg4fcsdmvc32vpbva

A survey on deep multimodal learning for computer vision: advances, trends, applications, and datasets

Khaled Bayoudh, Raja Knani, Fayçal Hamdaoui, Abdellatif Mtibaa
2021 The Visual Computer  
Extracting relevant patterns from this kind of data is still a motivating goal for researchers in deep learning.  ...  Finally, we highlight the limitations and challenges of deep multimodal learning and provide insights and directions for future research.  ...  and context recognition [33] .  ... 
doi:10.1007/s00371-021-02166-7 pmid:34131356 pmcid:PMC8192112 fatcat:jojwyc6slnevzk7eaiutlmlgfe
« Previous Showing results 1 — 15 out of 13,286 results