242,286 Hits in 4.8 sec

Guest Editorial Introduction to the Special Section on Intelligent Visual Content Analysis and Understanding

Hongliang Li, Lu Fang, Tianzhu Zhang
2020 IEEE transactions on circuits and systems for video technology (Print)  
The last two articles focus on designing robust recognition algorithms. Different kinds of tasks may require focusing on different aspects of the video.  ...  To achieve satisfying performance, effective recognition methods with specifically designed learning strategies are urgently needed for diverse tasks.  ... 
doi:10.1109/tcsvt.2020.3031416 fatcat:gpwbmydqbza5lddatxcfcidwcq

Indoor Space Recognition using Deep Convolutional Neural Network: A Case Study at MIT Campus [article]

Fan Zhang, Fabio Duarte, Ruixian Ma, Dimitrios Milioris, Hui Lin, Carlo Ratti
2016 arXiv   pre-print
In this paper, we propose a robust and parsimonious approach using Deep Convolutional Neural Network (DCNN) to recognize and interpret interior space.  ...  Third, we introduce a DCNN based approach to look into the visual similarity and visual distinctiveness of interior space.  ...  However, in this process the key human future of navigating space based on spatial recognition based on visual cues, is not taken into account.  ... 
arXiv:1610.02414v1 fatcat:xtgo2qrkmrd7zkp2csjcq36ngi

Facial Expression Recognition Research Based on Deep Learning [article]

Yongpei Zhu, Hongwei Fan, Kehong Yuan
2019 arXiv   pre-print
In this paper, we design and train a convolution neural network based on the expression recognition, and explore the classification mechanism of the network.  ...  expression recognition convolution neural network forms a detector for the specific facial action unit.  ...  Brief of traditional expression recognition method One of the traditional expression recognition methods is based on action unit (AU).  ... 
arXiv:1904.09737v3 fatcat:wshlfqk4ufhfvogce4ybgplbye

Influence of recurrent interactions on texture processing in networks with different visual map organizations

Hanna Kamyshanska, Dmitry Bibichkov, Matthias Kaschube
2015 BMC Neuroscience  
We design a network for the salt-and-pepper organization in an analogous way, assuming the same spatial extent of connectivity and its tuning-dependence as observed in mouse visual cortex [4] .  ...  For that purpose, we drive the network with oriented gratings to reconstruct the selectivity from the activities.  ...  We design a network for the salt-and-pepper organization in an analogous way, assuming the same spatial extent of connectivity and its tuning-dependence as observed in mouse visual cortex [4] .  ... 
doi:10.1186/1471-2202-16-s1-p115 pmcid:PMC4697518 fatcat:ru4tsrb7bbhhtmjlc7gmzef3e4

Violence Recognition Based on Auditory-Visual Fusion of Autoencoder Mapping

Jiu Lou, Decheng Zuo, Zhan Zhang, Hongwei Liu
2021 Electronics  
Therefore, this paper proposes a method for auditory–visual information fusion based on autoencoder mapping.  ...  the fusion of visual and auditory information on segment level features.  ...  Violent Behavior Recognition Model Based on Visual and Auditory Fusion Network Structure According to Sections 2 and 3.1, this paper designed a violent behavior recognition model based on the auditory-visual  ... 
doi:10.3390/electronics10212654 fatcat:wyhe5vdzunbepdgrkypaytkpg4

Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions [article]

Yang Wu, Dingheng Wang, Xiaotong Lu, Fan Yang, Guoqi Li, Weisheng Dong, Jianbo Shi
2021 arXiv   pre-print
Visual recognition is currently one of the most important and active research areas in computer vision, pattern recognition, and even the general field of artificial intelligence.  ...  While general surveys on the efficiency issue of DNNs have been done from various perspectives, as far as we are aware, scarcely any of them focused on visual recognition systematically, and thus it is  ...  compressed by tensor networks for visual recognition tasks.  ... 
arXiv:2108.13055v2 fatcat:nf3lymdbvzgl7otl7gjkk5qitq

Deep Neural Network for Image Recognition Based on the Caffe Framework

Myroslav Komar, Pavlo Yakobchuk, Vladimir Golovko, Vitaliy Dorosh, Anatoliy Sachenko
2018 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP)  
This paper describes the developing the deep neural network model for image recognition and a corresponding experimental research on an example of the MNIST data set.  ...  , speech recognition, computer vision, natural language processing, data visualization.  ...  the everyday life of most people on our planet and solve many problems of artificial intelligence, for example, speech recognition, computer vision, natural language processing, data visualization, etc  ... 
doi:10.1109/dsmp.2018.8478621 fatcat:smr7tskf4bfebhmczdcagyvzq4

Emotional computing based on cross-modal fusion and edge network data incentive

Lei Ma, Feng Ju, Jing Wan, Xiaoyan Shen
2019 Personal and Ubiquitous Computing  
In order to improve the computational efficiency and user experience quality, a data incentive algorithm for edge network is designed in this paper, based on the overlapping delay gaps and incentive weights  ...  In this space, all emotional events and emotional data elements are balanced. In this paper, an emotional computing algorithm based on cross-modal data fusion is designed.  ...  based on visual processing design of edge network data, can not only achieve the goal shown in Eq. (3) but also give full play to the advantages of cross-modal fusion.  ... 
doi:10.1007/s00779-019-01232-1 fatcat:6rpzw4lzyjcmrndoqmo2exoqty

Multi-Perspective LSTM for Joint Visual Representation Learning [article]

Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad
2021 arXiv   pre-print
We demonstrate that by using the proposed cell to create a network, more effective and richer visual representations are learned for recognition tasks.  ...  We validate the performance of our proposed architecture in the context of two multi-perspective visual recognition tasks namely lip reading and face recognition.  ...  This cell architecture was designed for face recognition.  ... 
arXiv:2105.02802v1 fatcat:dsv35aeogvasnnefvk44ou7fsm

Neural network Feature Extraction for the Tasks of Visual Recognition

Dr.Basil Sh. Mahmood, Shefa A. Dawwd
2005 Al-Rafidain Engineering Journal  
In this Paper, a neural network image recognition system is used.  ...  The Neocognitron [8] in that system is used as feature extractor, then the feature are classified by using a multilayered feedforward network to generate recognition codes.  ...  Fig(2) Image Recognition Neural Network System The number of layers in the visual segment depends on the complexity of the input images.  ... 
doi:10.33899/rengj.2005.46198 fatcat:z2exem5zuneqpg6nxydbed5lhu

Multi-View Task-Driven Recognition in Visual Sensor Networks [article]

Ali Taalimi, Alireza Rahimpour, Liu Liu, Hairong Qi
2017 arXiv   pre-print
The proposed representation learning scheme is referred to as the multi-view task-driven learning for visual sensor network (MT-VSN).  ...  Nowadays, distributed smart cameras are deployed for a wide set of tasks in several application scenarios, ranging from object recognition, image retrieval, and forensic applications.  ...  Though, a major challenge in visual sensor networks is the limitation in terms of transmission bandwidth.  ... 
arXiv:1705.10715v2 fatcat:qlzcbg325fdupdq47zjkzxftba

Towards Open Set Deep Networks

Abhijit Bendale, Terrance E. Boult
2016 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)  
Deep networks have produced significant gains for various visual recognition problems, leading to high impact academic and commercial applications.  ...  We prove that the OpenMax concept provides bounded open space risk, thereby formally providing an open set recognition solution.  ...  Hence the need arises for designing visual recognition tools that formally account for the "unknown unknowns" [18] .  ... 
doi:10.1109/cvpr.2016.173 dblp:conf/cvpr/BendaleB16 fatcat:w7xtcfbeuzdcfbiondy7awxd2q

CAPTCHA design based on moving object recognition problem

JingSong Cui, LiJing Wang, JingTing Mei, Da Zhang, Xia Wang, Yang Peng, WuZhou Zhang
2010 The 3rd International Conference on Information Sciences and Interaction Sciences  
An attacker can log on the test service system, only after he solves the moving object recognition problem .  ...  The static plane visual CAPTCHA based on OCR problems with the advantages of implementation and operation [1] become the mainstream of the current CAPTCHA technology application form.  ...  Non-visual programs are usually designed for special populations and special occasion. Their applications aren't as broad as visual problems.  ... 
doi:10.1109/icicis.2010.5534730 fatcat:jl5xkfgodvculb7byfhvz6cg6m

Real-Time Dance Posture Tracking Method Based on Lightweight Network

Zhigang Wang, Xin Ning
2022 Wireless Communications and Mobile Computing  
The traditional motion tracking decomposition method, on the other hand, is unable to calculate the visual changes of adjacent key nodes, and the contour of 3D visual motion tracking remains ambiguous.  ...  The test results show that the proposed algorithm's recognition accuracy has improved.  ...  Firstly, the design space of lightweight network model is small, and the ability of feature representation is insufficient.  ... 
doi:10.1155/2022/5001896 fatcat:pa5krbvw5ffs3d27f2msj5yjyi

Book reports

1993 Computers and Mathematics with Applications  
Affme invariants for model-based recognition. 19. Object recognition based on moment (or algebraic) invariants. 20. Fast recognition using algebraic invariants. 21.  ...  Fault-tolerant design for multistage routing networks. lndej~aiteJ. By Molly Diesing. MIT Press, Cambridge, MA and London, England. (1992). 175 pages. $17.95. Contents: I.  ... 
doi:10.1016/0898-1221(93)90315-m fatcat:tnmkz3axfnekvgr4srsgsivuvy
« Previous Showing results 1 — 15 out of 242,286 results