A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2020; you can also visit the original URL.
The file type is application/pdf
.
Filters
Guest Editorial Introduction to the Special Section on Intelligent Visual Content Analysis and Understanding
2020
IEEE transactions on circuits and systems for video technology (Print)
The last two articles focus on designing robust recognition algorithms. Different kinds of tasks may require focusing on different aspects of the video. ...
To achieve satisfying performance, effective recognition methods with specifically designed learning strategies are urgently needed for diverse tasks. ...
doi:10.1109/tcsvt.2020.3031416
fatcat:gpwbmydqbza5lddatxcfcidwcq
Indoor Space Recognition using Deep Convolutional Neural Network: A Case Study at MIT Campus
[article]
2016
arXiv
pre-print
In this paper, we propose a robust and parsimonious approach using Deep Convolutional Neural Network (DCNN) to recognize and interpret interior space. ...
Third, we introduce a DCNN based approach to look into the visual similarity and visual distinctiveness of interior space. ...
However, in this process the key human future of navigating space based on spatial recognition based on visual cues, is not taken into account. ...
arXiv:1610.02414v1
fatcat:xtgo2qrkmrd7zkp2csjcq36ngi
Facial Expression Recognition Research Based on Deep Learning
[article]
2019
arXiv
pre-print
In this paper, we design and train a convolution neural network based on the expression recognition, and explore the classification mechanism of the network. ...
expression recognition convolution neural network forms a detector for the specific facial action unit. ...
Brief of traditional expression recognition method One of the traditional expression recognition methods is based on action unit (AU). ...
arXiv:1904.09737v3
fatcat:wshlfqk4ufhfvogce4ybgplbye
Influence of recurrent interactions on texture processing in networks with different visual map organizations
2015
BMC Neuroscience
We design a network for the salt-and-pepper organization in an analogous way, assuming the same spatial extent of connectivity and its tuning-dependence as observed in mouse visual cortex [4] . ...
For that purpose, we drive the network with oriented gratings to reconstruct the selectivity from the activities. ...
We design a network for the salt-and-pepper organization in an analogous way, assuming the same spatial extent of connectivity and its tuning-dependence as observed in mouse visual cortex [4] . ...
doi:10.1186/1471-2202-16-s1-p115
pmcid:PMC4697518
fatcat:ru4tsrb7bbhhtmjlc7gmzef3e4
Violence Recognition Based on Auditory-Visual Fusion of Autoencoder Mapping
2021
Electronics
Therefore, this paper proposes a method for auditory–visual information fusion based on autoencoder mapping. ...
the fusion of visual and auditory information on segment level features. ...
Violent Behavior Recognition Model Based on Visual and Auditory Fusion
Network Structure According to Sections 2 and 3.1, this paper designed a violent behavior recognition model based on the auditory-visual ...
doi:10.3390/electronics10212654
fatcat:wyhe5vdzunbepdgrkypaytkpg4
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
[article]
2021
arXiv
pre-print
Visual recognition is currently one of the most important and active research areas in computer vision, pattern recognition, and even the general field of artificial intelligence. ...
While general surveys on the efficiency issue of DNNs have been done from various perspectives, as far as we are aware, scarcely any of them focused on visual recognition systematically, and thus it is ...
compressed by tensor networks for visual recognition tasks. ...
arXiv:2108.13055v2
fatcat:nf3lymdbvzgl7otl7gjkk5qitq
Deep Neural Network for Image Recognition Based on the Caffe Framework
2018
2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP)
This paper describes the developing the deep neural network model for image recognition and a corresponding experimental research on an example of the MNIST data set. ...
, speech recognition, computer vision, natural language processing, data visualization. ...
the everyday life of most people on our planet and solve many problems of artificial intelligence, for example, speech recognition, computer vision, natural language processing, data visualization, etc ...
doi:10.1109/dsmp.2018.8478621
fatcat:smr7tskf4bfebhmczdcagyvzq4
Emotional computing based on cross-modal fusion and edge network data incentive
2019
Personal and Ubiquitous Computing
In order to improve the computational efficiency and user experience quality, a data incentive algorithm for edge network is designed in this paper, based on the overlapping delay gaps and incentive weights ...
In this space, all emotional events and emotional data elements are balanced. In this paper, an emotional computing algorithm based on cross-modal data fusion is designed. ...
based on visual processing design of edge network data, can not only achieve the goal shown in Eq. (3) but also give full play to the advantages of cross-modal fusion. ...
doi:10.1007/s00779-019-01232-1
fatcat:6rpzw4lzyjcmrndoqmo2exoqty
Multi-Perspective LSTM for Joint Visual Representation Learning
[article]
2021
arXiv
pre-print
We demonstrate that by using the proposed cell to create a network, more effective and richer visual representations are learned for recognition tasks. ...
We validate the performance of our proposed architecture in the context of two multi-perspective visual recognition tasks namely lip reading and face recognition. ...
This cell architecture was designed for face recognition. ...
arXiv:2105.02802v1
fatcat:dsv35aeogvasnnefvk44ou7fsm
Neural network Feature Extraction for the Tasks of Visual Recognition
2005
Al-Rafidain Engineering Journal
In this Paper, a neural network image recognition system is used. ...
The Neocognitron [8] in that system is used as feature extractor, then the feature are classified by using a multilayered feedforward network to generate recognition codes. ...
Fig(2) Image Recognition Neural Network System The number of layers in the visual segment depends on the complexity of the input images. ...
doi:10.33899/rengj.2005.46198
fatcat:z2exem5zuneqpg6nxydbed5lhu
Multi-View Task-Driven Recognition in Visual Sensor Networks
[article]
2017
arXiv
pre-print
The proposed representation learning scheme is referred to as the multi-view task-driven learning for visual sensor network (MT-VSN). ...
Nowadays, distributed smart cameras are deployed for a wide set of tasks in several application scenarios, ranging from object recognition, image retrieval, and forensic applications. ...
Though, a major challenge in visual sensor networks is the limitation in terms of transmission bandwidth. ...
arXiv:1705.10715v2
fatcat:qlzcbg325fdupdq47zjkzxftba
Towards Open Set Deep Networks
2016
2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Deep networks have produced significant gains for various visual recognition problems, leading to high impact academic and commercial applications. ...
We prove that the OpenMax concept provides bounded open space risk, thereby formally providing an open set recognition solution. ...
Hence the need arises for designing visual recognition tools that formally account for the "unknown unknowns" [18] . ...
doi:10.1109/cvpr.2016.173
dblp:conf/cvpr/BendaleB16
fatcat:w7xtcfbeuzdcfbiondy7awxd2q
CAPTCHA design based on moving object recognition problem
2010
The 3rd International Conference on Information Sciences and Interaction Sciences
An attacker can log on the test service system, only after he solves the moving object recognition problem . ...
The static plane visual CAPTCHA based on OCR problems with the advantages of implementation and operation [1] become the mainstream of the current CAPTCHA technology application form. ...
Non-visual programs are usually designed for special populations and special occasion. Their applications aren't as broad as visual problems. ...
doi:10.1109/icicis.2010.5534730
fatcat:jl5xkfgodvculb7byfhvz6cg6m
Real-Time Dance Posture Tracking Method Based on Lightweight Network
2022
Wireless Communications and Mobile Computing
The traditional motion tracking decomposition method, on the other hand, is unable to calculate the visual changes of adjacent key nodes, and the contour of 3D visual motion tracking remains ambiguous. ...
The test results show that the proposed algorithm's recognition accuracy has improved. ...
Firstly, the design space of lightweight network model is small, and the ability of feature representation is insufficient. ...
doi:10.1155/2022/5001896
fatcat:pa5krbvw5ffs3d27f2msj5yjyi
Book reports
1993
Computers and Mathematics with Applications
Affme invariants for model-based recognition. 19. Object recognition based on moment (or algebraic) invariants. 20. Fast recognition using algebraic invariants. 21. ...
Fault-tolerant design for multistage routing networks. lndej~aiteJ. By Molly Diesing. MIT Press, Cambridge, MA and London, England. (1992). 175 pages. $17.95. Contents: I. ...
doi:10.1016/0898-1221(93)90315-m
fatcat:tnmkz3axfnekvgr4srsgsivuvy
« Previous
Showing results 1 — 15 out of 242,286 results