RGB-D salient object detection: A survey

Tao Zhou, Deng-Ping Fan, Ming-Ming Cheng, Jianbing Shen, Ling Shao
2021 Computational Visual Media  
Finally, we discuss several challenges and open directions of RGB-D based salient object detection for future research.  ...  Moreover, to investigate the ability of existing models to detect salient objects, we have carried out a comprehensive attribute-based evaluation of several representative RGB-D based salient object detection  ...  Acknowledgements This research was supported by a Major Project for a New Generation of AI under Grant No. 2018AAA0100400, National Natural Science Foundation of China (61922046), and Tianjin Natural Science  ... 
doi:10.1007/s41095-020-0199-z pmid:33432275 pmcid:PMC7788385 fatcat:foiz2zth4vckjfuhvh524hwdtq

MutualFormer: Multi-Modality Representation Learning via Mutual Transformer [article]

Xixi Wang, Bo Jiang, Xiao Wang, Bin Luo
2021 arXiv   pre-print
We successfully apply the MutualFormer to the saliency detection problem and propose a novel approach to obtain the reinforced features of RGB and Depth images.  ...  Recent studies demonstrate that Transformer models usually work comparable or even better than CNN for multi-modality task, but they simply adopt concatenation or cross-attention for feature fusion which  ...  RGB-D SOD RGB-D salient object detection aims to locate the most salient objects (or regions) from visual image(s).  ... 
arXiv:2112.01177v2 fatcat:h424ta636jhivojwe2s2atmine

LIANet: Layer Interactive Attention Network for RGB-D Salient Object Detection

Yibo Han, Liejun Wang, Anyu Du, Shaochen Jiang
2022 IEEE Access  
RGB-D salient object detection (SOD) usually describes two modes' classification or regression problem, namely RGB and depth.  ...  RGB and depth maps are used alternately for layered interaction and fusion to enhance RGB feature information and gradually integrate global context information at a single scale.  ...  RGB-D SALIENT OBJECT DETECTION Traditional RGB-D saliency target detection methods [24] [25] [26] design hand-made features, for instance contrast [26], shape [27], local background closure [24]  ... 
doi:10.1109/access.2022.3156935 fatcat:e6oc6uw5dbdhzabxb52g3x2eyq

RGB-D Salient Object Detection with Ubiquitous Target Awareness

Yifan Zhao, Jiawei Zhao, Jia Li, Xiaowu Chen
2021 IEEE Transactions on Image Processing  
Conventional RGB-D salient object detection methods aim to leverage depth as complementary information to find the salient regions in both modalities.  ...  In this work, we make the first attempt to solve the RGB-D salient object detection problem with a novel depth-awareness framework.  ...  Overview In this section, we introduce a novel Ubiquitous Target Awareness (UTA) network for RGB-D salient object detection.  ... 
doi:10.1109/tip.2021.3108412 pmid:34478368 fatcat:hwd4tadcinfv5gd2mgpco3sm2y

Multi-interactive Dual-decoder for RGB-thermal Salient Object Detection [article]

Zhengzheng Tu, Zhun Li, Chenglong Li, Yang Lang, Jin Tang
2021 arXiv   pre-print
RGB-thermal salient object detection (SOD) aims to segment the common prominent regions of visible image and corresponding thermal infrared image that we call it RGBT SOD.  ...  In this paper, we propose a multi-interactive dual-decoder to mine and model the multi-type interactions for accurate RGBT SOD.  ...  FUTURE WORKS The extended tasks of salient object detection like RGB-T and RGB-D SOD have been explored a lot in recent years.  ... 
arXiv:2005.02315v3 fatcat:ck2iowqlcfaw3orjykqoe42jha

RGB-D Salient Object Detection: A Survey [article]

Tao Zhou, Deng-Ping Fan, Ming-Ming Cheng, Jianbing Shen, Ling Shao
2020 arXiv   pre-print
Salient object detection (SOD), which simulates the human visual perception system to locate the most attractive object(s) in a scene, has been widely applied to various computer vision tasks.  ...  Finally, we discuss several challenges and open directions of RGB-D based SOD for future research.  ...  In § III, we summarize and provide details for current benchmark datasets for RGB-D salient object detection.  ... 
arXiv:2008.00230v3 fatcat:n52weyun25fq3aug4ughjqs2ru

cmSalGAN: RGB-D Salient Object Detection with Cross-View Generative Adversarial Networks [article]

Bo Jiang, Zitai Zhou, Xiao Wang, Jin Tang, Bin Luo
2020 arXiv   pre-print
Fusing complementary information of RGB and depth has been demonstrated to be effective for image salient object detection which is known as RGB-D salient object detection problem.  ...  for RGB-D saliency detection problem.  ...  We develop a novel cross-modality Saliency Generative Adversarial Network (cmSalGAN) for RGB-D salient object detection.  ... 
arXiv:1912.10280v2 fatcat:zaxsm6o6mfbv5oslfg5wjwxywy

Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection

Jia-Xing Zhao, Yang Cao, Deng-Ping Fan, Ming-Ming Cheng, Xuan-Yi Li, Le Zhang
2019 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)  
The large availability of depth sensors provides valuable complementary information for salient object detection (SOD) in RGBD images.  ...  The enhanced depth cues are further integrated with RGB features for SOD, using a novel fluid pyramid integration, which can make better use of multi-scale cross-modal features.  ...  We would like to thank the anonymous reviewers for their useful feedback.  ... 
doi:10.1109/cvpr.2019.00405 dblp:conf/cvpr/ZhaoCFCLZ19 fatcat:jbf7qmti4rfavdojywljdhk5va

Learning deep representations for semantic image parsing: a comprehensive overview

Lili Huang, Jiefeng Peng, Ruimao Zhang, Guanbin Li, Liang Lin
2018 Frontiers of Computer Science  
Specifically, we first review the general frameworks for each task and introduce the relevant variants. The advantages and limitations of each method are also discussed.  ...  Additionally, it contains labeled structural support relationships for support relation classification. • SUN RGB-D. SUN RGB-D [97] is the largest RGB-D dataset currently available.  ...  , SUN RGB-D, ATR, and Fashionista.  ... 
doi:10.1007/s11704-018-7195-8 fatcat:p5hvfwhl5rbork5vf4rpnx3h6u

Accurate salient object detection via dense recurrent connections and residual-based hierarchical feature integration

Yanpeng Cao, Guizhong Fu, Jiangxin Yang, Yanlong Cao, Michael Ying Yang
2019 Signal processing. Image communication  
Recently, the convolutional neural network (CNN) has achieved great progress in many computer vision tasks including object detection, image restoration, and scene understanding.  ...  Then we present a residual-based architecture with short connections for deep supervision which hierarchically combines both coarse-level and fine-level feature representations.  ...  Zhang et al. proposed a novel bi-directional message model to integrate multi-level features for salient object detection [43].  ... 
doi:10.1016/j.image.2019.06.004 fatcat:k5dxyhyu7vdzrccq3o7zpdkwlq

Visual saliency detection for RGB-D images under a Bayesian framework

Songtao Wang, Zhen Zhou, Wei Jin, Hanbing Qu
2018 IPSJ Transactions on Computer Vision and Applications  
In this paper, we propose a saliency detection model for RGB-D images based on the deep features of RGB images and depth images within a Bayesian framework.  ...  network; then, the posterior probability of the RGB-D saliency is formulated by applying Bayes' theorem.  ...  Qu et al. designed a new CNN to fuse different low-level saliency cues into hierarchical features for automatically detecting salient objects in RGB-D images [26].  ... 
doi:10.1186/s41074-017-0037-0 fatcat:jklxzq46vrbh3mk6n7ocbqhoiu

Generative Transformer for Accurate and Reliable Salient Object Detection [article]

Yuxin Mao, Jing Zhang, Zhexiong Wan, Yuchao Dai, Aixuan Li, Yunqiu Lv, Xinyu Tian, Deng-Ping Fan, Nick Barnes
2022 arXiv   pre-print
We first investigate transformers for accurate salient object detection with deterministic neural networks, and explain that the effective structure modeling and global context modeling abilities lead  ...  Then, we design stochastic networks to evaluate the transformers' ability in reliable salient object detection.  ...  for large salient object detection.  ... 
arXiv:2104.10127v4 fatcat:yaunhucuvbba3bvp3iatzdnnxa

A Review on Human Activity Recognition Using Vision-Based Method

Shugang Zhang, Zhiqiang Wei, Jie Nie, Lei Huang, Shuang Wang, Zhen Li
2017 Journal of Healthcare Engineering  
The vision-based HAR research is the basis of many applications including video surveillance, health care, and human-computer interaction (HCI).  ...  Finally, we investigate the directions for future research.  ...  Shandong Province (no. 2015ZDZX05002); Qingdao Science and Technology Development Plan (no. 16-5-1-13-jch); and The Aoshan Innovation Project in Science and Technology of Qingdao National Laboratory for  ... 
doi:10.1155/2017/3090343 pmid:29065585 pmcid:PMC5541824 fatcat:g6qbbbjpcref3p54kvquu5rltq

RGB-D Data-Based Action Recognition: A Review

Muhammad Bilal Shaikh, Douglas Chai
2021 Sensors  
In this paper, we focus solely on data fusion and recognition techniques in the context of vision with an RGB-D perspective.  ...  Naturally, each action-data modality—such as RGB, depth, skeleton, and infrared (IR)—has distinct characteristics; therefore, it is important to exploit the value of each modality for better action recognition  ...  Acknowledgments: The authors would like to thank the anonymous reviewers for their careful reading and valuable remarks, which have greatly helped extend the scope of this paper.  ... 
doi:10.3390/s21124246 fatcat:7dvocdy63rckne5yunhfsnr4p4

Human Action Recognition and Prediction: A Survey [article]

Yu Kong, Yun Fu
2022 arXiv   pre-print
Many attempts have been devoted in the last a few decades in order to build a robust and effective framework for action recognition and prediction.  ...  (a) single person's action; (b) human interaction; (c) human-object interaction; (d) group action; (e) RGB-D action; (f) multi-view action.  ...  actions and human-object interactions; 4) a group action in Hollywood 2 dataset [170] (Fig. 2(d)); 5) an action captured by a RGB-D sensor in UTKinect dataset [295] (Fig. 2(e)); and 6) a multi-view  ... 
arXiv:1806.11230v3 fatcat:2a2d7fuezbdqzfgrjwkcuqvmbu