Filters








253 Hits in 6.6 sec

Monocular Relative Depth Perception with Web Stereo Data Supervision

Ke Xian, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, Zhenbo Luo
2018 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition  
In this paper we study the problem of monocular relative depth perception in the wild.  ...  We introduce a simple yet effective method to automatically generate dense relative depth annotations from web stereo images, and propose a new dataset that consists of diverse images as well as corresponding  ...  As shown in Figure 4 ,w e formulate monocular relative depth perception as a regression task.  ... 
doi:10.1109/cvpr.2018.00040 dblp:conf/cvpr/XianSCL0LL18 fatcat:gp5jcobwnrc3re2lq34afllyqu

SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments [article]

Hanjiang Hu, Baoquan Yang, Zhijian Qiao, Ding Zhao, Hesheng Wang
2021 arXiv   pre-print
depth prediction is still challenging even with fine-tuning.  ...  We further give promising avenues that self-supervised training and stereo geometry constraint help to enhance the robustness to changing environments.  ...  Monocular relative of depth, camera motion, optical flow and motion depth perception with web stereo data supervision. segmentation.  ... 
arXiv:2011.04408v5 fatcat:vout2lb2lffrdlpgte2zk3omoi

Real-time single image depth perception in the wild with handheld devices [article]

Filippo Aleotti, Giulio Zaccaroni, Luca Bartolomei, Matteo Poggi, Fabio Tosi, Stefano Mattoccia
2020 arXiv   pre-print
Depth perception is paramount to tackle real-world problems, ranging from autonomous driving to consumer applications.  ...  Indeed, to further support this evidence, we report experimental results concerning real-time depth-aware augmented reality and image blurring with smartphones in-the-wild.  ...  ACKNOWLEDGEMENT We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.  ... 
arXiv:2006.05724v1 fatcat:llf5ld6tcnd3fgtqfnetsoh4gq

Real-Time Single Image Depth Perception in the Wild with Handheld Devices

Filippo Aleotti, Giulio Zaccaroni, Luca Bartolomei, Matteo Poggi, Fabio Tosi, Stefano Mattoccia
2020 Sensors  
Depth perception is paramount for tackling real-world problems, ranging from autonomous driving to consumer applications.  ...  Nonetheless, two main issues limit the practical deployment of monocular depth estimation methods on such devices: (i) the low reliability when deployed in the wild and (ii) the resources needed to achieve  ...  Acknowledgments: We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.  ... 
doi:10.3390/s21010015 pmid:33375010 pmcid:PMC7792771 fatcat:xspclmpdvneofeqqem2vuolplu

Holopix50k: A Large-Scale In-the-wild Stereo Image Dataset [article]

Yiwen Hua, Puneet Kohli, Pritish Uplavikar, Anand Ravi, Saravana Gunaseelan, Jason Orozco, Edward Li
2020 arXiv   pre-print
We experimentally show that using our dataset significantly improves results for tasks such as stereo super-resolution and self-supervised monocular depth estimation.  ...  In this work, we describe our data collection process and statistically compare our dataset to other popular stereo datasets.  ...  From Table 2 Self-supervised Monocular Depth Estimation We use the Holopix50k dataset to fine-tune a self-supervised monocular depth estimation model, namely Monodepth2 [19] .  ... 
arXiv:2003.11172v1 fatcat:5p2audjw7zdghbx6hoojpvwaw4

3-D Depth Reconstruction from a Single Still Image

Ashutosh Saxena, Sung H. Chung, Andrew Y. Ng
2007 International Journal of Computer Vision  
We further propose a model that incorporates both monocular cues and stereo (triangulation) cues, to obtain significantly more accurate depth estimates than is possible using either monocular or stereo  ...  We take a supervised learning approach to this problem, in which we begin by collecting a training set of monocular images (of unstructured indoor and outdoor environments which include forests, sidewalks  ...  Acknowledgments We give warm thanks to Jamie Schulte, who designed and built the 3-d scanner, and to Andrew Lookingbill, who helped us with collecting the data used in this work.  ... 
doi:10.1007/s11263-007-0071-y fatcat:mc7g2mty5zfw3abiffim5c5b4q

Stereopsis and amblyopia: A mini-review

Dennis M. Levi, David C. Knill, Daphne Bavelier
2015 Vision Research  
with direct stereo training.  ...  Impaired stereoscopic depth perception is the most common deficit associated with amblyopia under ordinary (binocular) viewing conditions (Webber & Wood, 2005) .  ...  et al. to compute the relative weights given to stereo and monocular slant cues.  ... 
doi:10.1016/j.visres.2015.01.002 pmid:25637854 pmcid:PMC4519435 fatcat:csxzpqlnljfspadshmyken6h54

Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video with Applications for Virtual Reality [article]

Alisha Sharma, Ryan Nett, Jonathan Ventura
2020 arXiv   pre-print
We also apply our network to the problem of converting monocular panoramas to stereo panoramas.  ...  Our evaluation of synthetic and real data shows that unsupervised learning of depth and ego-motion on cylindrical panoramic images can produce high-quality depth maps and that an increased field-of-view  ...  Supervised Monocular Depth Prediction Early research focused on detecting structure from stereo-or multi-source-imagery.  ... 
arXiv:2010.07704v1 fatcat:znyt36i2mzgixnnyeo2eknvmo4

CNN Based Monocular Depth Estimation

K. Swaraja, K. Naga Siva Pavan, S. Suryakanth Reddy, K. Ajay, P. Uday Kiran Reddy, Padmavathi Kora, K. Meenakshi, Duggineni Chaitanya, Himabindu Valiveti, S. Tummala, S. Kosaraju, P. Bobba (+1 others)
2021 E3S Web of Conferences  
We demonstrate how, even with a very basic decoder, our approach can provide complete high-resolution depth maps.  ...  With the use of transfer learning, this research executes a convolutional neural network for generating a high-resolution depth map from a single RGB image.  ...  Fig 3 NYU Depth v2 dataset It is the common benchmark and the main training dataset in the supervised monocular depth estimation.  ... 
doi:10.1051/e3sconf/202130901070 fatcat:ync6p26mvvbircvpicv6vneiri

Pyramid frequency network with spatial attention residual refinement module for monocular depth estimation

Zhengyang Lu, Ying Chen
2022 Journal of Electronic Imaging (JEI)  
To reconstruct depth maps with accurate details, the SARRM constructs a residual fusion method with an attention mechanism to refine the blur depth.  ...  With the frequency strategy, PFN achieves better visual accuracy than state-of-the-art methods in both indoor and outdoor scenes on Make3D, KITTI depth, and NYUv2 datasets.  ...  Monocular Relative Depth Perception 16 and Structure-guided Ranking Loss for depth prediction 17 also involved ordinal regression in the task of depth prediction.  ... 
doi:10.1117/1.jei.31.2.023005 fatcat:uz5l3up77jfizjew4p5d52jdya

An Overview on Visual SLAM: From Tradition to Semantic

Weifeng Chen, Guangtao Shang, Aihong Ji, Chengjun Zhou, Xiyang Wang, Chonghui Xu, Zhenxiong Li, Kai Hu
2022 Remote Sensing  
Starting with typical neural networks CNN and RNN, we summarize the improvement of neural networks for the VSLAM system in detail.  ...  This paper introduces the development of VSLAM technology from two aspects: traditional VSLAM and semantic VSLAM combined with deep learning.  ...  Method Year Sensor Neural Network Supervision CNN-SLAM [158] 2017 Monocular CNN Supervised DeepVo [191] 2017 Monocular R-CNN Supervised Code-SLAM [44] 2018 Monocular U-Net Supervised DVSO [159]  ... 
doi:10.3390/rs14133010 fatcat:g45tav2qc5gchp46n6eunjvb2i

Evaluation of visual localization systems in underground mining

Christoforos Kanellakis, George Nikolakopoulos
2016 2016 24th Mediterranean Conference on Control and Automation (MED)  
In this experimental evaluation the sensors employed were a RGB-D camera (Kinect for Windows) and a web camera (Playstation 3 Eye) in two configurations, as a stereo rig and as a monocular visual sensor  ...  The processing of the stored data from the experiments will provide an insight into the efficiency of these sensors.  ...  Therefore a low cost web camera and a well known RGB-D sensor were utilized. The cameras were used both as a stereo camera rig and as a monocular visual sensor. A.  ... 
doi:10.1109/med.2016.7535853 dblp:conf/med/KanellakisN16 fatcat:wgu4ah4r6nbqbovant7s4axinu

Computationally Inexpensive Appearance Based Terrain Learning in Unknown Environments

Prabhakar Mishra, Anirudh Viswanathan
2013 Journal of Artificial Intelligence and Soft Computing Research  
Self-supervised learning is used to identify navigable terrain in the perception space. Training data is obtained using pre-filtered pixels, which correspond to near-range traversable terrain.  ...  We adopt a monocular vision based framework, using a single consumer grade camera as the primary sensor, and model the terrain as a Mixture of Gaussians.  ...  Our approach relies on a self-supervised learning algorithm, trained by prefiltering pixels in the perception space.  ... 
doi:10.2478/jaiscr-2014-0014 fatcat:g32rwiosv5daxe5yuu225bwogq

When the Sun Goes Down: Repairing Photometric Losses for All-Day Depth Estimation [article]

Madhu Vankadari, Stuart Golodetz, Sourav Garg, Sangyun Shin, Andrew Markham, Niki Trigoni
2022 arXiv   pre-print
Self-supervised deep learning methods for joint depth and ego-motion estimation can yield accurate trajectories without needing ground-truth training data.  ...  Second, we predict a per-pixel residual flow map that we use to correct the reprojection correspondences induced by the estimated ego-motion and depth from the networks.  ...  Acknowledgments This work was supported by Amazon Web Services via the Oxford-Singapore Human-Machine Collaboration Programme.  ... 
arXiv:2206.13850v1 fatcat:f5ku34a5znc2bhmlutxjcfowka

Author Index

2010 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition  
Learning for Image Classification Improving Web Image Search Results using Query-relative Classifiers Vezhnevets, Alexander Towards Weakly Supervised Semantic Segmentation by Means of Multiple Instance  ...  Videos Demo: YoutubeCat: Categorizing Wild Youtube Videos Workshop: Synthesis of Stereoscopic Views from Monocular Endoscopic Videos Li, Eric Bundled Depth-Map Merging for Multi-View Stereo Workshop  ... 
doi:10.1109/cvpr.2010.5539913 fatcat:y6m5knstrzfyfin6jzusc42p54
« Previous Showing results 1 — 15 out of 253 results