scholarly journals Depth estimation of a single RGB image with semi-supervised two-stage regression

Author(s):  
Jun Chi ◽  
Jie Gao ◽  
Lin Qi ◽  
Shu Zhang ◽  
Junyu Dong ◽  
...  
Keyword(s):  
Author(s):  
Jianyuan Sun ◽  
Zidong Wang ◽  
Hui Yu ◽  
Shu Zhang ◽  
Junyu Dong ◽  
...  
Keyword(s):  

2020 ◽  
Vol 22 (5) ◽  
pp. 1220-1233
Author(s):  
Wenfeng Song ◽  
Shuai Li ◽  
Ji Liu ◽  
Aimin Hao ◽  
Qinping Zhao ◽  
...  
Keyword(s):  

2020 ◽  
Author(s):  
Guoliang Liu

Full resolution depth is required in many realworld engineering applications. However, exist depth sensorsonly offer sparse depth sample points with limited resolutionand noise, e.g., LiDARs. We here propose a deep learningbased full resolution depth recovery method from monocularimages and corresponding sparse depth measurements of targetenvironment. The novelty of our idea is that the structure similarinformation between the RGB image and depth image is used torefine the dense depth estimation result. This important similarstructure information can be found using a correlation layerin the regression neural network. We show that the proposedmethod can achieve higher estimation accuracy compared tothe state of the art methods. The experiments conducted on theNYU Depth V2 prove the novelty of our idea.<br>


Author(s):  
X. Yuan ◽  
J. Tian ◽  
P. Reinartz

Abstract. Near infrared bands (NIR) provide rich information for many remote sensing applications. In addition to deriving useful indices to delineate water and vegetation, near infrared channels could also be used to facilitate image pre-processing. However, synthesizing bands from RGB spectrum is not an easy task. The inter-correlations between bands are not clearly identified in physical models. Generative adversarial networks (GAN) have been used in many tasks such as generating photorealistic images, monocular depth estimation and Digital Surface Model (DSM) refinement etc. Conditional GAN is different in that it observes some data as a condition. In this paper, we explore a cGAN network structure to generate a NIR spectral band that is conditioned on the input RGB image. We test different discriminators and loss functions, and evaluate results using various metrics. The best simulated NIR channel has a mean absolute error of around 5 percent in Sentinel-2 dataset. In addition, the simulated NIR image can correctly distinguish between various classes of landcover.


2021 ◽  
Author(s):  
Yuyan Li ◽  
Zhixin Yan ◽  
Ye Duan ◽  
Liu Ren
Keyword(s):  

2020 ◽  
Vol 2020 (14) ◽  
pp. 377-1-377-7
Author(s):  
Bruno Artacho ◽  
Nilesh Pandey ◽  
Andreas Savakis

Monocular depth estimation is an important task in scene understanding with applications to pose, segmentation and autonomous navigation. Deep Learning methods relying on multilevel features are currently used for extracting local information that is used to infer depth from a single RGB image. We present an efficient architecture that utilizes the features from multiple levels with fewer connections compared to previous networks. Our model achieves comparable scores for monocular depth estimation with better efficiency on the memory requirements and computational burden.


Sign in / Sign up

Export Citation Format

Share Document