Deep Monocular Video Depth Estimation Using Temporal Attention

Depth estimation has made great progress in the last few years due to its applications in robotics and computer vision. Various methods have been implemented and enhanced to estimate depth without flicker or missing holes. Despite this progress, it remains one of the main challenges for researchers, especially for video applications, where the greater complexity of the neural network affects the run time. Moreover, using monocular video as input for depth estimation is considered an attractive idea, particularly for hand-held devices such as mobile phones, which are very popular for capturing pictures and videos but have a limited amount of RAM. In this work, we focus on enhancing the existing consistent depth estimation approach for monocular videos so that it uses less RAM and fewer parameters without a significant reduction in the quality of the depth estimation.

Online supervised attention-based recurrent depth estimation from monocular video

PeerJ Computer Science ◽

10.7717/peerj-cs.317 ◽

2020 ◽

Vol 6 ◽

pp. e317

Author(s):

Dmitrii Maslov ◽

Ilya Makarov

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Short Term Memory ◽

Depth Estimation ◽

Autonomous Driving ◽

Temporal Information ◽

Depth Information ◽

Safe Driving ◽

Monocular Video ◽

Depth Reconstruction

Autonomous driving depends heavily on depth information for safe driving. Recently, major progress has been made in both supervised and self-supervised methods for depth reconstruction. However, most current approaches focus on single-frame depth estimation, where the quality limit is hard to beat due to the limitations of supervised learning of deep neural networks in general. One way to improve the quality of existing methods is to utilize temporal information from frame sequences. In this paper, we study intelligent ways of integrating a recurrent block into a common supervised depth estimation pipeline. We propose a novel method that takes advantage of the convolutional gated recurrent unit (convGRU) and convolutional long short-term memory (convLSTM). We compare the use of convGRU and convLSTM blocks and determine the best model for the real-time depth estimation task. We carefully study the training strategy and provide new deep neural network architectures for depth estimation from monocular video, using information from past frames based on an attention mechanism. We demonstrate the efficiency of exploiting temporal information by comparing our best recurrent method with existing image-based and video-based solutions for monocular depth reconstruction.
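As a rough illustration of the convGRU block named in this abstract, the following is a minimal single-channel sketch of one recurrent update, written in pure Python. It is not the authors' architecture: the function names (`conv2d_same`, `conv_gru_step`, `zero_params`), the parameter layout, and the single-channel simplification are all assumptions made for illustration. The update follows the standard GRU form with convolutions in place of matrix products.

```python
import math

def conv2d_same(img, kernel):
    """Single-channel 2D cross-correlation with zero padding (output has input size)."""
    h, w = len(img), len(img[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for di in range(kh):
                for dj in range(kw):
                    y, x = i + di - ph, j + dj - pw
                    if 0 <= y < h and 0 <= x < w:
                        acc += img[y][x] * kernel[di][dj]
            out[i][j] = acc
    return out

def _map2(f, a, b):
    """Apply f element-wise to two equally sized 2D lists."""
    return [[f(p, q) for p, q in zip(ra, rb)] for ra, rb in zip(a, b)]

def _sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def zero_params(k=3):
    """Hypothetical parameter set: all-zero kernels and biases for the z, r, c paths."""
    zeros = lambda: [[0.0] * k for _ in range(k)]
    params = {}
    for name in ("z", "r", "c"):
        params["Wx" + name] = zeros()   # kernel applied to the input frame features
        params["Wh" + name] = zeros()   # kernel applied to the hidden state
        params["b" + name] = 0.0        # scalar bias
    return params

def conv_gru_step(x, h, params):
    """One ConvGRU update for single-channel feature maps x (input) and h (state)."""
    def gate(name, state, act):
        a = conv2d_same(x, params["Wx" + name])
        b = conv2d_same(state, params["Wh" + name])
        bias = params["b" + name]
        return _map2(lambda p, q: act(p + q + bias), a, b)

    z = gate("z", h, _sigmoid)                 # update gate
    r = gate("r", h, _sigmoid)                 # reset gate
    rh = _map2(lambda p, q: p * q, r, h)       # reset-gated previous state
    cand = gate("c", rh, math.tanh)            # candidate state
    keep = _map2(lambda zi, hi: (1.0 - zi) * hi, z, h)
    new = _map2(lambda zi, ci: zi * ci, z, cand)
    # h_new = (1 - z) * h + z * cand (some references swap the roles of z and 1 - z)
    return _map2(lambda p, q: p + q, keep, new)
```

In a video setting, `conv_gru_step` would be called once per frame, with the returned state passed in as `h` for the next frame, which is how information from past frames reaches the current prediction.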

Realtime Depth Estimation and Obstacle Detection from Monocular Video

Lecture Notes in Computer Science - Pattern Recognition ◽

10.1007/11861898_48 ◽

2006 ◽

pp. 475-484 ◽

Cited By ~ 24

Author(s):

Andreas Wedel ◽

Uwe Franke ◽

Jens Klappstein ◽

Thomas Brox ◽

Daniel Cremers

Keyword(s):

Depth Estimation ◽

Obstacle Detection ◽

Monocular Video

Unsupervised Ego-Motion and Dense Depth Estimation with Monocular Video

2018 IEEE 18th International Conference on Communication Technology (ICCT) ◽

10.1109/icct.2018.8600039 ◽

2018 ◽

Cited By ~ 2

Author(s):

Yufan Xu ◽

Yan Wang ◽

Lei Guo

Keyword(s):

Depth Estimation ◽

Monocular Video ◽

Dense Depth Estimation

Don’t Forget The Past: Recurrent Depth Estimation from Monocular Video

IEEE Robotics and Automation Letters ◽

10.1109/lra.2020.3017478 ◽

2020 ◽

Vol 5 (4) ◽

pp. 6813-6820 ◽

Cited By ~ 1

Author(s):

Vaishakh Patil ◽

Wouter Van Gansbeke ◽

Dengxin Dai ◽

Luc Van Gool

Keyword(s):

Depth Estimation ◽

The Past ◽

Monocular Video

Bootstrapped Self-Supervised Training with Monocular Video for Semantic Segmentation and Depth Estimation

10.1109/iros51168.2021.9636330 ◽

2021 ◽

Author(s):

Yihao Zhang ◽

John J. Leonard

Keyword(s):

Semantic Segmentation ◽

Depth Estimation ◽

Supervised Training ◽

Monocular Video

Attention meets Geometry: Geometry Guided Spatial-Temporal Attention for Consistent Self-Supervised Monocular Depth Estimation

10.1109/3dv53792.2021.00092 ◽

2021 ◽

Author(s):

Patrick Ruhkamp ◽

Daoyi Gao ◽

Hanzhi Chen ◽

Nassir Navab ◽

Benjamin Busam

Keyword(s):

Depth Estimation ◽

Temporal Attention ◽

Monocular Depth

Enhancement of Consistent Depth Estimation for Monocular Videos Approach

10.5121/csit.2021.110910 ◽

2021 ◽

Author(s):

Mohamed N. Sweilam ◽

Nikolay Tolstokulakov

Keyword(s):

Neural Network ◽

Computer Vision ◽

Mobile Phones ◽

Depth Estimation ◽

The Neural Network ◽

Monocular Video ◽

Great Progress ◽

Run Time

Depth estimation has made great progress in the last few years due to its applications in robotics and computer vision. Various methods have been developed and implemented to estimate depth without flicker or missing holes. Despite this progress, it remains one of the main challenges for researchers, especially for video applications, which pose additional difficulties such as the complexity of the neural network, which affects the run time. Moreover, using monocular video as input for depth estimation is considered an attractive idea, particularly for hand-held devices such as mobile phones, which are now very popular for capturing pictures and videos. In this work, we focus on enhancing the existing consistent depth estimation approach for monocular videos so that it uses less memory and fewer parameters without a significant reduction in the quality of the depth estimation.
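To make the link between parameter count and memory concrete, here is a small back-of-the-envelope sketch. The layer configurations (`wide`, `slim`) and helper names are hypothetical and are not taken from the paper. It counts only weight storage at 32-bit precision and ignores activations, which also contribute to RAM usage at inference time.

```python
def conv_params(c_in, c_out, k, bias=True):
    """Number of learnable parameters in a k x k convolution layer."""
    return k * k * c_in * c_out + (c_out if bias else 0)

def model_footprint(layers, bytes_per_param=4):
    """Return (total parameters, bytes of weight storage) for a list of
    (c_in, c_out, k) convolution layers; 4 bytes per parameter assumes float32."""
    total = sum(conv_params(c_in, c_out, k) for c_in, c_out, k in layers)
    return total, total * bytes_per_param

# Hypothetical encoder stacks: halving the channel widths cuts the weights roughly 4x.
wide = [(3, 64, 3), (64, 128, 3), (128, 256, 3)]
slim = [(3, 32, 3), (32, 64, 3), (64, 128, 3)]

wide_params, wide_bytes = model_footprint(wide)
slim_params, slim_bytes = model_footprint(slim)
```

Because a convolution's parameter count scales with the product of its input and output channels, reducing channel widths is one common way to shrink a model, at a possible cost in accuracy.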
