Unknown

Dataset Information

0

Online supervised attention-based recurrent depth estimation from monocular video.


ABSTRACT: Autonomous driving highly depends on depth information for safe driving. Recently, major improvements have been taken towards improving both supervised and self-supervised methods for depth reconstruction. However, most of the current approaches focus on single frame depth estimation, where quality limit is hard to beat due to limitations of supervised learning of deep neural networks in general. One of the way to improve quality of existing methods is to utilize temporal information from frame sequences. In this paper, we study intelligent ways of integrating recurrent block in common supervised depth estimation pipeline. We propose a novel method, which takes advantage of the convolutional gated recurrent unit (convGRU) and convolutional long short-term memory (convLSTM). We compare use of convGRU and convLSTM blocks and determine the best model for real-time depth estimation task. We carefully study training strategy and provide new deep neural networks architectures for the task of depth estimation from monocular video using information from past frames based on attention mechanism. We demonstrate the efficiency of exploiting temporal information by comparing our best recurrent method with existing image-based and video-based solutions for monocular depth reconstruction.

SUBMITTER: Maslov D 

PROVIDER: S-EPMC7924529 | biostudies-literature | 2020

REPOSITORIES: biostudies-literature

altmetric image

Publications

Online supervised attention-based recurrent depth estimation from monocular video.

Maslov Dmitrii D   Makarov Ilya I  

PeerJ. Computer science 20201123


Autonomous driving highly depends on depth information for safe driving. Recently, major improvements have been taken towards improving both supervised and self-supervised methods for depth reconstruction. However, most of the current approaches focus on single frame depth estimation, where quality limit is hard to beat due to limitations of supervised learning of deep neural networks in general. One of the way to improve quality of existing methods is to utilize temporal information from frame  ...[more]

Similar Datasets

| S-EPMC10505201 | biostudies-literature
| S-EPMC9680869 | biostudies-literature
| S-EPMC9768171 | biostudies-literature
| S-EPMC4202214 | biostudies-literature
| S-EPMC6402382 | biostudies-literature
| S-EPMC10616293 | biostudies-literature
| S-EPMC3970055 | biostudies-other
| S-EPMC10583924 | biostudies-literature
| S-EPMC9489205 | biostudies-literature
| S-EPMC8099131 | biostudies-literature