Global ETD Search

Return to search

Monocular Depth Estimation Using Deep Convolutional Neural Networks

For a long time stereo-cameras have been deployed in visual Simultaneous Localization And Mapping (SLAM) systems to gain 3D information. Even though stereo-cameras show good performance, the main disadvantage is the complex and expensive hardware setup it requires, which limits the use of the system. A simpler and cheaper alternative are monocular cameras, however monocular images lack the important depth information. Recent works have shown that having access to depth maps in monocular SLAM system is beneficial since they can be used to improve the 3D reconstruction. This work proposes a deep neural network that predicts dense high-resolution depth maps from monocular RGB images by casting the problem as a supervised regression task. The network architecture follows an encoder-decoder structure in which multi-scale information is captured and skip-connections are used to recover details. The network is trained and evaluated on the KITTI dataset achieving results comparable to state-of-the-art methods. With further development, this network shows good potential to be incorporated in a monocular SLAM system to improve the 3D reconstruction.

http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-159981

pixelwise depth prediction

encoder-decoder network

Signal Processing

Signalbehandling

Identifer	oai:union.ndltd.org:UPSALLA1/oai:DiVA.org:liu-159981
Date	January 2019
Creators	Larsson, Susanna
Publisher	Linköpings universitet, Datorseende
Source Sets	DiVA Archive at Upsalla University
Language	English
Detected Language	English
Type	Student thesis, info:eu-repo/semantics/bachelorThesis, text
Format	application/pdf
Rights	info:eu-repo/semantics/openAccess

Page generated in 0.0024 seconds

Monocular Depth Estimation Using Deep Convolutional Neural Networks

Description

Links & Downloads

Tags

Additional Fields