Object depth estimation from single image

With the rapid development of computer vision technology, depth estimation is widely used in autonomous driving. Combined with object detection, it can achieve the effect of pseudo-laser detection or three-dimensional reconstruction; combined with semantic segmentation, it can be extended from 2...

Bibliographic Details
Main Author:	Long, Zhongtian
Other Authors:	Mao Kezhi
Format:	Thesis-Master by Coursework
Language:	English
Published:	Nanyang Technological University 2023
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/167156
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-167156
record_format	dspace
spelling	sg-ntu-dr.10356-167156 2023-07-04T16:46:09Z Object depth estimation from single image Long, Zhongtian Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Master of Science (Computer Control and Automation) 2023-05-15T06:57:23Z 2023-05-15T06:57:23Z 2023 Thesis-Master by Coursework Long, Z. (2023). Object depth estimation from single image. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/167156 https://hdl.handle.net/10356/167156 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Long, Zhongtian Object depth estimation from single image
description	With the rapid development of computer vision technology, depth estimation is widely used in autonomous driving. Combined with object detection, it can achieve the effect of pseudo-laser detection or three-dimensional reconstruction; combined with semantic segmentation, it can be extended from 2D to 3D to obtain semantic and depth information for each pixel, for example in lane line detection; in addition, depth estimation can be used for general obstacle detection[1]. Depth estimation is therefore an important visual task in autonomous driving. Monocular[2] depth estimation estimates depth from a single visible-light photo, or from a series of visible-light photos taken simultaneously of the same scene. The field also includes methods based on monocular vision, stereo matching, multi-view stereoscopic vision, and 3D reconstruction. This dissertation first introduces some basic technology and several commonly used methods for depth estimation. It then presents a comprehensive study of monocular depth estimation using the Monodepth2 model[25]. The Monodepth2 model is explained in detail, including its network structure, components, and loss function. The experimental section describes the environment setup and the datasets: the model is pre-trained on the Cityscapes dataset[28], then tested and fine-tuned on the KITTI dataset[27]. The model is evaluated using accepted depth estimation metrics such as MSE, MAE, and Abs.rel. The results are compared across three training techniques: monocular training, stereo training, and monocular plus stereo training[25]. The replicated experimental results are found to be nearly identical to the original experimental results. Based on the experimental findings, a direction and method for enhancing the Monodepth2 model in future studies are suggested. Overall, this study offers insight into monocular depth estimation using the classic Monodepth2 method.
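The three metrics named in the description can be illustrated with a minimal sketch. It assumes the standard definitions (mean squared error, mean absolute error, and absolute relative error averaged over pixels); the record does not give the thesis's exact formulas. The function name `depth_errors` and the rule of skipping pixels whose ground-truth depth is not positive are assumptions made for illustration only.

```python
def depth_errors(pred, gt):
    """Return (mse, mae, abs_rel) for flat sequences of depths in metres.

    Assumption: pixels with ground-truth depth <= 0 carry no measurement
    and are skipped before averaging.
    """
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth depths")
    n = len(pairs)
    mse = sum((p - g) ** 2 for p, g in pairs) / n        # mean squared error
    mae = sum(abs(p - g) for p, g in pairs) / n          # mean absolute error
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n  # error relative to true depth
    return mse, mae, abs_rel


# Hypothetical values: the last pixel has no ground truth and is ignored.
predicted = [2.0, 4.0, 9.0, 5.0]
ground_truth = [1.0, 4.0, 10.0, 0.0]
mse, mae, abs_rel = depth_errors(predicted, ground_truth)
```

Abs.rel divides each error by the true depth, so a 1 m error on a nearby object counts for more than a 1 m error on a distant one; MSE and MAE weight both equally.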
author2	Mao Kezhi
author_facet	Mao Kezhi Long, Zhongtian
format	Thesis-Master by Coursework
author	Long, Zhongtian
author_sort	Long, Zhongtian
title	Object depth estimation from single image
title_short	Object depth estimation from single image
title_full	Object depth estimation from single image
title_fullStr	Object depth estimation from single image
title_full_unstemmed	Object depth estimation from single image
title_sort	object depth estimation from single image
publisher	Nanyang Technological University
publishDate	2023
url	https://hdl.handle.net/10356/167156
_version_	1772828218760364032
