Improve depth estimation based on deep learning and information fusion

Depth estimation is a highly focused research direction in the field of computer vision, and it has seen rapid development and a wealth of research results in recent years. However, current mainstream depth estimation technologies rely on computationally expensive deep learning methods or direct dep...

Full description

Saved in:
Bibliographic Details
Main Author: Xue, Mingqing
Other Authors: Mao Kezhi
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/173222
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Depth estimation is a highly focused research direction in the field of computer vision, and it has seen rapid development and a wealth of research results in recent years. However, current mainstream depth estimation technologies rely on computationally expensive deep learning methods or direct depth acquisition technologies that require costly, specialized sensor equipment, such as RGB-D cameras and LIDAR. These technologies have some practical limitations, such as the need for high computational power and reliance on specialized hardware. In response to these issues, this study proposes a depth estimation algorithm suitable for deployment in smartphone applications, aiming to achieve fast and accurate monocular depth estimation on low-power devices. We adopted an innovative approach that combines deep learning techniques with classic geometric depth estimation methods (such as SfM), leveraging geometric constraints to reduce computational complexity and runtime. This hybrid approach not only optimizes the efficiency of depth estimation but also maintains the accuracy and robustness of the results. Through a series of rigorous experimental designs and validations, the research results demonstrate the advantages of the proposed method over traditional algorithms in low-power environments. This research not only provides a new academic perspective but also has broad application prospects in practical applications, especially in the field of mobile device applications. With the continuous improvement of smartphone processing capabilities and further optimization of deep learning technologies, it is expected that the method proposed by this study will provide a new solution for mobile visual applications, pushing the ability of smartphones in three-dimensional space perception to a new height.