Improve depth estimation based on deep learning and information fusion

Depth estimation is a highly focused research direction in the field of computer vision, and it has seen rapid development and a wealth of research results in recent years. However, current mainstream depth estimation technologies rely on computationally expensive deep learning methods or direct dep...

全面介紹

Saved in:
書目詳細資料
主要作者: Xue, Mingqing
其他作者: Mao Kezhi
格式: Thesis-Master by Coursework
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/173222
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Nanyang Technological University
語言: English
實物特徵
總結:Depth estimation is a highly focused research direction in the field of computer vision, and it has seen rapid development and a wealth of research results in recent years. However, current mainstream depth estimation technologies rely on computationally expensive deep learning methods or direct depth acquisition technologies that require costly, specialized sensor equipment, such as RGB-D cameras and LIDAR. These technologies have some practical limitations, such as the need for high computational power and reliance on specialized hardware. In response to these issues, this study proposes a depth estimation algorithm suitable for deployment in smartphone applications, aiming to achieve fast and accurate monocular depth estimation on low-power devices. We adopted an innovative approach that combines deep learning techniques with classic geometric depth estimation methods (such as SfM), leveraging geometric constraints to reduce computational complexity and runtime. This hybrid approach not only optimizes the efficiency of depth estimation but also maintains the accuracy and robustness of the results. Through a series of rigorous experimental designs and validations, the research results demonstrate the advantages of the proposed method over traditional algorithms in low-power environments. This research not only provides a new academic perspective but also has broad application prospects in practical applications, especially in the field of mobile device applications. With the continuous improvement of smartphone processing capabilities and further optimization of deep learning technologies, it is expected that the method proposed by this study will provide a new solution for mobile visual applications, pushing the ability of smartphones in three-dimensional space perception to a new height.