Reconstruct 3D information from a single image

The computer vision systems today may be advance but the problem of 3d reconstruction from a single two dimensional image was still considered as an extremely challenging task. On the contrary, we humans could easily reconstruct 3d information from a single two dimensional image. This was because hu...

Full description

Saved in:
Bibliographic Details
Main Author: Phua, Chuan Leong.
Other Authors: He Ying
Format: Final Year Project
Language:English
Published: 2012
Subjects:
Online Access:http://hdl.handle.net/10356/48467
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-48467
record_format dspace
spelling sg-ntu-dr.10356-484672023-03-03T20:48:54Z Reconstruct 3D information from a single image Phua, Chuan Leong. He Ying School of Computer Engineering Centre for Advanced Media Technology DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision The computer vision systems today may be advance but the problem of 3d reconstruction from a single two dimensional image was still considered as an extremely challenging task. On the contrary, we humans could easily reconstruct 3d information from a single two dimensional image. This was because humans made use of various visual cues from a single two dimensional image and related these visual cues together in order to be able to visualize 3d information. This marked the objective of this project which was to create a program with similar ability like humans to be able to reconstruct 3d information from a single two dimensional image. All images contained many different scenes and objects taken at various angles and orientations. Therefore the adopted algorithm made a general assumption that the environment was made up of a number of small planes. There were no other explicit assumptions made on the scene structure so as to allow the adopted algorithm to capture as much details of the 3d environment as possible. The adopted algorithm used the superpixel segmentation algorithm where a single image was divided into smaller homogenous patch and a machine learning algorithm, the Markov Random Field (MRF) was used to infer a set of plane parameters that captures both the 3d orientation and 3d location of these patches of superpixels. The MRF which was trained via supervised learning, models the relationship between different parts of the image, determines image occlusions and captures various monocular cues used by humans. The adopted algorithm produced relatively visually pleasing VRML output at a reasonable speed. However, there was still some room for improvement in terms of the overall output quality and speed. Therefore, an option was explored to allow the user to tune the program to either have a faster computational speed or have a higher quality output. The ease of use and user-friendliness of the program were also taken into consideration during the development of program where the target audience need not be computer savvy. Bachelor of Engineering (Computer Science) 2012-04-24T06:36:53Z 2012-04-24T06:36:53Z 2012 2012 Final Year Project (FYP) http://hdl.handle.net/10356/48467 en Nanyang Technological University 57 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Phua, Chuan Leong.
Reconstruct 3D information from a single image
description The computer vision systems today may be advance but the problem of 3d reconstruction from a single two dimensional image was still considered as an extremely challenging task. On the contrary, we humans could easily reconstruct 3d information from a single two dimensional image. This was because humans made use of various visual cues from a single two dimensional image and related these visual cues together in order to be able to visualize 3d information. This marked the objective of this project which was to create a program with similar ability like humans to be able to reconstruct 3d information from a single two dimensional image. All images contained many different scenes and objects taken at various angles and orientations. Therefore the adopted algorithm made a general assumption that the environment was made up of a number of small planes. There were no other explicit assumptions made on the scene structure so as to allow the adopted algorithm to capture as much details of the 3d environment as possible. The adopted algorithm used the superpixel segmentation algorithm where a single image was divided into smaller homogenous patch and a machine learning algorithm, the Markov Random Field (MRF) was used to infer a set of plane parameters that captures both the 3d orientation and 3d location of these patches of superpixels. The MRF which was trained via supervised learning, models the relationship between different parts of the image, determines image occlusions and captures various monocular cues used by humans. The adopted algorithm produced relatively visually pleasing VRML output at a reasonable speed. However, there was still some room for improvement in terms of the overall output quality and speed. Therefore, an option was explored to allow the user to tune the program to either have a faster computational speed or have a higher quality output. The ease of use and user-friendliness of the program were also taken into consideration during the development of program where the target audience need not be computer savvy.
author2 He Ying
author_facet He Ying
Phua, Chuan Leong.
format Final Year Project
author Phua, Chuan Leong.
author_sort Phua, Chuan Leong.
title Reconstruct 3D information from a single image
title_short Reconstruct 3D information from a single image
title_full Reconstruct 3D information from a single image
title_fullStr Reconstruct 3D information from a single image
title_full_unstemmed Reconstruct 3D information from a single image
title_sort reconstruct 3d information from a single image
publishDate 2012
url http://hdl.handle.net/10356/48467
_version_ 1759856672117882880