Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation

Monocular vision techniques use information taken from a single moving camera in inferring the 3-D structure of a camera observers environment. Compared to polynocular vision techniques, monocular vision techniques require less hardware and information about the camera geometry, in order to estimate...

Full description

Saved in:

Bibliographic Details
Main Author:	Ilao, Joel P.
Format:	text
Language:	English
Published:	Animo Repository 2008
Subjects:	Computer vision Image segmentation Computer Sciences
Online Access:	https://animorepository.dlsu.edu.ph/etd_masteral/3878 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10716/viewcontent/CDTG004775_P.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University
Language:	English

id	oai:animorepository.dlsu.edu.ph:etd_masteral-10716
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:etd_masteral-107162022-07-16T01:37:26Z Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation Ilao, Joel P. Monocular vision techniques use information taken from a single moving camera in inferring the 3-D structure of a camera observers environment. Compared to polynocular vision techniques, monocular vision techniques require less hardware and information about the camera geometry, in order to estimate relative depth. However, monocular vision is more prone to image noise and is computationally expensive. This research proposes an algorithm for depth estimation for use in mobile robotic navigation. Depth estimation in real-world image sequences of a visual scene captured by a single moving camera using optic flow information still suffer from accuracy problems due imperfection in optic flow estimates. Since the Structure from Motion problem of Monocular Vision is regarded as non-linear, the initial optic flow estimate, hence, is further enhanced using a novel approach of applying Extended Kalman Filter formulation on the corresponding divergence fields. Raw optic flow estimates of consecutive frames in any given image sequence were computed using the pyramidal Lucas-Kanade algorithm. The resulting optic flow field is then used as basis for estimating the 3-D scene structure via construction of a depth/range map. The depth maps were constructed following a Camus formulation assuming monocular image sequences captured by a camera undergoing uniform forward translation. These depth maps were refined further by application of median filtering as a post-processing mechanism. Standard tests on synthetic and real-world images indicate that the Extended Kalman Filter has been effective in making the depth estimation process consistent, most especially if the optic flow estimates of the initial frame were made very close to the ideal (57.8% and 14.7% reduction in the standard deviation of divergence magnitude error values for the Kalman-filtered divergence data with and without ground truth values, respectively, over that which used only raw optic flow data). The system developed, however, cannot still effectively apply to real-world image data due to limiting assumptions on the observer motion type and imaged surface orientation relative to the camera observers focal axis, as well as lack of textural content and ambient lighting noise. 2008-01-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etd_masteral/3878 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10716/viewcontent/CDTG004775_P.pdf Master's Theses English Animo Repository Computer vision Image segmentation Computer Sciences
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
language	English
topic	Computer vision Image segmentation Computer Sciences
spellingShingle	Computer vision Image segmentation Computer Sciences Ilao, Joel P. Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
description	Monocular vision techniques use information taken from a single moving camera in inferring the 3-D structure of a camera observers environment. Compared to polynocular vision techniques, monocular vision techniques require less hardware and information about the camera geometry, in order to estimate relative depth. However, monocular vision is more prone to image noise and is computationally expensive. This research proposes an algorithm for depth estimation for use in mobile robotic navigation. Depth estimation in real-world image sequences of a visual scene captured by a single moving camera using optic flow information still suffer from accuracy problems due imperfection in optic flow estimates. Since the Structure from Motion problem of Monocular Vision is regarded as non-linear, the initial optic flow estimate, hence, is further enhanced using a novel approach of applying Extended Kalman Filter formulation on the corresponding divergence fields. Raw optic flow estimates of consecutive frames in any given image sequence were computed using the pyramidal Lucas-Kanade algorithm. The resulting optic flow field is then used as basis for estimating the 3-D scene structure via construction of a depth/range map. The depth maps were constructed following a Camus formulation assuming monocular image sequences captured by a camera undergoing uniform forward translation. These depth maps were refined further by application of median filtering as a post-processing mechanism. Standard tests on synthetic and real-world images indicate that the Extended Kalman Filter has been effective in making the depth estimation process consistent, most especially if the optic flow estimates of the initial frame were made very close to the ideal (57.8% and 14.7% reduction in the standard deviation of divergence magnitude error values for the Kalman-filtered divergence data with and without ground truth values, respectively, over that which used only raw optic flow data). The system developed, however, cannot still effectively apply to real-world image data due to limiting assumptions on the observer motion type and imaged surface orientation relative to the camera observers focal axis, as well as lack of textural content and ambient lighting noise.
format	text
author	Ilao, Joel P.
author_facet	Ilao, Joel P.
author_sort	Ilao, Joel P.
title	Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_short	Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_full	Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_fullStr	Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_full_unstemmed	Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_sort	robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
publisher	Animo Repository
publishDate	2008
url	https://animorepository.dlsu.edu.ph/etd_masteral/3878 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10716/viewcontent/CDTG004775_P.pdf
_version_	1792202530159067136

Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation

Similar Items