Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation

Monocular vision techniques use information taken from a single moving camera in inferring the 3-D structure of a camera observers environment. Compared to polynocular vision techniques, monocular vision techniques require less hardware and information about the camera geometry, in order to estimate...

Full description

Saved in:
Bibliographic Details
Main Author: Ilao, Joel P.
Format: text
Language:English
Published: Animo Repository 2008
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_masteral/3878
https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10716/viewcontent/CDTG004775_P.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_masteral-10716
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_masteral-107162022-07-16T01:37:26Z Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation Ilao, Joel P. Monocular vision techniques use information taken from a single moving camera in inferring the 3-D structure of a camera observers environment. Compared to polynocular vision techniques, monocular vision techniques require less hardware and information about the camera geometry, in order to estimate relative depth. However, monocular vision is more prone to image noise and is computationally expensive. This research proposes an algorithm for depth estimation for use in mobile robotic navigation. Depth estimation in real-world image sequences of a visual scene captured by a single moving camera using optic flow information still suffer from accuracy problems due imperfection in optic flow estimates. Since the Structure from Motion problem of Monocular Vision is regarded as non-linear, the initial optic flow estimate, hence, is further enhanced using a novel approach of applying Extended Kalman Filter formulation on the corresponding divergence fields. Raw optic flow estimates of consecutive frames in any given image sequence were computed using the pyramidal Lucas-Kanade algorithm. The resulting optic flow field is then used as basis for estimating the 3-D scene structure via construction of a depth/range map. The depth maps were constructed following a Camus formulation assuming monocular image sequences captured by a camera undergoing uniform forward translation. These depth maps were refined further by application of median filtering as a post-processing mechanism. Standard tests on synthetic and real-world images indicate that the Extended Kalman Filter has been effective in making the depth estimation process consistent, most especially if the optic flow estimates of the initial frame were made very close to the ideal (57.8% and 14.7% reduction in the standard deviation of divergence magnitude error values for the Kalman-filtered divergence data with and without ground truth values, respectively, over that which used only raw optic flow data). The system developed, however, cannot still effectively apply to real-world image data due to limiting assumptions on the observer motion type and imaged surface orientation relative to the camera observers focal axis, as well as lack of textural content and ambient lighting noise. 2008-01-01T08:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etd_masteral/3878 https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10716/viewcontent/CDTG004775_P.pdf Master's Theses English Animo Repository Computer vision Image segmentation Computer Sciences
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Computer vision
Image segmentation
Computer Sciences
spellingShingle Computer vision
Image segmentation
Computer Sciences
Ilao, Joel P.
Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
description Monocular vision techniques use information taken from a single moving camera in inferring the 3-D structure of a camera observers environment. Compared to polynocular vision techniques, monocular vision techniques require less hardware and information about the camera geometry, in order to estimate relative depth. However, monocular vision is more prone to image noise and is computationally expensive. This research proposes an algorithm for depth estimation for use in mobile robotic navigation. Depth estimation in real-world image sequences of a visual scene captured by a single moving camera using optic flow information still suffer from accuracy problems due imperfection in optic flow estimates. Since the Structure from Motion problem of Monocular Vision is regarded as non-linear, the initial optic flow estimate, hence, is further enhanced using a novel approach of applying Extended Kalman Filter formulation on the corresponding divergence fields. Raw optic flow estimates of consecutive frames in any given image sequence were computed using the pyramidal Lucas-Kanade algorithm. The resulting optic flow field is then used as basis for estimating the 3-D scene structure via construction of a depth/range map. The depth maps were constructed following a Camus formulation assuming monocular image sequences captured by a camera undergoing uniform forward translation. These depth maps were refined further by application of median filtering as a post-processing mechanism. Standard tests on synthetic and real-world images indicate that the Extended Kalman Filter has been effective in making the depth estimation process consistent, most especially if the optic flow estimates of the initial frame were made very close to the ideal (57.8% and 14.7% reduction in the standard deviation of divergence magnitude error values for the Kalman-filtered divergence data with and without ground truth values, respectively, over that which used only raw optic flow data). The system developed, however, cannot still effectively apply to real-world image data due to limiting assumptions on the observer motion type and imaged surface orientation relative to the camera observers focal axis, as well as lack of textural content and ambient lighting noise.
format text
author Ilao, Joel P.
author_facet Ilao, Joel P.
author_sort Ilao, Joel P.
title Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_short Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_full Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_fullStr Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_full_unstemmed Robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
title_sort robust estimation of depth perception and image segmentation of monocular image sequences with known camera translation
publisher Animo Repository
publishDate 2008
url https://animorepository.dlsu.edu.ph/etd_masteral/3878
https://animorepository.dlsu.edu.ph/context/etd_masteral/article/10716/viewcontent/CDTG004775_P.pdf
_version_ 1792202530159067136