Visual data processing over structured dictionaries with applications in light field imaging

Structure exists in all forms of visual data in the nature. In this thesis, we focus on modeling these data structures using sparse representation techniques over a redundant dictionary, which has been proven as an efficient tool in numerous visual signal processing applications. We propose several...

Full description

Saved in:

Bibliographic Details
Main Author:	Chen, Jie
Other Authors:	Chau Lap Pui
Format:	Theses and Dissertations
Language:	English
Published:	2016
Subjects:	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Online Access:	https://hdl.handle.net/10356/68594
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-68594
record_format	dspace
spelling	sg-ntu-dr.10356-685942023-07-04T16:37:09Z Visual data processing over structured dictionaries with applications in light field imaging Chen, Jie Chau Lap Pui School of Electrical and Electronic Engineering Centre for Signal Processing DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Structure exists in all forms of visual data in the nature. In this thesis, we focus on modeling these data structures using sparse representation techniques over a redundant dictionary, which has been proven as an efficient tool in numerous visual signal processing applications. We propose several types of redundant dictionaries with specially designed structures that adapts to the unique application scenarios. Different sparse coding and dictionary training strategies are investigated. Considerations are given to challenging issues such as multi-scale dictionary cross-scale interactions, dictionary disparity segment de-correlation, perspective-shifted dictionary sparse coding acceleration etc. All efforts aim to provide a more powerful frame for representing complicated visual data structures. For sparse signal representation, the sparsity across the scales is a promising yet under investigated direction. In this thesis, we design a multi-scale sparse representation scheme to explore such potential. A multi-scale dictionary (MD) structure is designed. A Cross-scale Matching Pursuit (CMP) algorithm is proposed for multi-scale sparse coding. Two dictionary learning methods: Cross-scale Cooperative Learning MD/CCL), and Cross-scale Atom Clustering (MD/CAC) are proposed with each focusing on one of the two important attributes of an efficient multi-scale dictionary: the similarity, and uniqueness of corresponding atoms in different scales. We analyze and compare their di erent advantages in the application of image denoising under different noise levels, where both methods produce state-of-the-art denoising results. The light field (LF) is a function that describes the intensities of light rays in all possible propagation directions. The LF contains large volumes of visual information that can provide a comprehensive understanding of the 3D environment of the scene. In this thesis, a light field dictionary (LFD) based on perspective-shifting is proposed for sparse representation of the highly correlated light field. A two-stage coding algorithm is proposed which uses the Winner-Take-All (WTA) hashing strategy to narrow done the search range for light field sparse coding. The algorithm proves to be able to increase the coding efficiency by almost three times and keep the reconstruction quality almost the same with the original OMP coding. A compressed sensing framework is proposed for the sampling and reconstruction of a high resolution light field based on a coded aperture camera. Two separate methods, i.e., Sub-Aperture Scan (SAS) and Normalized Fluctuation (NF) are proposed to acquire/calculate the scene disparity, which will be used during the light field reconstruction with the proposed disparity-aware dictionary. A hardware implementation of the proposed light field acquisition/reconstruction scheme is carried out. Both quantitative and qualitative evaluation shows the proposed methods produce state-of- the-art performance in both reconstruction quality and computation efficiency. Then, a light field compression framework based on the LFD is proposed for efficient storage and transmission of the bulky LF data. A highly efficient adaptive guided filtering algorithm is also proposed for the LF disparity/depth map post-processing. Both Quantitative and qualitative simulations validate the efficiency of the proposed methods. DOCTOR OF PHILOSOPHY (EEE) 2016-05-27T07:56:59Z 2016-05-27T07:56:59Z 2016 Thesis https://hdl.handle.net/10356/68594 10.32657/10356/68594 en 158 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Chen, Jie Visual data processing over structured dictionaries with applications in light field imaging
description	Structure exists in all forms of visual data in the nature. In this thesis, we focus on modeling these data structures using sparse representation techniques over a redundant dictionary, which has been proven as an efficient tool in numerous visual signal processing applications. We propose several types of redundant dictionaries with specially designed structures that adapts to the unique application scenarios. Different sparse coding and dictionary training strategies are investigated. Considerations are given to challenging issues such as multi-scale dictionary cross-scale interactions, dictionary disparity segment de-correlation, perspective-shifted dictionary sparse coding acceleration etc. All efforts aim to provide a more powerful frame for representing complicated visual data structures. For sparse signal representation, the sparsity across the scales is a promising yet under investigated direction. In this thesis, we design a multi-scale sparse representation scheme to explore such potential. A multi-scale dictionary (MD) structure is designed. A Cross-scale Matching Pursuit (CMP) algorithm is proposed for multi-scale sparse coding. Two dictionary learning methods: Cross-scale Cooperative Learning MD/CCL), and Cross-scale Atom Clustering (MD/CAC) are proposed with each focusing on one of the two important attributes of an efficient multi-scale dictionary: the similarity, and uniqueness of corresponding atoms in different scales. We analyze and compare their di erent advantages in the application of image denoising under different noise levels, where both methods produce state-of-the-art denoising results. The light field (LF) is a function that describes the intensities of light rays in all possible propagation directions. The LF contains large volumes of visual information that can provide a comprehensive understanding of the 3D environment of the scene. In this thesis, a light field dictionary (LFD) based on perspective-shifting is proposed for sparse representation of the highly correlated light field. A two-stage coding algorithm is proposed which uses the Winner-Take-All (WTA) hashing strategy to narrow done the search range for light field sparse coding. The algorithm proves to be able to increase the coding efficiency by almost three times and keep the reconstruction quality almost the same with the original OMP coding. A compressed sensing framework is proposed for the sampling and reconstruction of a high resolution light field based on a coded aperture camera. Two separate methods, i.e., Sub-Aperture Scan (SAS) and Normalized Fluctuation (NF) are proposed to acquire/calculate the scene disparity, which will be used during the light field reconstruction with the proposed disparity-aware dictionary. A hardware implementation of the proposed light field acquisition/reconstruction scheme is carried out. Both quantitative and qualitative evaluation shows the proposed methods produce state-of- the-art performance in both reconstruction quality and computation efficiency. Then, a light field compression framework based on the LFD is proposed for efficient storage and transmission of the bulky LF data. A highly efficient adaptive guided filtering algorithm is also proposed for the LF disparity/depth map post-processing. Both Quantitative and qualitative simulations validate the efficiency of the proposed methods.
author2	Chau Lap Pui
author_facet	Chau Lap Pui Chen, Jie
format	Theses and Dissertations
author	Chen, Jie
author_sort	Chen, Jie
title	Visual data processing over structured dictionaries with applications in light field imaging
title_short	Visual data processing over structured dictionaries with applications in light field imaging
title_full	Visual data processing over structured dictionaries with applications in light field imaging
title_fullStr	Visual data processing over structured dictionaries with applications in light field imaging
title_full_unstemmed	Visual data processing over structured dictionaries with applications in light field imaging
title_sort	visual data processing over structured dictionaries with applications in light field imaging
publishDate	2016
url	https://hdl.handle.net/10356/68594
_version_	1772828504996446208

Visual data processing over structured dictionaries with applications in light field imaging

Similar Items