Multiview video coding for 3D telepresence

In this thesis, we develope several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion f...

Full description

Saved in:
Bibliographic Details
Main Author: Yang, Wenxian
Other Authors: King Ngi Ngan
Format: Theses and Dissertations
Published: 2008
Subjects:
Online Access:https://hdl.handle.net/10356/2466
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
id sg-ntu-dr.10356-2466
record_format dspace
spelling sg-ntu-dr.10356-24662023-03-04T00:43:37Z Multiview video coding for 3D telepresence Yang, Wenxian King Ngi Ngan Cai Jianfei School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition In this thesis, we develope several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion fields. After confirming its performance by comparing with the existing algorithms, the separate regularization scheme is extended to a joint estimation scheme that calculates two disparity fields and two motion fields for two successive image pairs simultaneously. Secondly, we develop an MPEG-4 compatible multiview video encoder which integrates with the joint disparity and motion estimation scheme. Besides, various aspects of the encoder are investigated, including a comparative study of several view-level frame prediction structures and the rate control algorithm for multiview video coding. Thirdly, we propose a framework of scalable MVC using 4D wavelet. The wavelet-based multiview video coding system is more flexible and extendable, and it can provide temporal, spatial, SNR (Signal-to-Noise-Ratio) and view scalabilities. In the wavelet MVC scheme, general lifting structure is introduced, based on which we propose a disparity compensated view filtering (DCVF) for wavelet decomposition along the view direction. We also propose a flexible decomposition structure based on the analysis of the temporal correlation and the view correlation. In addition, we develop the entire codec, including the design of the macroblock coding modes, subband coding, rate control, etc. The proposed scheme overcomes a variety of limitations exhibited by existing methods and provides a natural and elegant solution to MVC. DOCTOR OF PHILOSOPHY (SCE) 2008-09-17T09:03:47Z 2008-09-17T09:03:47Z 2006 2006 Thesis Yang, W. X. (2006). Multiview video coding for 3D telepresence. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/2466 10.32657/10356/2466 Nanyang Technological University application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Yang, Wenxian
Multiview video coding for 3D telepresence
description In this thesis, we develope several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion fields. After confirming its performance by comparing with the existing algorithms, the separate regularization scheme is extended to a joint estimation scheme that calculates two disparity fields and two motion fields for two successive image pairs simultaneously. Secondly, we develop an MPEG-4 compatible multiview video encoder which integrates with the joint disparity and motion estimation scheme. Besides, various aspects of the encoder are investigated, including a comparative study of several view-level frame prediction structures and the rate control algorithm for multiview video coding. Thirdly, we propose a framework of scalable MVC using 4D wavelet. The wavelet-based multiview video coding system is more flexible and extendable, and it can provide temporal, spatial, SNR (Signal-to-Noise-Ratio) and view scalabilities. In the wavelet MVC scheme, general lifting structure is introduced, based on which we propose a disparity compensated view filtering (DCVF) for wavelet decomposition along the view direction. We also propose a flexible decomposition structure based on the analysis of the temporal correlation and the view correlation. In addition, we develop the entire codec, including the design of the macroblock coding modes, subband coding, rate control, etc. The proposed scheme overcomes a variety of limitations exhibited by existing methods and provides a natural and elegant solution to MVC.
author2 King Ngi Ngan
author_facet King Ngi Ngan
Yang, Wenxian
format Theses and Dissertations
author Yang, Wenxian
author_sort Yang, Wenxian
title Multiview video coding for 3D telepresence
title_short Multiview video coding for 3D telepresence
title_full Multiview video coding for 3D telepresence
title_fullStr Multiview video coding for 3D telepresence
title_full_unstemmed Multiview video coding for 3D telepresence
title_sort multiview video coding for 3d telepresence
publishDate 2008
url https://hdl.handle.net/10356/2466
_version_ 1759855253357854720