Multiview video coding for 3D telepresence
In this thesis, we develop several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion f...
Main Author: | Yang, Wenxian |
---|---|
Other Authors: | King Ngi Ngan |
Format: | Theses and Dissertations |
Published: | 2008 |
Subjects: | DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition |
Online Access: | https://hdl.handle.net/10356/2466 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-2466 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-2466 2023-03-04T00:43:37Z Multiview video coding for 3D telepresence Yang, Wenxian King Ngi Ngan Cai Jianfei School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition In this thesis, we develop several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion fields. After confirming its performance by comparing with the existing algorithms, the separate regularization scheme is extended to a joint estimation scheme that calculates two disparity fields and two motion fields for two successive image pairs simultaneously. Secondly, we develop an MPEG-4 compatible multiview video encoder which integrates with the joint disparity and motion estimation scheme. Besides, various aspects of the encoder are investigated, including a comparative study of several view-level frame prediction structures and the rate control algorithm for multiview video coding. Thirdly, we propose a framework of scalable MVC using 4D wavelet. The wavelet-based multiview video coding system is more flexible and extendable, and it can provide temporal, spatial, SNR (Signal-to-Noise-Ratio) and view scalabilities. In the wavelet MVC scheme, general lifting structure is introduced, based on which we propose a disparity compensated view filtering (DCVF) for wavelet decomposition along the view direction. We also propose a flexible decomposition structure based on the analysis of the temporal correlation and the view correlation. In addition, we develop the entire codec, including the design of the macroblock coding modes, subband coding, rate control, etc. The proposed scheme overcomes a variety of limitations exhibited by existing methods and provides a natural and elegant solution to MVC.
DOCTOR OF PHILOSOPHY (SCE) 2008-09-17T09:03:47Z 2008-09-17T09:03:47Z 2006 2006 Thesis Yang, W. X. (2006). Multiview video coding for 3D telepresence. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/2466 10.32657/10356/2466 Nanyang Technological University application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition |
description |
In this thesis, we develop several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. First, we propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion fields. After confirming its performance by comparing it with existing algorithms, we extend the separate regularization scheme to a joint estimation scheme that calculates two disparity fields and two motion fields for two successive image pairs simultaneously.
Secondly, we develop an MPEG-4 compatible multiview video encoder that integrates the joint disparity and motion estimation scheme. In addition, various aspects of the encoder are investigated, including a comparative study of several view-level frame prediction structures and the rate control algorithm for multiview video coding. Thirdly, we propose a framework for scalable MVC using 4D wavelets. The wavelet-based multiview video coding system is more flexible and extensible, and it can provide temporal, spatial, SNR (signal-to-noise ratio) and view scalability. In the wavelet MVC scheme, a general lifting structure is introduced, based on which we propose disparity-compensated view filtering (DCVF) for wavelet decomposition along the view direction. We also propose a flexible decomposition structure based on an analysis of the temporal correlation and the view correlation. In addition, we develop the entire codec, including the design of the macroblock coding modes, subband coding, rate control, etc. The proposed scheme overcomes a variety of limitations exhibited by existing methods and provides a natural and elegant solution to MVC. |
author2 |
King Ngi Ngan |
author_facet |
King Ngi Ngan Yang, Wenxian |
format |
Theses and Dissertations |
author |
Yang, Wenxian |
author_sort |
Yang, Wenxian |
title |
Multiview video coding for 3D telepresence |
publishDate |
2008 |
url |
https://hdl.handle.net/10356/2466 |
_version_ |
1759855253357854720 |