Multiview video coding for 3D telepresence

In this thesis, we develope several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion f...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Yang, Wenxian
مؤلفون آخرون: King Ngi Ngan
التنسيق: Theses and Dissertations
منشور في: 2008
الموضوعات:
الوصول للمادة أونلاين:https://hdl.handle.net/10356/2466
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
id sg-ntu-dr.10356-2466
record_format dspace
spelling sg-ntu-dr.10356-24662023-03-04T00:43:37Z Multiview video coding for 3D telepresence Yang, Wenxian King Ngi Ngan Cai Jianfei School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition In this thesis, we develope several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion fields. After confirming its performance by comparing with the existing algorithms, the separate regularization scheme is extended to a joint estimation scheme that calculates two disparity fields and two motion fields for two successive image pairs simultaneously. Secondly, we develop an MPEG-4 compatible multiview video encoder which integrates with the joint disparity and motion estimation scheme. Besides, various aspects of the encoder are investigated, including a comparative study of several view-level frame prediction structures and the rate control algorithm for multiview video coding. Thirdly, we propose a framework of scalable MVC using 4D wavelet. The wavelet-based multiview video coding system is more flexible and extendable, and it can provide temporal, spatial, SNR (Signal-to-Noise-Ratio) and view scalabilities. In the wavelet MVC scheme, general lifting structure is introduced, based on which we propose a disparity compensated view filtering (DCVF) for wavelet decomposition along the view direction. We also propose a flexible decomposition structure based on the analysis of the temporal correlation and the view correlation. In addition, we develop the entire codec, including the design of the macroblock coding modes, subband coding, rate control, etc. The proposed scheme overcomes a variety of limitations exhibited by existing methods and provides a natural and elegant solution to MVC. DOCTOR OF PHILOSOPHY (SCE) 2008-09-17T09:03:47Z 2008-09-17T09:03:47Z 2006 2006 Thesis Yang, W. X. (2006). Multiview video coding for 3D telepresence. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/2466 10.32657/10356/2466 Nanyang Technological University application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Yang, Wenxian
Multiview video coding for 3D telepresence
description In this thesis, we develope several novel algorithms and two complete encoders for multiview video coding (MVC). The major contributions can be summarized in three aspects. In particular, we first propose an edge-preserving regularization scheme to calculate either 1D disparity fields or 2D motion fields. After confirming its performance by comparing with the existing algorithms, the separate regularization scheme is extended to a joint estimation scheme that calculates two disparity fields and two motion fields for two successive image pairs simultaneously. Secondly, we develop an MPEG-4 compatible multiview video encoder which integrates with the joint disparity and motion estimation scheme. Besides, various aspects of the encoder are investigated, including a comparative study of several view-level frame prediction structures and the rate control algorithm for multiview video coding. Thirdly, we propose a framework of scalable MVC using 4D wavelet. The wavelet-based multiview video coding system is more flexible and extendable, and it can provide temporal, spatial, SNR (Signal-to-Noise-Ratio) and view scalabilities. In the wavelet MVC scheme, general lifting structure is introduced, based on which we propose a disparity compensated view filtering (DCVF) for wavelet decomposition along the view direction. We also propose a flexible decomposition structure based on the analysis of the temporal correlation and the view correlation. In addition, we develop the entire codec, including the design of the macroblock coding modes, subband coding, rate control, etc. The proposed scheme overcomes a variety of limitations exhibited by existing methods and provides a natural and elegant solution to MVC.
author2 King Ngi Ngan
author_facet King Ngi Ngan
Yang, Wenxian
format Theses and Dissertations
author Yang, Wenxian
author_sort Yang, Wenxian
title Multiview video coding for 3D telepresence
title_short Multiview video coding for 3D telepresence
title_full Multiview video coding for 3D telepresence
title_fullStr Multiview video coding for 3D telepresence
title_full_unstemmed Multiview video coding for 3D telepresence
title_sort multiview video coding for 3d telepresence
publishDate 2008
url https://hdl.handle.net/10356/2466
_version_ 1759855253357854720