Distortion measure analysis for efficient block matching and face region based video coding

Video coding has been intensively studied for over one decade and a few video compression standards have been well established with wide applications. These coding paradigms or standards have a number of commonalities, the most salient one among which is the hybrid coding structure consisting of div...

Full description

Saved in:
Bibliographic Details
Main Author: Xiong, Bing
Other Authors: Charayaphan Cheroensak
Format: Theses and Dissertations
Language:English
Published: 2009
Subjects:
Online Access:https://hdl.handle.net/10356/14800
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-14800
record_format dspace
spelling sg-ntu-dr.10356-148002023-07-04T17:25:55Z Distortion measure analysis for efficient block matching and face region based video coding Xiong, Bing Charayaphan Cheroensak Zhu Ce School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Video coding has been intensively studied for over one decade and a few video compression standards have been well established with wide applications. These coding paradigms or standards have a number of commonalities, the most salient one among which is the hybrid coding structure consisting of diverse techniques to remove both the temporal and the spatial redundancy. One of the major techniques is block matching which has been used in inter-prediction like block motion estimation as well as in intraprediction such as intra-mode selection in H.264/AVC. Another important topic is bit allocation for rate control to optimize the coding performance. In this thesis, we investigate the block matching technique and face region priority based bit allocation for video coding. While the block matching based motion estimation has been widely studied, this thesis addresses the efficient block matching from the distortion measurement perspective. Specifically, three aspects on the distortion measurement analysis are considered, namely, a new multiplication-free distortion metric, transform-exempt sum of absolute Hadamard transformed differences (SATD) calculation, and a subblock-based distortion metric for efficient block matching, which are summarized in order as follows. It is known that the distortion measure function plays a pivotal role in both matching accuracy and computational complexity. To avoid multiplication operations for simpler implementation, sum of absolute difference (SAD) is normally taken as a substitute to approximate the most widely accepted benchmark, mean squared error (MSE). Although it is well known that SAD is an approximate to MSE, it is necessary to know how accurate the approximate is quantitatively, which can provide us more insights and guidance in selecting an appropriate matching criterion function and developing some new ones. However, there is a lack of such quantitative treatment to this fundamental question. In this thesis, we firstly examine the quantitative deviation of SAD from MSE. In order to reduce the deviation, a new matching criterion, namely weighted sum of absolute difference (WSAD), is thereafter proposed, which enhances the matching accuracy while still maintaining the desirable multiplication-free property. The proposed WSAD is also experimentally validated by applying it in block motion estimation for video coding, showing better rate-distortion performance than SAD. Secondly, in the latest video coding standard H.264/AVC, the sum of absolute Hadamard transformed differences (SATD) is a new distortion metric adopted as an alternative to the SAD to improve coding efficiency. DOCTOR OF PHILOSOPHY (EEE) 2009-02-06T04:08:38Z 2009-02-06T04:08:38Z 2009 2009 Thesis Xiong, B. (2009). Distortion measure analysis for efficient block matching and face region based video coding. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/14800 10.32657/10356/14800 en 148 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Xiong, Bing
Distortion measure analysis for efficient block matching and face region based video coding
description Video coding has been intensively studied for over one decade and a few video compression standards have been well established with wide applications. These coding paradigms or standards have a number of commonalities, the most salient one among which is the hybrid coding structure consisting of diverse techniques to remove both the temporal and the spatial redundancy. One of the major techniques is block matching which has been used in inter-prediction like block motion estimation as well as in intraprediction such as intra-mode selection in H.264/AVC. Another important topic is bit allocation for rate control to optimize the coding performance. In this thesis, we investigate the block matching technique and face region priority based bit allocation for video coding. While the block matching based motion estimation has been widely studied, this thesis addresses the efficient block matching from the distortion measurement perspective. Specifically, three aspects on the distortion measurement analysis are considered, namely, a new multiplication-free distortion metric, transform-exempt sum of absolute Hadamard transformed differences (SATD) calculation, and a subblock-based distortion metric for efficient block matching, which are summarized in order as follows. It is known that the distortion measure function plays a pivotal role in both matching accuracy and computational complexity. To avoid multiplication operations for simpler implementation, sum of absolute difference (SAD) is normally taken as a substitute to approximate the most widely accepted benchmark, mean squared error (MSE). Although it is well known that SAD is an approximate to MSE, it is necessary to know how accurate the approximate is quantitatively, which can provide us more insights and guidance in selecting an appropriate matching criterion function and developing some new ones. However, there is a lack of such quantitative treatment to this fundamental question. In this thesis, we firstly examine the quantitative deviation of SAD from MSE. In order to reduce the deviation, a new matching criterion, namely weighted sum of absolute difference (WSAD), is thereafter proposed, which enhances the matching accuracy while still maintaining the desirable multiplication-free property. The proposed WSAD is also experimentally validated by applying it in block motion estimation for video coding, showing better rate-distortion performance than SAD. Secondly, in the latest video coding standard H.264/AVC, the sum of absolute Hadamard transformed differences (SATD) is a new distortion metric adopted as an alternative to the SAD to improve coding efficiency.
author2 Charayaphan Cheroensak
author_facet Charayaphan Cheroensak
Xiong, Bing
format Theses and Dissertations
author Xiong, Bing
author_sort Xiong, Bing
title Distortion measure analysis for efficient block matching and face region based video coding
title_short Distortion measure analysis for efficient block matching and face region based video coding
title_full Distortion measure analysis for efficient block matching and face region based video coding
title_fullStr Distortion measure analysis for efficient block matching and face region based video coding
title_full_unstemmed Distortion measure analysis for efficient block matching and face region based video coding
title_sort distortion measure analysis for efficient block matching and face region based video coding
publishDate 2009
url https://hdl.handle.net/10356/14800
_version_ 1772828100045832192