Blind quality assessment of image and speech signals

Quality assessment of multimedia signals is of great interest to the researchers and practitioners in signal processing community. As most multimedia services and systems are provided for human consumption, it is of great importance to reproduce human judgement of perceived quality for objective qua...

Full description

Saved in:
Bibliographic Details
Main Author: Li, Qiaohong
Other Authors: Lin Weisi
Format: Theses and Dissertations
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/70758
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-70758
record_format dspace
spelling sg-ntu-dr.10356-707582023-03-04T00:52:43Z Blind quality assessment of image and speech signals Li, Qiaohong Lin Weisi School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering Quality assessment of multimedia signals is of great interest to the researchers and practitioners in signal processing community. As most multimedia services and systems are provided for human consumption, it is of great importance to reproduce human judgement of perceived quality for objective quality assessment methods. Among all kinds of these methods, no-reference or blind methods that operate solely on the distorted signals are most desirable as the reference signals are not always available in many practical applications. However, blind quality assessment is a very challenging task due to the various distortion types and diverse content properties. In this thesis, I present a series of works on designing better blind models to automatically estimate perceptual quality of image and speech signals for modern multimedia systems. The first work presented here deals with quality assessment on multiply-distorted images. We propose a novel structural feature as the gradient weighted histogram of local binary pattern calculated on the gradient map, which is effective to describe the complex degradation pattern introduced by multiple distortions. In the second work we propose a general-purpose method to predict the visual quality of images degraded by various distortion types. By exploring the characteristics of the human visual system (HVS), two new perceptual features are extracted to represent the structural information and luminance changes in distorted images. We show that the complementary information provided by extracted statistical structural and luminance features plays an important role in image quality estimation. This work is later extended in the third work by two aspects: 1) we show that linear filter response can complement the widely used local contrast normalization response; 2) we fuse the luminance and structural information through a weighting scheme. These two works belong to perceptual feature based methods, accounting for the HVS properties in the feature design. In the fourth work we explore the utilization of natural scene statistics (NSS) for general-purpose blind image quality assessment. We present a new model for natural images, by using multivariate Gaussian mixture model to approximate the joint distribution of log-contrast response. This is the first attempt to use joint NSS model for blind image quality assessment, and it has several advantages over related works. The last work of this thesis presents a novel non-intrusive speech quality assessment method by adopting the bag-of-words model to speech feature extraction. It provides an effective way to producing global representation from local segments. In all of these works we compare the proposed methods to the cutting edge of related works. Doctor of Philosophy (SCE) 2017-05-09T09:23:31Z 2017-05-09T09:23:31Z 2017 Thesis Li, Q. (2017). Blind quality assessment of image and speech signals. Doctoral thesis, Nanyang Technological University, Singapore. http://hdl.handle.net/10356/70758 10.32657/10356/70758 en 144 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Li, Qiaohong
Blind quality assessment of image and speech signals
description Quality assessment of multimedia signals is of great interest to the researchers and practitioners in signal processing community. As most multimedia services and systems are provided for human consumption, it is of great importance to reproduce human judgement of perceived quality for objective quality assessment methods. Among all kinds of these methods, no-reference or blind methods that operate solely on the distorted signals are most desirable as the reference signals are not always available in many practical applications. However, blind quality assessment is a very challenging task due to the various distortion types and diverse content properties. In this thesis, I present a series of works on designing better blind models to automatically estimate perceptual quality of image and speech signals for modern multimedia systems. The first work presented here deals with quality assessment on multiply-distorted images. We propose a novel structural feature as the gradient weighted histogram of local binary pattern calculated on the gradient map, which is effective to describe the complex degradation pattern introduced by multiple distortions. In the second work we propose a general-purpose method to predict the visual quality of images degraded by various distortion types. By exploring the characteristics of the human visual system (HVS), two new perceptual features are extracted to represent the structural information and luminance changes in distorted images. We show that the complementary information provided by extracted statistical structural and luminance features plays an important role in image quality estimation. This work is later extended in the third work by two aspects: 1) we show that linear filter response can complement the widely used local contrast normalization response; 2) we fuse the luminance and structural information through a weighting scheme. These two works belong to perceptual feature based methods, accounting for the HVS properties in the feature design. In the fourth work we explore the utilization of natural scene statistics (NSS) for general-purpose blind image quality assessment. We present a new model for natural images, by using multivariate Gaussian mixture model to approximate the joint distribution of log-contrast response. This is the first attempt to use joint NSS model for blind image quality assessment, and it has several advantages over related works. The last work of this thesis presents a novel non-intrusive speech quality assessment method by adopting the bag-of-words model to speech feature extraction. It provides an effective way to producing global representation from local segments. In all of these works we compare the proposed methods to the cutting edge of related works.
author2 Lin Weisi
author_facet Lin Weisi
Li, Qiaohong
format Theses and Dissertations
author Li, Qiaohong
author_sort Li, Qiaohong
title Blind quality assessment of image and speech signals
title_short Blind quality assessment of image and speech signals
title_full Blind quality assessment of image and speech signals
title_fullStr Blind quality assessment of image and speech signals
title_full_unstemmed Blind quality assessment of image and speech signals
title_sort blind quality assessment of image and speech signals
publishDate 2017
url http://hdl.handle.net/10356/70758
_version_ 1759853926009536512