On scalable to lossless audio coding

With advances in broadband network and storage technologies, more and more digital audio applications are ready to deliver high sampling rate and high-resolution lossless audio. On the other hand, there are also applications that require highly compressed audio such as those found in wireless commun...

Full description

Saved in:

Bibliographic Details
Main Author:	Li, Te
Other Authors:	Susanto Rahardja
Format:	Theses and Dissertations
Language:	English
Published:	2009
Subjects:	DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory
Online Access:	https://hdl.handle.net/10356/15553
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-15553
record_format	dspace
spelling	sg-ntu-dr.10356-155532023-07-04T17:00:51Z On scalable to lossless audio coding Li, Te Susanto Rahardja Koh Soo Ngee School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory With advances in broadband network and storage technologies, more and more digital audio applications are ready to deliver high sampling rate and high-resolution lossless audio. On the other hand, there are also applications that require highly compressed audio such as those found in wireless communications. To deal with these various demands, a scalable audio coding technology that supports both lossy and lossless audio compression is thus desirable. MPEG-4 audio scalable lossless (SLS) coding was published as an international standard in June 2006. It allows the scaling up of a perceptually coded audio to a fully lossless audio with a wide range of intermediate bitrate representations. The main technologies adopted in SLS include the latest integer transform, namely integer modified discrete cosine transform (IntMDCT), and a new entropy coder that is based on bit-plane Golomb code (BPGC). As a relatively new coding structure, SLS is far from perfect with room for improvements. There are several critical issues which directly affect the wide adoption and application of the SLS codec. This dissertation aims to provide answers to all these critical issues. Firstly, the effect of the rounding errors introduced by IntMDCT under the perceptual (lossy) audio coding scenario is studied. Based on intensive test results,it is concluded that MDCT and IntMDCT filterbanks are interchangeable in a lossy coding scenario. This finding justifies the use of the low-complexity SLS structure. Secondly, perceptually enhanced prioritized bit-plane audio coding algorithms are proposed for the non-core and low-core-bitrate mode of SLS based on the energy distributions in different frequency regions. By using only a single bit in each frame to indicate one of the two coding models to be used, considerable perceptual quality enhancement is achieved for a wide range of bitrates. Thirdly, efficient bit allocation schemes for stereo channels in both the SLS encoder and truncator are proposed. By allocating bits according to the energy level, significant improvement in quality can be achieved by the proposed algorithm for signal (such as speech) that is highly correlated for the left and right channels. Lastly, a “smart” function is designed for SLS. With a low quality audio format and its original inputs, the proposed smart enhancing process enables a scalable encoder to automatically encode the minimum amount of enhancement for the low quality audio to attain a “transparent quality” that is the same as the CD quality. This function facilitates the application of SLS in multi-quality online music sales. With these proposed solutions, the MPEG-4 SLS coder has been enhanced resulting in a much better perceptual quality and more robust features. The users can benefit from the convenience of the universality, as well as the excellent performance in terms of both the quality and compression, of this codec. Finally, several interesting research topics for scalable lossless coding are also recommended for future research. DOCTOR OF PHILOSOPHY (EEE) 2009-05-13T02:38:25Z 2009-05-13T02:38:25Z 2008 2008 Thesis Li, T. (2008). On scalable to lossless audio coding. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/15553 10.32657/10356/15553 en 167 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory
spellingShingle	DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory Li, Te On scalable to lossless audio coding
description	With advances in broadband network and storage technologies, more and more digital audio applications are ready to deliver high sampling rate and high-resolution lossless audio. On the other hand, there are also applications that require highly compressed audio such as those found in wireless communications. To deal with these various demands, a scalable audio coding technology that supports both lossy and lossless audio compression is thus desirable. MPEG-4 audio scalable lossless (SLS) coding was published as an international standard in June 2006. It allows the scaling up of a perceptually coded audio to a fully lossless audio with a wide range of intermediate bitrate representations. The main technologies adopted in SLS include the latest integer transform, namely integer modified discrete cosine transform (IntMDCT), and a new entropy coder that is based on bit-plane Golomb code (BPGC). As a relatively new coding structure, SLS is far from perfect with room for improvements. There are several critical issues which directly affect the wide adoption and application of the SLS codec. This dissertation aims to provide answers to all these critical issues. Firstly, the effect of the rounding errors introduced by IntMDCT under the perceptual (lossy) audio coding scenario is studied. Based on intensive test results,it is concluded that MDCT and IntMDCT filterbanks are interchangeable in a lossy coding scenario. This finding justifies the use of the low-complexity SLS structure. Secondly, perceptually enhanced prioritized bit-plane audio coding algorithms are proposed for the non-core and low-core-bitrate mode of SLS based on the energy distributions in different frequency regions. By using only a single bit in each frame to indicate one of the two coding models to be used, considerable perceptual quality enhancement is achieved for a wide range of bitrates. Thirdly, efficient bit allocation schemes for stereo channels in both the SLS encoder and truncator are proposed. By allocating bits according to the energy level, significant improvement in quality can be achieved by the proposed algorithm for signal (such as speech) that is highly correlated for the left and right channels. Lastly, a “smart” function is designed for SLS. With a low quality audio format and its original inputs, the proposed smart enhancing process enables a scalable encoder to automatically encode the minimum amount of enhancement for the low quality audio to attain a “transparent quality” that is the same as the CD quality. This function facilitates the application of SLS in multi-quality online music sales. With these proposed solutions, the MPEG-4 SLS coder has been enhanced resulting in a much better perceptual quality and more robust features. The users can benefit from the convenience of the universality, as well as the excellent performance in terms of both the quality and compression, of this codec. Finally, several interesting research topics for scalable lossless coding are also recommended for future research.
author2	Susanto Rahardja
author_facet	Susanto Rahardja Li, Te
format	Theses and Dissertations
author	Li, Te
author_sort	Li, Te
title	On scalable to lossless audio coding
title_short	On scalable to lossless audio coding
title_full	On scalable to lossless audio coding
title_fullStr	On scalable to lossless audio coding
title_full_unstemmed	On scalable to lossless audio coding
title_sort	on scalable to lossless audio coding
publishDate	2009
url	https://hdl.handle.net/10356/15553
_version_	1772826196778680320

On scalable to lossless audio coding

Similar Items