On scalable to lossless audio coding

With advances in broadband network and storage technologies, more and more digital audio applications are ready to deliver high sampling rate and high-resolution lossless audio. On the other hand, there are also applications that require highly compressed audio such as those found in wireless commun...

Bibliographic Details
Main Author: Li, Te
Other Authors: Susanto Rahardja
Format: Theses and Dissertations
Language: English
Published: 2009
Subjects: DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory
Online Access: https://hdl.handle.net/10356/15553
Institution: Nanyang Technological University
id sg-ntu-dr.10356-15553
record_format dspace
spelling sg-ntu-dr.10356-15553 2023-07-04T17:00:51Z On scalable to lossless audio coding Li, Te Susanto Rahardja Koh Soo Ngee School of Electrical and Electronic Engineering A*STAR Institute for Infocomm Research DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory DOCTOR OF PHILOSOPHY (EEE) 2009-05-13T02:38:25Z 2009-05-13T02:38:25Z 2008 2008 Thesis Li, T. (2008). On scalable to lossless audio coding. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/15553 10.32657/10356/15553 en 167 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory
spellingShingle DRNTU::Engineering::Computer science and engineering::Data::Coding and information theory Li, Te On scalable to lossless audio coding
description With advances in broadband network and storage technologies, more and more digital audio applications are ready to deliver high-sampling-rate, high-resolution lossless audio. On the other hand, some applications, such as those found in wireless communications, require highly compressed audio. To meet these differing demands, a scalable audio coding technology that supports both lossy and lossless audio compression is desirable. MPEG-4 audio scalable lossless (SLS) coding was published as an international standard in June 2006. It allows perceptually coded audio to be scaled up to fully lossless audio through a wide range of intermediate bitrate representations. The main technologies adopted in SLS include the latest integer transform, namely the integer modified discrete cosine transform (IntMDCT), and a new entropy coder based on the bit-plane Golomb code (BPGC). As a relatively new coding structure, SLS still has room for improvement, and several critical issues directly affect the wide adoption and application of the SLS codec. This dissertation aims to address these issues. Firstly, the effect of the rounding errors introduced by IntMDCT in the perceptual (lossy) audio coding scenario is studied. Based on intensive test results, it is concluded that the MDCT and IntMDCT filterbanks are interchangeable in a lossy coding scenario. This finding justifies the use of the low-complexity SLS structure. Secondly, perceptually enhanced prioritized bit-plane audio coding algorithms are proposed for the non-core and low-core-bitrate modes of SLS, based on the energy distributions in different frequency regions. By using only a single bit in each frame to indicate which of two coding models is used, considerable perceptual quality enhancement is achieved over a wide range of bitrates.
Thirdly, efficient bit allocation schemes for stereo channels are proposed for both the SLS encoder and the truncator. By allocating bits according to the energy level, the proposed algorithm achieves a significant improvement in quality for signals (such as speech) whose left and right channels are highly correlated. Lastly, a “smart” function is designed for SLS. Given a low-quality audio format and its original input, the proposed smart enhancing process enables a scalable encoder to automatically encode the minimum amount of enhancement needed for the low-quality audio to attain a “transparent quality” equal to CD quality. This function facilitates the application of SLS in multi-quality online music sales. With these proposed solutions, the MPEG-4 SLS coder has been enhanced, resulting in much better perceptual quality and more robust features. Users can benefit from the convenience of a universal codec as well as its excellent performance in terms of both quality and compression. Finally, several research topics in scalable lossless coding are recommended for future work.
author2 Susanto Rahardja
author_facet Susanto Rahardja
Li, Te
format Theses and Dissertations
author Li, Te
author_sort Li, Te
title On scalable to lossless audio coding
title_sort on scalable to lossless audio coding
publishDate 2009
url https://hdl.handle.net/10356/15553
_version_ 1772826196778680320