Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder

The state-of-the-art lowbitrate audio coder is enhanced aacPlus, which is a combination of Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS as the newest addition to the coder makes it possible to encode the audio at a bitrate of as low as 24 kbps with acce...

Full description

Saved in:
Bibliographic Details
Main Author: Samsudin
Other Authors: Farook Sattar
Format: Theses and Dissertations
Published: 2008
Subjects:
Online Access:https://hdl.handle.net/10356/3521
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
id sg-ntu-dr.10356-3521
record_format dspace
spelling sg-ntu-dr.10356-35212023-07-04T16:56:01Z Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder Samsudin Farook Sattar Evelyn Kurniawati Sapna George Ng Boon Poh School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems The state-of-the-art lowbitrate audio coder is enhanced aacPlus, which is a combination of Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS as the newest addition to the coder makes it possible to encode the audio at a bitrate of as low as 24 kbps with acceptable audio quality. The idea behind PS is to code stereo audio as a monaural downmix signal and a small amount of spatial parameters which describe its spatial image. The monaural signal can then be encoded by any generic audio coder while the spatial parameters are embedded into the resulting mono audio bitstream. The details of PS encoding as well as a general overview of the decoding process are presented in this thesis. An implementation of PS encoder which supports the full MPEG-4 PS configurations is presented. Along with the implementation, two optimizations are proposed: enhanced downmixing scheme and unified transient detector. A subjective listening test to evaluate both optimizations reveals that the optimized encoder is able to perform as well as the reference encoder with a total saving of 4% of the computational complexity. In addition, a concept of an objective method to evaluate spatial image distortion due to audio processing is proposed. The evaluation of the proposed method reveals that the output metrics defined are able to approximate the simulated spatial distortion. MASTER OF ENGINEERING (EEE) 2008-09-17T09:31:33Z 2008-09-17T09:31:33Z 2007 2007 Thesis Samsudin. (2007). Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder. Master’s thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/3521 10.32657/10356/3521 Nanyang Technological University application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
Samsudin
Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
description The state-of-the-art lowbitrate audio coder is enhanced aacPlus, which is a combination of Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS as the newest addition to the coder makes it possible to encode the audio at a bitrate of as low as 24 kbps with acceptable audio quality. The idea behind PS is to code stereo audio as a monaural downmix signal and a small amount of spatial parameters which describe its spatial image. The monaural signal can then be encoded by any generic audio coder while the spatial parameters are embedded into the resulting mono audio bitstream. The details of PS encoding as well as a general overview of the decoding process are presented in this thesis. An implementation of PS encoder which supports the full MPEG-4 PS configurations is presented. Along with the implementation, two optimizations are proposed: enhanced downmixing scheme and unified transient detector. A subjective listening test to evaluate both optimizations reveals that the optimized encoder is able to perform as well as the reference encoder with a total saving of 4% of the computational complexity. In addition, a concept of an objective method to evaluate spatial image distortion due to audio processing is proposed. The evaluation of the proposed method reveals that the output metrics defined are able to approximate the simulated spatial distortion.
author2 Farook Sattar
author_facet Farook Sattar
Samsudin
format Theses and Dissertations
author Samsudin
author_sort Samsudin
title Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_short Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_full Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_fullStr Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_full_unstemmed Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_sort implementation and optimization of parametric stereo encoding in enhanced aacplus encoder
publishDate 2008
url https://hdl.handle.net/10356/3521
_version_ 1772825553791877120