Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
Enhanced aacPlus, the state-of-the-art low-bitrate audio coder, combines Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS, the newest addition to the coder, makes it possible to encode audio at bitrates as low as 24 kbps with acce...
Main Author: | Samsudin |
---|---|
Other Authors: | Farook Sattar |
Format: | Theses and Dissertations |
Published: | 2008 |
Subjects: | DRNTU::Engineering::Electrical and electronic engineering::Electronic systems |
Online Access: | https://hdl.handle.net/10356/3521 |
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-3521 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-3521; 2023-07-04T16:56:01Z; Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder; Samsudin; Farook Sattar; Evelyn Kurniawati; Sapna George; Ng Boon Poh; School of Electrical and Electronic Engineering; DRNTU::Engineering::Electrical and electronic engineering::Electronic systems; MASTER OF ENGINEERING (EEE); 2008-09-17T09:31:33Z; 2008-09-17T09:31:33Z; 2007; 2007; Thesis; Samsudin. (2007). Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder. Master’s thesis, Nanyang Technological University, Singapore.; https://hdl.handle.net/10356/3521; 10.32657/10356/3521; Nanyang Technological University; application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems |
description |
Enhanced aacPlus, the state-of-the-art low-bitrate audio coder, combines Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS, the newest addition to the coder, makes it possible to encode audio at bitrates as low as 24 kbps with acceptable audio quality. The idea behind PS is to code stereo audio as a monaural downmix signal plus a small set of spatial parameters that describe its spatial image. The monaural signal can then be encoded by any generic audio coder, while the spatial parameters are embedded in the resulting mono audio bitstream. This thesis presents the details of PS encoding together with a general overview of the decoding process. It presents an implementation of a PS encoder that supports the full set of MPEG-4 PS configurations. Along with the implementation, two optimizations are proposed: an enhanced downmixing scheme and a unified transient detector. A subjective listening test evaluating both optimizations shows that the optimized encoder performs as well as the reference encoder while reducing computational complexity by a total of 4%. In addition, a concept for an objective method of evaluating spatial image distortion caused by audio processing is proposed. An evaluation of the proposed method shows that the defined output metrics approximate the simulated spatial distortion. |
author2 |
Farook Sattar |
author_facet |
Farook Sattar Samsudin |
format |
Theses and Dissertations |
author |
Samsudin |
title |
Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder |
publishDate |
2008 |
url |
https://hdl.handle.net/10356/3521 |
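The description above summarizes PS as coding stereo audio as a monaural downmix plus a few spatial parameters. The following is a minimal sketch of that idea only, not the thesis's encoder: it works on one broadband time-domain frame, whereas an MPEG-4 PS encoder estimates parameters per frequency band in a filterbank domain. The function name `ps_analyze`, the plain average downmix, and the two parameters shown (an inter-channel intensity difference in dB and an inter-channel coherence) are assumptions chosen for illustration.

```python
import math


def ps_analyze(left, right, eps=1e-12):
    """Toy parametric-stereo analysis of one frame (hypothetical helper).

    Returns (mono, iid_db, icc):
      mono   - plain average downmix of the two channels
      iid_db - inter-channel intensity difference, in dB
      icc    - inter-channel coherence (normalized cross-correlation)
    """
    if len(left) != len(right):
        raise ValueError("channels must have equal length")

    # Monaural downmix: simple average (assumption, not the thesis's scheme).
    mono = [0.5 * (l + r) for l, r in zip(left, right)]

    # Channel energies and cross term for the frame.
    e_l = sum(l * l for l in left)
    e_r = sum(r * r for r in right)
    cross = sum(l * r for l, r in zip(left, right))

    # eps guards against division by zero and log of zero on silent frames.
    iid_db = 10.0 * math.log10((e_l + eps) / (e_r + eps))
    icc = cross / math.sqrt((e_l + eps) * (e_r + eps))
    return mono, iid_db, icc


if __name__ == "__main__":
    # Right channel is the left channel at half amplitude.
    left = [math.sin(0.1 * n) for n in range(256)]
    right = [0.5 * x for x in left]
    mono, iid_db, icc = ps_analyze(left, right)
    print(f"IID = {iid_db:.2f} dB, ICC = {icc:.3f}")
```

A decoder working from such parameters would try to rebuild a stereo image from the mono signal alone, which is why only the downmix needs to pass through the core audio coder.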