Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder

The state-of-the-art low-bitrate audio coder is enhanced aacPlus, which is a combination of Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS, as the newest addition to the coder, makes it possible to encode audio at bitrates as low as 24 kbps with acce...

Bibliographic Details
Main Author:	Samsudin
Other Authors:	Farook Sattar
Format:	Theses and Dissertations
Published:	2008
Subjects:	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
Online Access:	https://hdl.handle.net/10356/3521

id	sg-ntu-dr.10356-3521
record_format	dspace
spelling	sg-ntu-dr.10356-3521 2023-07-04T16:56:01Z Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder Samsudin Farook Sattar Evelyn Kurniawati Sapna George Ng Boon Poh School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems The state-of-the-art low-bitrate audio coder is enhanced aacPlus, which is a combination of Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS, as the newest addition to the coder, makes it possible to encode audio at bitrates as low as 24 kbps with acceptable audio quality. The idea behind PS is to code stereo audio as a monaural downmix signal and a small number of spatial parameters that describe its spatial image. The monaural signal can then be encoded by any generic audio coder, while the spatial parameters are embedded into the resulting mono audio bitstream. The details of PS encoding, as well as a general overview of the decoding process, are presented in this thesis. An implementation of a PS encoder that supports the full MPEG-4 PS configurations is presented. Along with the implementation, two optimizations are proposed: an enhanced downmixing scheme and a unified transient detector. A subjective listening test evaluating both optimizations reveals that the optimized encoder performs as well as the reference encoder while saving a total of 4% of the computational complexity. In addition, a concept for an objective method to evaluate spatial image distortion due to audio processing is proposed. The evaluation of the proposed method reveals that the defined output metrics are able to approximate the simulated spatial distortion. MASTER OF ENGINEERING (EEE) 2008-09-17T09:31:33Z 2008-09-17T09:31:33Z 2007 2007 Thesis Samsudin. (2007). Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder. Master’s thesis, Nanyang Technological University, Singapore.
https://hdl.handle.net/10356/3521 10.32657/10356/3521 Nanyang Technological University application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
topic	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering::Electronic systems Samsudin Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
description	The state-of-the-art low-bitrate audio coder is enhanced aacPlus, which is a combination of Parametric Stereo (PS), Spectral Band Replication (SBR) and Advanced Audio Coding (AAC). PS, as the newest addition to the coder, makes it possible to encode audio at bitrates as low as 24 kbps with acceptable audio quality. The idea behind PS is to code stereo audio as a monaural downmix signal and a small number of spatial parameters that describe its spatial image. The monaural signal can then be encoded by any generic audio coder, while the spatial parameters are embedded into the resulting mono audio bitstream. The details of PS encoding, as well as a general overview of the decoding process, are presented in this thesis. An implementation of a PS encoder that supports the full MPEG-4 PS configurations is presented. Along with the implementation, two optimizations are proposed: an enhanced downmixing scheme and a unified transient detector. A subjective listening test evaluating both optimizations reveals that the optimized encoder performs as well as the reference encoder while saving a total of 4% of the computational complexity. In addition, a concept for an objective method to evaluate spatial image distortion due to audio processing is proposed. The evaluation of the proposed method reveals that the defined output metrics are able to approximate the simulated spatial distortion.
author2	Farook Sattar
author_facet	Farook Sattar Samsudin
format	Theses and Dissertations
author	Samsudin
author_sort	Samsudin
title	Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_short	Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_full	Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_fullStr	Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_full_unstemmed	Implementation and optimization of parametric stereo encoding in enhanced aacPlus encoder
title_sort	implementation and optimization of parametric stereo encoding in enhanced aacplus encoder
publishDate	2008
url	https://hdl.handle.net/10356/3521
_version_	1772825553791877120
