Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction

The diversity of today’s playback systems requires a flexible, efficient, and immersive reproduction of sound scenes in digital media. Spatial audio reproduction based on primary-ambient extraction (PAE) fulfills this objective, where accurate extraction of primary and ambient components from sound...

Full description

Saved in:
Bibliographic Details
Main Authors: He, Jianjun, Gan, Woon-Seng, Tan, Ee-Leng
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2016
Subjects:
Online Access:https://hdl.handle.net/10356/81370
http://hdl.handle.net/10220/39537
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-81370
record_format dspace
spelling sg-ntu-dr.10356-813702020-03-07T13:57:25Z Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction He, Jianjun Gan, Woon-Seng Tan, Ee-Leng School of Electrical and Electronic Engineering Ambient spectrum estimation (ASE) Computational efficiency Primary-ambient extraction (PAE) Sparsity Spatial audio The diversity of today’s playback systems requires a flexible, efficient, and immersive reproduction of sound scenes in digital media. Spatial audio reproduction based on primary-ambient extraction (PAE) fulfills this objective, where accurate extraction of primary and ambient components from sound mixtures in channel-based audio is crucial. Severe extraction error was found in existing PAE approaches when dealing with sound mixtures that contain a relatively strong ambient component, a commonly encountered case in the sound scenes of digital media. In this paper, we propose a novel ambient spectrum estimation (ASE) framework to improve the performance of PAE. The ASE framework exploits the equal magnitude of the uncorrelated ambient components in two channels of a stereo signal, and reformulates the PAE problem into the problem of estimating either ambient phase or magnitude. In particular, we take advantage of the sparse characteristic of the primary components to derive sparse solutions for ASE based PAE, together with an approximate solution that can significantly reduce the computational cost. Our objective and subjective experimental results demonstrate that the proposed ASE approaches significantly outperform existing approaches, especially when the ambient component is relatively strong. Accepted version 2016-01-04T06:01:55Z 2019-12-06T14:29:28Z 2016-01-04T06:01:55Z 2019-12-06T14:29:28Z 2015 Journal Article He, J., Gan, W.-S., & Tan, E.-L. (2015). Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 23(9), 1431-1444. 2329-9290 https://hdl.handle.net/10356/81370 http://hdl.handle.net/10220/39537 10.1109/TASLP.2015.2434272 en IEEE/ACM Transactions on Audio, Speech, and Language Processing © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TASLP.2015.2434272]. 14 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Ambient spectrum estimation (ASE)
Computational efficiency
Primary-ambient extraction (PAE)
Sparsity
Spatial audio
spellingShingle Ambient spectrum estimation (ASE)
Computational efficiency
Primary-ambient extraction (PAE)
Sparsity
Spatial audio
He, Jianjun
Gan, Woon-Seng
Tan, Ee-Leng
Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction
description The diversity of today’s playback systems requires a flexible, efficient, and immersive reproduction of sound scenes in digital media. Spatial audio reproduction based on primary-ambient extraction (PAE) fulfills this objective, where accurate extraction of primary and ambient components from sound mixtures in channel-based audio is crucial. Severe extraction error was found in existing PAE approaches when dealing with sound mixtures that contain a relatively strong ambient component, a commonly encountered case in the sound scenes of digital media. In this paper, we propose a novel ambient spectrum estimation (ASE) framework to improve the performance of PAE. The ASE framework exploits the equal magnitude of the uncorrelated ambient components in two channels of a stereo signal, and reformulates the PAE problem into the problem of estimating either ambient phase or magnitude. In particular, we take advantage of the sparse characteristic of the primary components to derive sparse solutions for ASE based PAE, together with an approximate solution that can significantly reduce the computational cost. Our objective and subjective experimental results demonstrate that the proposed ASE approaches significantly outperform existing approaches, especially when the ambient component is relatively strong.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
He, Jianjun
Gan, Woon-Seng
Tan, Ee-Leng
format Article
author He, Jianjun
Gan, Woon-Seng
Tan, Ee-Leng
author_sort He, Jianjun
title Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction
title_short Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction
title_full Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction
title_fullStr Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction
title_full_unstemmed Primary-Ambient Extraction Using Ambient Spectrum Estimation for Immersive Spatial Audio Reproduction
title_sort primary-ambient extraction using ambient spectrum estimation for immersive spatial audio reproduction
publishDate 2016
url https://hdl.handle.net/10356/81370
http://hdl.handle.net/10220/39537
_version_ 1681047662995963904