Spatial speech processing for multi-party teleconferencing

Bibliographic Details
Main Author: Phua, Kok Soon.
Other Authors: Gan, Woon Seng
Format: Theses and Dissertations
Language: English
Published: 2008
Online Access: http://hdl.handle.net/10356/13286
Institution: Nanyang Technological University
Description
Summary: 3D audio reproduction is expected to be widely deployed in applications such as entertainment, simulation and communication. This research focuses on developing a structural model based on 3D audio reproduction to deliver toll-quality speech for the implementation of multi-channel speech coding in teleconferencing. In this model, a monaural speech signal is synthesized into a binaural signal by reproducing the sound localization cues for each ear, creating the perceived position of that signal.
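
The mono-to-binaural step described in the summary can be illustrated with a minimal sketch. This is not the structural model developed in the thesis, whose details are not given in this record. It assumes only the two simplest localization cues: an interaural time difference (ITD) from the Woodworth spherical-head approximation, and a single broadband interaural level difference (ILD). The function name `binauralize`, the 8 kHz sample rate, the 8.75 cm head radius and the 6 dB maximum ILD are all illustrative assumptions.

```python
import math


def binauralize(mono, azimuth_deg, sample_rate=8000,
                head_radius=0.0875, speed_of_sound=343.0, max_ild_db=6.0):
    """Return (left, right) sample lists for a mono signal placed at an azimuth.

    Hypothetical helper: positive azimuth means the source is to the
    listener's right; the angle is clamped to the range [-90, 90] degrees.
    """
    theta = math.radians(max(-90.0, min(90.0, azimuth_deg)))

    # ITD from the Woodworth spherical-head formula (an assumed cue model).
    itd = (head_radius / speed_of_sound) * (abs(theta) + math.sin(abs(theta)))
    delay = int(round(itd * sample_rate))  # whole-sample delay for the far ear

    # ILD as one broadband gain on the far ear (a crude assumption; real
    # level differences depend on frequency).
    far_gain = 10.0 ** (-max_ild_db * abs(math.sin(theta)) / 20.0)

    # The near ear hears the signal first and at full level; the far ear
    # hears it later and quieter. Both outputs are padded to equal length.
    near = list(mono) + [0.0] * delay
    far = [0.0] * delay + [far_gain * s for s in mono]

    if theta >= 0:
        return far, near  # source on the right: the left ear is the far ear
    return near, far
```

In a multi-party setting, each talker's mono stream could be passed through such a function with a different azimuth and the resulting left and right channels summed, so that listeners perceive the talkers at separate positions.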