Localization and tracking of acoustic sources in room environment

This thesis addresses two areas of the acoustic source localization and tracking (ASLT) problem in a room environment, namely, tracking of acoustic sources using multiple omni-directional microphone arrays and DOA estimation of the acoustic sources using a single acoustic vector sensor (AVS). The ch...

Full description

Saved in:
Bibliographic Details
Main Author: Wu, Kai
Other Authors: Andy Khong Wai Hoong
Format: Theses and Dissertations
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/72519
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-72519
record_format dspace
spelling sg-ntu-dr.10356-725192023-07-04T17:12:57Z Localization and tracking of acoustic sources in room environment Wu, Kai Andy Khong Wai Hoong School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing This thesis addresses two areas of the acoustic source localization and tracking (ASLT) problem in a room environment, namely, tracking of acoustic sources using multiple omni-directional microphone arrays and DOA estimation of the acoustic sources using a single acoustic vector sensor (AVS). The challenges for the ASLT problem, in both aforementioned applications may include room reverberation, background environmental noise, sound interference, as well as the presence of multiple speakers. For multiple omni-directional arrays, the thesis focuses on the source tracking problem where the source position is estimated sequentially across several time frames using the particle filter (PF) framework. To achieve single-source tracking in a reverberant and noisy environment, an algorithm which utilizes the well-known sequential importance resampling PF (SIRPF) framework is proposed. As will be shown in this thesis, this proposed algorithm derives the measurement likelihood which is robust to reverberation and noise. For single-source tracking in the presence of sound interference, another SIRPF-based algorithm is proposed. This algorithm exploits the harmonicity feature of a speech signal for deriving the measurement likelihood. Due to the use of distinctive speech feature, speech-sensitive tracking can be achieved in the presence of sound interference. The performance of these two algorithms have been verified through simulation. The problem of tracking of alternating speakers will then be discussed in which the speech sources are active in turns. For solving this problem, a novel swarm intelligence based PF (SWIPF) which jointly exploits the advantages of PF and particle swarm intelligence is proposed. The PF framework is used as sequential state estimation framework which suits for the tracking problem. The limitation of PF, which lies in the particle sampling, is addressed by incorporating the particle swarm intelligence. By using the swarm intelligence, particles are associated with interaction and memory mechanisms. When alternation occurs, particles can be directed toward the true source location by interacting and sharing the fitness information among themselves. In addition, the memory mechanism allows particles to retain their previous best-fit positions when signals are corrupted by noise and reverberation. The proposed SWIPF is verified using both simulations and real experiments. The thesis finally considers the multi-source DOA estimation problem using an AVS. Unlike the conventional microphone arrays which requires inter-spacing between microphones, the co-location of sensor elements in an AVS can be exploited to achieve robust DOA estimation in a reverberant environment. As opposed to the existing multi-source DOA estimation algorithms using AVS, the proposed algorithm is developed from a reverberant received signal model. By exploiting the co-location of the sensor elements in an AVS, the low-reverberant-single-source (LRSS) zones of the received signals, where only one source is dominant with a high signal-to-reverberation ratio, can be identified. By using only these identified LRSS zones followed by a clustering step, multi-source DOA estimation in reverberant environment can therefore be achieved. Simulation is conducted to verify the performance of the proposed algorithm. Doctor of Philosophy (EEE) 2017-08-23T01:27:15Z 2017-08-23T01:27:15Z 2017 Thesis Wu, K. (2017). Localization and tracking of acoustic sources in room environment. Doctoral thesis, Nanyang Technological University, Singapore. http://hdl.handle.net/10356/72519 10.32657/10356/72519 en 177 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Wu, Kai
Localization and tracking of acoustic sources in room environment
description This thesis addresses two areas of the acoustic source localization and tracking (ASLT) problem in a room environment, namely, tracking of acoustic sources using multiple omni-directional microphone arrays and DOA estimation of the acoustic sources using a single acoustic vector sensor (AVS). The challenges for the ASLT problem, in both aforementioned applications may include room reverberation, background environmental noise, sound interference, as well as the presence of multiple speakers. For multiple omni-directional arrays, the thesis focuses on the source tracking problem where the source position is estimated sequentially across several time frames using the particle filter (PF) framework. To achieve single-source tracking in a reverberant and noisy environment, an algorithm which utilizes the well-known sequential importance resampling PF (SIRPF) framework is proposed. As will be shown in this thesis, this proposed algorithm derives the measurement likelihood which is robust to reverberation and noise. For single-source tracking in the presence of sound interference, another SIRPF-based algorithm is proposed. This algorithm exploits the harmonicity feature of a speech signal for deriving the measurement likelihood. Due to the use of distinctive speech feature, speech-sensitive tracking can be achieved in the presence of sound interference. The performance of these two algorithms have been verified through simulation. The problem of tracking of alternating speakers will then be discussed in which the speech sources are active in turns. For solving this problem, a novel swarm intelligence based PF (SWIPF) which jointly exploits the advantages of PF and particle swarm intelligence is proposed. The PF framework is used as sequential state estimation framework which suits for the tracking problem. The limitation of PF, which lies in the particle sampling, is addressed by incorporating the particle swarm intelligence. By using the swarm intelligence, particles are associated with interaction and memory mechanisms. When alternation occurs, particles can be directed toward the true source location by interacting and sharing the fitness information among themselves. In addition, the memory mechanism allows particles to retain their previous best-fit positions when signals are corrupted by noise and reverberation. The proposed SWIPF is verified using both simulations and real experiments. The thesis finally considers the multi-source DOA estimation problem using an AVS. Unlike the conventional microphone arrays which requires inter-spacing between microphones, the co-location of sensor elements in an AVS can be exploited to achieve robust DOA estimation in a reverberant environment. As opposed to the existing multi-source DOA estimation algorithms using AVS, the proposed algorithm is developed from a reverberant received signal model. By exploiting the co-location of the sensor elements in an AVS, the low-reverberant-single-source (LRSS) zones of the received signals, where only one source is dominant with a high signal-to-reverberation ratio, can be identified. By using only these identified LRSS zones followed by a clustering step, multi-source DOA estimation in reverberant environment can therefore be achieved. Simulation is conducted to verify the performance of the proposed algorithm.
author2 Andy Khong Wai Hoong
author_facet Andy Khong Wai Hoong
Wu, Kai
format Theses and Dissertations
author Wu, Kai
author_sort Wu, Kai
title Localization and tracking of acoustic sources in room environment
title_short Localization and tracking of acoustic sources in room environment
title_full Localization and tracking of acoustic sources in room environment
title_fullStr Localization and tracking of acoustic sources in room environment
title_full_unstemmed Localization and tracking of acoustic sources in room environment
title_sort localization and tracking of acoustic sources in room environment
publishDate 2017
url http://hdl.handle.net/10356/72519
_version_ 1772826910549606400