Complex-valued mixing matrix estimation for blind separation of acoustic mixtures

This thesis concerns several aspects of complex-valued instantaneous mixing matrix estimation. While the main application of the work is to extract individual speech signal from group-based conversations in real-world environments, the methods developed in this thesis can be applied to other types o...

Full description

Saved in:
Bibliographic Details
Main Author: Nguyen, Anh Hai Trieu
Other Authors: Andy Khong W H
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/143962
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-143962
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Nguyen, Anh Hai Trieu
Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
description This thesis concerns several aspects of complex-valued instantaneous mixing matrix estimation. While the main application of the work is to extract individual speech signal from group-based conversations in real-world environments, the methods developed in this thesis can be applied to other types of complex-valued instantaneous mixtures. The thesis includes several main contributions: improvement of estimation performance of sparse component analysis in under-determined case where the number of sensors is less than the number of sources, the connection between sparse component analysis and independent component analysis in determined case, and improvement of estimation performance of two independent component analysis methods in determined case where the number of sensors is equal to the number of sources. For under-determined blind source separation, mixture models of directional statistics are often employed to estimate the mixing matrix. These mixture models commonly adopt the Gaussian function to model the distribution of the observed data about a specific direction. Since the Gaussian function may not be well-suited for sparse signals, the performance of mixture models of directional statistics is limited. Furthermore, solving mixture models of directional statistics is computationally costly due to the computation of spatial covariance matrices. These issues motivated the development of directional sparse filtering (DSF) for complex-valued mixing matrix estimation. Using power mean of the magnitude-squared cosine distances, directional sparse filtering searches for a set of directions such that each data sample is expressed by only a few vectors of the set. Simulations on synthetic data and real mixtures of recorded speeches indicate that the proposed directional sparse filtering outperforms directional clustering baselines while having lower computational complexity. Next, a stability condition for invertible mixing matrix estimation using multi-variate non-separable contrast functions is derived. The stability condition is verified via Monte-Carlo simulation for several directional clustering methods and directional sparse filtering algorithm, given that the sources are super-Gaussian. The stability condition implies that the minima of the cost functions corresponding to these sparse component analysis methods include the mixing matrix in a similar manner to that of independent component analysis. This result shows that sparse component analysis methods based on directional clustering can be interpreted as independent component analysis using quasi-maximum likelihood estimation. For determined blind source separation, two re-parameterizations are proposed to convert the constrained independent component analysis by entropy bound minimization into unconstrained optimization problems. After that, these unconstrained optimization problems are solved via quasi-Newton methods. Experiment results show that the proposed method yields higher estimation performance than the baseline algorithm, particularly when the source distributions are close to Gaussian distribution. The re-parameterizations for independent component analysis by entropy bound minimization are further extended to estimate the mixing matrix and the temporal whitening filters simultaneously. This results in a new algorithm for independent component analysis by entropy rate bound minimization. Compared to the conventional independent component analysis by entropy rate bound minimization, the new method shows improvement in terms of mixing matrix estimation and speech separation.
author2 Andy Khong W H
author_facet Andy Khong W H
Nguyen, Anh Hai Trieu
format Thesis-Doctor of Philosophy
author Nguyen, Anh Hai Trieu
author_sort Nguyen, Anh Hai Trieu
title Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
title_short Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
title_full Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
title_fullStr Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
title_full_unstemmed Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
title_sort complex-valued mixing matrix estimation for blind separation of acoustic mixtures
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/143962
_version_ 1772826035857915904
spelling sg-ntu-dr.10356-1439622023-07-04T17:06:36Z Complex-valued mixing matrix estimation for blind separation of acoustic mixtures Nguyen, Anh Hai Trieu Andy Khong W H School of Electrical and Electronic Engineering AndyKhong@ntu.edu.sg Engineering::Electrical and electronic engineering This thesis concerns several aspects of complex-valued instantaneous mixing matrix estimation. While the main application of the work is to extract individual speech signal from group-based conversations in real-world environments, the methods developed in this thesis can be applied to other types of complex-valued instantaneous mixtures. The thesis includes several main contributions: improvement of estimation performance of sparse component analysis in under-determined case where the number of sensors is less than the number of sources, the connection between sparse component analysis and independent component analysis in determined case, and improvement of estimation performance of two independent component analysis methods in determined case where the number of sensors is equal to the number of sources. For under-determined blind source separation, mixture models of directional statistics are often employed to estimate the mixing matrix. These mixture models commonly adopt the Gaussian function to model the distribution of the observed data about a specific direction. Since the Gaussian function may not be well-suited for sparse signals, the performance of mixture models of directional statistics is limited. Furthermore, solving mixture models of directional statistics is computationally costly due to the computation of spatial covariance matrices. These issues motivated the development of directional sparse filtering (DSF) for complex-valued mixing matrix estimation. Using power mean of the magnitude-squared cosine distances, directional sparse filtering searches for a set of directions such that each data sample is expressed by only a few vectors of the set. Simulations on synthetic data and real mixtures of recorded speeches indicate that the proposed directional sparse filtering outperforms directional clustering baselines while having lower computational complexity. Next, a stability condition for invertible mixing matrix estimation using multi-variate non-separable contrast functions is derived. The stability condition is verified via Monte-Carlo simulation for several directional clustering methods and directional sparse filtering algorithm, given that the sources are super-Gaussian. The stability condition implies that the minima of the cost functions corresponding to these sparse component analysis methods include the mixing matrix in a similar manner to that of independent component analysis. This result shows that sparse component analysis methods based on directional clustering can be interpreted as independent component analysis using quasi-maximum likelihood estimation. For determined blind source separation, two re-parameterizations are proposed to convert the constrained independent component analysis by entropy bound minimization into unconstrained optimization problems. After that, these unconstrained optimization problems are solved via quasi-Newton methods. Experiment results show that the proposed method yields higher estimation performance than the baseline algorithm, particularly when the source distributions are close to Gaussian distribution. The re-parameterizations for independent component analysis by entropy bound minimization are further extended to estimate the mixing matrix and the temporal whitening filters simultaneously. This results in a new algorithm for independent component analysis by entropy rate bound minimization. Compared to the conventional independent component analysis by entropy rate bound minimization, the new method shows improvement in terms of mixing matrix estimation and speech separation. Doctor of Philosophy 2020-10-05T07:33:14Z 2020-10-05T07:33:14Z 2020 Thesis-Doctor of Philosophy Nguyen, A. H. T. (2020). Complex-valued mixing matrix estimation for blind separation of acoustic mixtures. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/143962 10.32657/10356/143962 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University