Complex-valued mixing matrix estimation for blind separation of acoustic mixtures
This thesis concerns several aspects of complex-valued instantaneous mixing matrix estimation. While the main application of the work is to extract individual speech signal from group-based conversations in real-world environments, the methods developed in this thesis can be applied to other types o...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/143962 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-143962 |
---|---|
record_format |
dspace |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Nguyen, Anh Hai Trieu Complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
description |
This thesis concerns several aspects of complex-valued instantaneous mixing matrix estimation. While the main application of the work is to extract individual speech signal from group-based conversations in real-world environments, the methods developed in this thesis can be applied to other types of complex-valued instantaneous mixtures. The thesis includes several main contributions: improvement of estimation performance of sparse component analysis in under-determined case where the number of sensors is less than the number of sources, the connection between sparse component analysis and independent component analysis in determined case, and improvement of estimation performance of two independent component analysis methods in determined case where the number of sensors is equal to the number of sources.
For under-determined blind source separation, mixture models of directional statistics are often employed to estimate the mixing matrix. These mixture models commonly adopt the Gaussian function to model the distribution of the observed data about a specific direction. Since the Gaussian function may not be well-suited for sparse signals, the performance of mixture models of directional statistics is limited. Furthermore, solving mixture models of directional statistics is computationally costly due to the computation of spatial covariance matrices. These issues motivated the development of directional sparse filtering (DSF) for complex-valued mixing matrix estimation. Using power mean of the magnitude-squared cosine distances, directional sparse filtering searches for a set of directions such that each data sample is expressed by only a few vectors of the set. Simulations on synthetic data and real mixtures of recorded speeches indicate that the proposed directional sparse filtering outperforms directional clustering baselines while having lower computational complexity.
Next, a stability condition for invertible mixing matrix estimation using multi-variate non-separable contrast functions is derived. The stability condition is verified via Monte-Carlo simulation for several directional clustering methods and directional sparse filtering algorithm, given that the sources are super-Gaussian. The stability condition implies that the minima of the cost functions corresponding to these sparse component analysis methods include the mixing matrix in a similar manner to that of independent component analysis. This result shows that sparse component analysis methods based on directional clustering can be interpreted as independent component analysis using quasi-maximum likelihood estimation.
For determined blind source separation, two re-parameterizations are proposed to convert the constrained independent component analysis by entropy bound minimization into unconstrained optimization problems. After that, these unconstrained optimization problems are solved via quasi-Newton methods. Experiment results show that the proposed method yields higher estimation performance than the baseline algorithm, particularly when the source distributions are close to Gaussian distribution. The re-parameterizations for independent component analysis by entropy bound minimization are further extended to estimate the mixing matrix and the temporal whitening filters simultaneously. This results in a new algorithm for independent component analysis by entropy rate bound minimization. Compared to the conventional independent component analysis by entropy rate bound minimization, the new method shows improvement in terms of mixing matrix estimation and speech separation. |
author2 |
Andy Khong W H |
author_facet |
Andy Khong W H Nguyen, Anh Hai Trieu |
format |
Thesis-Doctor of Philosophy |
author |
Nguyen, Anh Hai Trieu |
author_sort |
Nguyen, Anh Hai Trieu |
title |
Complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
title_short |
Complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
title_full |
Complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
title_fullStr |
Complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
title_full_unstemmed |
Complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
title_sort |
complex-valued mixing matrix estimation for blind separation of acoustic mixtures |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/143962 |
_version_ |
1772826035857915904 |
spelling |
sg-ntu-dr.10356-1439622023-07-04T17:06:36Z Complex-valued mixing matrix estimation for blind separation of acoustic mixtures Nguyen, Anh Hai Trieu Andy Khong W H School of Electrical and Electronic Engineering AndyKhong@ntu.edu.sg Engineering::Electrical and electronic engineering This thesis concerns several aspects of complex-valued instantaneous mixing matrix estimation. While the main application of the work is to extract individual speech signal from group-based conversations in real-world environments, the methods developed in this thesis can be applied to other types of complex-valued instantaneous mixtures. The thesis includes several main contributions: improvement of estimation performance of sparse component analysis in under-determined case where the number of sensors is less than the number of sources, the connection between sparse component analysis and independent component analysis in determined case, and improvement of estimation performance of two independent component analysis methods in determined case where the number of sensors is equal to the number of sources. For under-determined blind source separation, mixture models of directional statistics are often employed to estimate the mixing matrix. These mixture models commonly adopt the Gaussian function to model the distribution of the observed data about a specific direction. Since the Gaussian function may not be well-suited for sparse signals, the performance of mixture models of directional statistics is limited. Furthermore, solving mixture models of directional statistics is computationally costly due to the computation of spatial covariance matrices. These issues motivated the development of directional sparse filtering (DSF) for complex-valued mixing matrix estimation. Using power mean of the magnitude-squared cosine distances, directional sparse filtering searches for a set of directions such that each data sample is expressed by only a few vectors of the set. Simulations on synthetic data and real mixtures of recorded speeches indicate that the proposed directional sparse filtering outperforms directional clustering baselines while having lower computational complexity. Next, a stability condition for invertible mixing matrix estimation using multi-variate non-separable contrast functions is derived. The stability condition is verified via Monte-Carlo simulation for several directional clustering methods and directional sparse filtering algorithm, given that the sources are super-Gaussian. The stability condition implies that the minima of the cost functions corresponding to these sparse component analysis methods include the mixing matrix in a similar manner to that of independent component analysis. This result shows that sparse component analysis methods based on directional clustering can be interpreted as independent component analysis using quasi-maximum likelihood estimation. For determined blind source separation, two re-parameterizations are proposed to convert the constrained independent component analysis by entropy bound minimization into unconstrained optimization problems. After that, these unconstrained optimization problems are solved via quasi-Newton methods. Experiment results show that the proposed method yields higher estimation performance than the baseline algorithm, particularly when the source distributions are close to Gaussian distribution. The re-parameterizations for independent component analysis by entropy bound minimization are further extended to estimate the mixing matrix and the temporal whitening filters simultaneously. This results in a new algorithm for independent component analysis by entropy rate bound minimization. Compared to the conventional independent component analysis by entropy rate bound minimization, the new method shows improvement in terms of mixing matrix estimation and speech separation. Doctor of Philosophy 2020-10-05T07:33:14Z 2020-10-05T07:33:14Z 2020 Thesis-Doctor of Philosophy Nguyen, A. H. T. (2020). Complex-valued mixing matrix estimation for blind separation of acoustic mixtures. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/143962 10.32657/10356/143962 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |