High performance voice authentication system
Extensive simulations were performed on two popular speech databases, namely KING and TIMIT, to evaluate the proposed methods. A new background model called Global Background Model (GBM) has been presented to replace the memory intensive Universal Background Model (UBM). Based on a novel set theoret...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Published: |
2008
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/2420 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
id |
sg-ntu-dr.10356-2420 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-24202019-12-10T13:21:53Z High performance voice authentication system Panda Ashish Srikanthan, Thambipillai School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Extensive simulations were performed on two popular speech databases, namely KING and TIMIT, to evaluate the proposed methods. A new background model called Global Background Model (GBM) has been presented to replace the memory intensive Universal Background Model (UBM). Based on a novel set theoretic framework for UBM, it has been analytically shown that the performance of the GBM is comparable to that of the UBM. In the quest for efficient algorithms for model training, applicability of vector quantization algorithms for training a GMM has been studied. Subsequently a Bayes Adaptation (BA) based training scheme has been proposed to replace the iterative Expectation Maximization (EM) algorithm for rapid speaker model estimation. Experiments conducted on the speech databases reveal that BA based training scheme results in comparable, at times even better, accuracy as compared to the EM algorithm scheme, while significantly reducing the training time. Master of Engineering (SCE) 2008-09-17T09:02:37Z 2008-09-17T09:02:37Z 2003 2003 Thesis http://hdl.handle.net/10356/2420 Nanyang Technological University application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Panda Ashish High performance voice authentication system |
description |
Extensive simulations were performed on two popular speech databases, namely KING and TIMIT, to evaluate the proposed methods. A new background model called Global Background Model (GBM) has been presented to replace the memory intensive Universal Background Model (UBM). Based on a novel set theoretic framework for UBM, it has been analytically shown that the performance of the GBM is comparable to that of the UBM. In the quest for efficient algorithms for model training, applicability of vector quantization algorithms for training a GMM has been studied. Subsequently a Bayes Adaptation (BA) based training scheme has been proposed to replace the iterative Expectation Maximization (EM) algorithm for rapid speaker model estimation. Experiments conducted on the speech databases reveal that BA based training scheme results in comparable, at times even better, accuracy as compared to the EM algorithm scheme, while significantly reducing the training time. |
author2 |
Srikanthan, Thambipillai |
author_facet |
Srikanthan, Thambipillai Panda Ashish |
format |
Theses and Dissertations |
author |
Panda Ashish |
author_sort |
Panda Ashish |
title |
High performance voice authentication system |
title_short |
High performance voice authentication system |
title_full |
High performance voice authentication system |
title_fullStr |
High performance voice authentication system |
title_full_unstemmed |
High performance voice authentication system |
title_sort |
high performance voice authentication system |
publishDate |
2008 |
url |
http://hdl.handle.net/10356/2420 |
_version_ |
1681034649922437120 |