Speaker recognition system
This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and G...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2012
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/48504 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-48504 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-485042023-03-03T20:48:41Z Speaker recognition system Song, Liyan. Chng Eng Siong School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and GMM-UBM to create speaker model. The detail information of MFCC and GMM-UBM will be explained in the report. The program is build based using GMM-UBM and MFCC, the likelihood ratio of the testing speech are the output of the program. The experiment is carry out to evaluate the effects on accuracy when different mixture and file of MFC are used. Bachelor of Engineering (Computer Science) 2012-04-25T04:21:08Z 2012-04-25T04:21:08Z 2012 2012 Final Year Project (FYP) http://hdl.handle.net/10356/48504 en Nanyang Technological University 41 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Song, Liyan. Speaker recognition system |
description |
This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and GMM-UBM to create speaker model. The detail information of MFCC and GMM-UBM will be explained in the report.
The program is build based using GMM-UBM and MFCC, the likelihood ratio of the testing speech are the output of the program. The experiment is carry out to evaluate the effects on accuracy when different mixture and file of MFC are used. |
author2 |
Chng Eng Siong |
author_facet |
Chng Eng Siong Song, Liyan. |
format |
Final Year Project |
author |
Song, Liyan. |
author_sort |
Song, Liyan. |
title |
Speaker recognition system |
title_short |
Speaker recognition system |
title_full |
Speaker recognition system |
title_fullStr |
Speaker recognition system |
title_full_unstemmed |
Speaker recognition system |
title_sort |
speaker recognition system |
publishDate |
2012 |
url |
http://hdl.handle.net/10356/48504 |
_version_ |
1759854638696235008 |