Speaker recognition system

This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and G...

Full description

Saved in:

Bibliographic Details
Main Author:	Song, Liyan.
Other Authors:	Chng Eng Siong
Format:	Final Year Project
Language:	English
Published:	2012
Subjects:	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Online Access:	http://hdl.handle.net/10356/48504
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-48504
record_format	dspace
spelling	sg-ntu-dr.10356-485042023-03-03T20:48:41Z Speaker recognition system Song, Liyan. Chng Eng Siong School of Computer Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and GMM-UBM to create speaker model. The detail information of MFCC and GMM-UBM will be explained in the report. The program is build based using GMM-UBM and MFCC, the likelihood ratio of the testing speech are the output of the program. The experiment is carry out to evaluate the effects on accuracy when different mixture and file of MFC are used. Bachelor of Engineering (Computer Science) 2012-04-25T04:21:08Z 2012-04-25T04:21:08Z 2012 2012 Final Year Project (FYP) http://hdl.handle.net/10356/48504 en Nanyang Technological University 41 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle	DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Song, Liyan. Speaker recognition system
description	This report givens an overview of a Gaussian Mixture Model – Universal Background Model (GMM-UBM) system which focusing on speaker identification. In this report we will be focusing on the traditional FFT-based Mel-Frequency Cepstral Coefficients (MFCCs) method to extract feature from wav file and GMM-UBM to create speaker model. The detail information of MFCC and GMM-UBM will be explained in the report. The program is build based using GMM-UBM and MFCC, the likelihood ratio of the testing speech are the output of the program. The experiment is carry out to evaluate the effects on accuracy when different mixture and file of MFC are used.
author2	Chng Eng Siong
author_facet	Chng Eng Siong Song, Liyan.
format	Final Year Project
author	Song, Liyan.
author_sort	Song, Liyan.
title	Speaker recognition system
title_short	Speaker recognition system
title_full	Speaker recognition system
title_fullStr	Speaker recognition system
title_full_unstemmed	Speaker recognition system
title_sort	speaker recognition system
publishDate	2012
url	http://hdl.handle.net/10356/48504
_version_	1759854638696235008

Speaker recognition system

Similar Items