Female voice recognition using artificial neural networks and MATLAB voicebox toolbox

Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recogni...

Full description

Saved in:

Bibliographic Details
Main Authors:	Brucal, Stanley Glenn E., Africa, Aaron Don M., Dadios, Elmer P.
Format:	text
Published:	Animo Repository 2018
Subjects:	Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering
Online Access:	https://animorepository.dlsu.edu.ph/faculty_research/1923
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University

id	oai:animorepository.dlsu.edu.ph:faculty_research-2922
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:faculty_research-29222022-06-07T03:57:49Z Female voice recognition using artificial neural networks and MATLAB voicebox toolbox Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recognition that has yielded high accuracy rates of 95% and above. The extracted MFCCs from MATLAB Voicebox toolbox were used as inputs to the multilayer Artificial Neural Networks (ANN) for female voice recognition algorithm. This study explored the recognition performance of the neural networks using variable number of hidden neurons and layers, and determine the architecture that would provide the optimum performance in terms of high recognition rate. MATLAB simulation resulted to a training and testing recognition rate of 100.00% when using 3-hidden-layer neural network from speech samples of a single-speaker, and highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using 4-hidden-layer neural network from speech samples of several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained from a single-speaker, and 81.91% from a 4-hidden-layer neural network trained from multiple speakers. The deviation in recognition rates were primarily attributed to the variations made in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved. 2018-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/1923 Faculty Research Work Animo Repository Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
topic	Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering
spellingShingle	Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
description	Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recognition that has yielded high accuracy rates of 95% and above. The extracted MFCCs from MATLAB Voicebox toolbox were used as inputs to the multilayer Artificial Neural Networks (ANN) for female voice recognition algorithm. This study explored the recognition performance of the neural networks using variable number of hidden neurons and layers, and determine the architecture that would provide the optimum performance in terms of high recognition rate. MATLAB simulation resulted to a training and testing recognition rate of 100.00% when using 3-hidden-layer neural network from speech samples of a single-speaker, and highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using 4-hidden-layer neural network from speech samples of several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained from a single-speaker, and 81.91% from a 4-hidden-layer neural network trained from multiple speakers. The deviation in recognition rates were primarily attributed to the variations made in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved.
format	text
author	Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P.
author_facet	Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P.
author_sort	Brucal, Stanley Glenn E.
title	Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_short	Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_full	Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_fullStr	Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_full_unstemmed	Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_sort	female voice recognition using artificial neural networks and matlab voicebox toolbox
publisher	Animo Repository
publishDate	2018
url	https://animorepository.dlsu.edu.ph/faculty_research/1923
_version_	1736864089469091840

Female voice recognition using artificial neural networks and MATLAB voicebox toolbox

Similar Items