Female voice recognition using artificial neural networks and MATLAB voicebox toolbox

Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recogni...

Full description

Saved in:
Bibliographic Details
Main Authors: Brucal, Stanley Glenn E., Africa, Aaron Don M., Dadios, Elmer P.
Format: text
Published: Animo Repository 2018
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/1923
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-2922
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-29222022-06-07T03:57:49Z Female voice recognition using artificial neural networks and MATLAB voicebox toolbox Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recognition that has yielded high accuracy rates of 95% and above. The extracted MFCCs from MATLAB Voicebox toolbox were used as inputs to the multilayer Artificial Neural Networks (ANN) for female voice recognition algorithm. This study explored the recognition performance of the neural networks using variable number of hidden neurons and layers, and determine the architecture that would provide the optimum performance in terms of high recognition rate. MATLAB simulation resulted to a training and testing recognition rate of 100.00% when using 3-hidden-layer neural network from speech samples of a single-speaker, and highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using 4-hidden-layer neural network from speech samples of several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained from a single-speaker, and 81.91% from a 4-hidden-layer neural network trained from multiple speakers. The deviation in recognition rates were primarily attributed to the variations made in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved. 2018-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/1923 Faculty Research Work Animo Repository Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Automatic speech recognition
Neural networks (Computer science)
Electrical and Computer Engineering
spellingShingle Automatic speech recognition
Neural networks (Computer science)
Electrical and Computer Engineering
Brucal, Stanley Glenn E.
Africa, Aaron Don M.
Dadios, Elmer P.
Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
description Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recognition that has yielded high accuracy rates of 95% and above. The extracted MFCCs from MATLAB Voicebox toolbox were used as inputs to the multilayer Artificial Neural Networks (ANN) for female voice recognition algorithm. This study explored the recognition performance of the neural networks using variable number of hidden neurons and layers, and determine the architecture that would provide the optimum performance in terms of high recognition rate. MATLAB simulation resulted to a training and testing recognition rate of 100.00% when using 3-hidden-layer neural network from speech samples of a single-speaker, and highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using 4-hidden-layer neural network from speech samples of several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained from a single-speaker, and 81.91% from a 4-hidden-layer neural network trained from multiple speakers. The deviation in recognition rates were primarily attributed to the variations made in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved.
format text
author Brucal, Stanley Glenn E.
Africa, Aaron Don M.
Dadios, Elmer P.
author_facet Brucal, Stanley Glenn E.
Africa, Aaron Don M.
Dadios, Elmer P.
author_sort Brucal, Stanley Glenn E.
title Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_short Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_full Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_fullStr Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_full_unstemmed Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
title_sort female voice recognition using artificial neural networks and matlab voicebox toolbox
publisher Animo Repository
publishDate 2018
url https://animorepository.dlsu.edu.ph/faculty_research/1923
_version_ 1736864089469091840