Female voice recognition using artificial neural networks and MATLAB voicebox toolbox
Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recogni...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Published: |
Animo Repository
2018
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/faculty_research/1923 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
id |
oai:animorepository.dlsu.edu.ph:faculty_research-2922 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:faculty_research-29222022-06-07T03:57:49Z Female voice recognition using artificial neural networks and MATLAB voicebox toolbox Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recognition that has yielded high accuracy rates of 95% and above. The extracted MFCCs from MATLAB Voicebox toolbox were used as inputs to the multilayer Artificial Neural Networks (ANN) for female voice recognition algorithm. This study explored the recognition performance of the neural networks using variable number of hidden neurons and layers, and determine the architecture that would provide the optimum performance in terms of high recognition rate. MATLAB simulation resulted to a training and testing recognition rate of 100.00% when using 3-hidden-layer neural network from speech samples of a single-speaker, and highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using 4-hidden-layer neural network from speech samples of several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained from a single-speaker, and 81.91% from a 4-hidden-layer neural network trained from multiple speakers. The deviation in recognition rates were primarily attributed to the variations made in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved. 2018-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/1923 Faculty Research Work Animo Repository Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
topic |
Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering |
spellingShingle |
Automatic speech recognition Neural networks (Computer science) Electrical and Computer Engineering Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. Female voice recognition using artificial neural networks and MATLAB voicebox toolbox |
description |
Voice and speaker recognition performances are measured based on the accuracy, speed and robustness. These three key performance indicators are primarily dependent on voice feature extraction method and voice recognition algorithm used. This paper aims to discuss various researches in speech recognition that has yielded high accuracy rates of 95% and above. The extracted MFCCs from MATLAB Voicebox toolbox were used as inputs to the multilayer Artificial Neural Networks (ANN) for female voice recognition algorithm. This study explored the recognition performance of the neural networks using variable number of hidden neurons and layers, and determine the architecture that would provide the optimum performance in terms of high recognition rate. MATLAB simulation resulted to a training and testing recognition rate of 100.00% when using 3-hidden-layer neural network from speech samples of a single-speaker, and highest training recognition rate of 98.11% and testing recognition rate of 87.20% when using 4-hidden-layer neural network from speech samples of several speakers. When tested with homonyms, the best recognition rate was 75.00% from a 3-hidden-layer neural network trained from a single-speaker, and 81.91% from a 4-hidden-layer neural network trained from multiple speakers. The deviation in recognition rates were primarily attributed to the variations made in the number of input neurons, hidden layers, and neurons of the speech recognition neural network. © 2018 Universiti Teknikal Malaysia Melaka. All rights reserved. |
format |
text |
author |
Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. |
author_facet |
Brucal, Stanley Glenn E. Africa, Aaron Don M. Dadios, Elmer P. |
author_sort |
Brucal, Stanley Glenn E. |
title |
Female voice recognition using artificial neural networks and MATLAB voicebox toolbox |
title_short |
Female voice recognition using artificial neural networks and MATLAB voicebox toolbox |
title_full |
Female voice recognition using artificial neural networks and MATLAB voicebox toolbox |
title_fullStr |
Female voice recognition using artificial neural networks and MATLAB voicebox toolbox |
title_full_unstemmed |
Female voice recognition using artificial neural networks and MATLAB voicebox toolbox |
title_sort |
female voice recognition using artificial neural networks and matlab voicebox toolbox |
publisher |
Animo Repository |
publishDate |
2018 |
url |
https://animorepository.dlsu.edu.ph/faculty_research/1923 |
_version_ |
1736864089469091840 |