Speaker-independent speech recognition system using Kohonen self-organizing feature map
In the past few years, there have been many noteworthy advances in artificial neural networks. One such neural network model, presented by Teuvo Kohonen, produces what he calls self-organizing feature maps (SOFM), similar to how the brain works. The goal of the SOFM alg...
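Main Authors: | Bacong, Mark Anthony; Cajes, Gracita A.; Ellema, Wilbert M.; Galang, Gerson C.; Lazaro, Aurora Lourdes Celerina D. |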
---|---|
Format: | text |
Language: | English |
Published: | Animo Repository, 1999 |
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etd_bachelors/11028 |
Institution: | De La Salle University |
id |
oai:animorepository.dlsu.edu.ph:etd_bachelors-11673 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etd_bachelors-11673 2022-02-28T03:14:25Z Speaker-independent speech recognition system using Kohonen self-organizing feature map Bacong, Mark Anthony; Cajes, Gracita A.; Ellema, Wilbert M.; Galang, Gerson C.; Lazaro, Aurora Lourdes Celerina D. In the past few years, there have been many noteworthy advances in artificial neural networks. One such neural network model, presented by Teuvo Kohonen, produces what he calls self-organizing feature maps (SOFM), similar to how the brain works. The goal of the SOFM algorithm is to transform an incoming signal pattern of arbitrary dimensions into a discrete map, and to perform this transformation adaptively in a topologically ordered fashion. This project explores the pattern classification ability of the SOFM on a practical speech recognition problem. This thesis aims to develop a system, using Kohonen's SOFM algorithm, to recognize single-word utterances independently of the speaker. With the proper algorithm and training, the SOFM forms a clustering of the inputs to perform word recognition. The speaker-independent speech recognition system accepts as input isolated words stored as digital speech files. The speech files are preprocessed to extract the linear predictive coding (LPC) coefficients of each file, which serve as the input to the neural network. The SOFM is used to create a topological map of the commands in an unsupervised fashion. Once a topological map is generated, fine-tuning is done using the Optimum Learning Vector Quantization 1 (OLVQ1) algorithm. An architectural structure of the final map is designed using VHDL. The design implements the Manhattan distance computation on real numbers in the IEEE format. The system achieved a recognition rate of 97.5%. 
1999-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/11028 Bachelor's Theses English Animo Repository Automatic speech recognition Pattern recognition systems Speech processing systems. Engineering Systems and Communications |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Automatic speech recognition Pattern recognition systems Speech processing systems. Engineering Systems and Communications |
spellingShingle |
Automatic speech recognition Pattern recognition systems Speech processing systems. Engineering Systems and Communications Bacong, Mark Anthony Cajes, Gracita A. Ellema, Wilbert M. Galang, Gerson C. Lazaro, Aurora Lourdes Celerina D. Speaker-independent speech recognition system using Kohonen self-organizing feature map |
description |
In the past few years, there have been many noteworthy advances in artificial neural networks. One such neural network model, presented by Teuvo Kohonen, produces what he calls self-organizing feature maps (SOFM), similar to how the brain works. The goal of the SOFM algorithm is to transform an incoming signal pattern of arbitrary dimensions into a discrete map, and to perform this transformation adaptively in a topologically ordered fashion.
This project explores the pattern classification ability of the SOFM on a practical speech recognition problem. This thesis aims to develop a system, using Kohonen's SOFM algorithm, to recognize single-word utterances independently of the speaker. With the proper algorithm and training, the SOFM forms a clustering of the inputs to perform word recognition.
The speaker-independent speech recognition system accepts as input isolated words stored as digital speech files. The speech files are preprocessed to extract the linear predictive coding (LPC) coefficients of each file, which serve as the input to the neural network. The SOFM is used to create a topological map of the commands in an unsupervised fashion. Once a topological map is generated, fine-tuning is done using the Optimum Learning Vector Quantization 1 (OLVQ1) algorithm. An architectural structure of the final map is designed using VHDL. The design implements the Manhattan distance computation on real numbers in the IEEE format.
The system achieved a recognition rate of 97.5%. |
format |
text |
author |
Bacong, Mark Anthony; Cajes, Gracita A.; Ellema, Wilbert M.; Galang, Gerson C.; Lazaro, Aurora Lourdes Celerina D. |
author_facet |
Bacong, Mark Anthony; Cajes, Gracita A.; Ellema, Wilbert M.; Galang, Gerson C.; Lazaro, Aurora Lourdes Celerina D. |
author_sort |
Bacong, Mark Anthony |
title |
Speaker-independent speech recognition system using Kohonen self-organizing feature map |
title_short |
Speaker-independent speech recognition system using Kohonen self-organizing feature map |
title_full |
Speaker-independent speech recognition system using Kohonen self-organizing feature map |
title_fullStr |
Speaker-independent speech recognition system using Kohonen self-organizing feature map |
title_full_unstemmed |
Speaker-independent speech recognition system using Kohonen self-organizing feature map |
title_sort |
speaker-independent speech recognition system using kohonen self-organizing feature map |
publisher |
Animo Repository |
publishDate |
1999 |
url |
https://animorepository.dlsu.edu.ph/etd_bachelors/11028 |
_version_ |
1772834813793796096 |
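
The description in this record outlines a pipeline of unsupervised SOFM training, a nearest-node search by Manhattan distance, and OLVQ1 fine-tuning. The Python sketch below illustrates those three steps in software only. It is not the thesis implementation: the 1-D map topology, the linear decay schedules, the parameter values, and all function names (`manhattan`, `best_matching_unit`, `train_som`, `olvq1_step`) are assumptions made for illustration, and short toy vectors stand in for the LPC coefficient vectors. The OLVQ1 step follows one common statement of Kohonen's rule, in which each node keeps its own learning rate.

```python
import random


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def best_matching_unit(weights, x):
    """Index of the node whose weight vector is nearest to x in L1 distance."""
    return min(range(len(weights)), key=lambda i: manhattan(weights[i], x))


def train_som(data, n_nodes, epochs=50, alpha0=0.5, seed=0):
    """Unsupervised training of a 1-D map (assumed topology: a line of nodes)."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = [[rng.random() for _ in range(dim)] for _ in range(n_nodes)]
    for epoch in range(epochs):
        frac = 1.0 - epoch / epochs          # decays linearly from 1 toward 0
        alpha = alpha0 * frac                # shrinking learning rate
        radius = (n_nodes / 2) * frac        # shrinking neighborhood radius
        for x in rng.sample(data, len(data)):
            bmu = best_matching_unit(weights, x)
            for i, w in enumerate(weights):
                if abs(i - bmu) <= radius:   # the winner itself always qualifies
                    for k in range(dim):
                        w[k] += alpha * (x[k] - w[k])
    return weights


def olvq1_step(weights, labels, rates, x, x_label, alpha_max=0.3):
    """One supervised OLVQ1 update; mutates weights and rates in place.

    labels[i] is the word assigned to node i and rates[i] is that node's own
    learning rate (assumed to start below 1, e.g. at alpha_max).
    """
    c = best_matching_unit(weights, x)
    s = 1.0 if labels[c] == x_label else -1.0   # +1 if correct, -1 if wrong
    # Per-node rate: shrinks after a correct match, grows (capped) after a miss.
    rates[c] = min(rates[c] / (1.0 + s * rates[c]), alpha_max)
    for k in range(len(x)):
        # Pull the winner toward x when correct, push it away when wrong.
        weights[c][k] += s * rates[c] * (x[k] - weights[c][k])
    return c


if __name__ == "__main__":
    # Toy 2-D vectors standing in for LPC feature vectors (illustrative only).
    features = [[0.1, 0.1], [0.15, 0.05], [0.9, 0.9], [0.85, 0.95]]
    som = train_som(features, n_nodes=4)
    print(best_matching_unit(som, [0.12, 0.08]))
```

Manhattan distance needs only subtractions, absolute values, and additions, which may be one reason it suits a hardware design; the record itself does not state the authors' rationale.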