Speaker-independent speech recognition system using Kohonen self-organizing feature map
In the past few years, there has been much noteworthy advancement in artificial neural networks. One such classification of a neural network model was presented by Teuvo Kohonen, which produces what he calls self-organizing feature maps (SOFM) similar to how the brain works. The goal of the SOFM alg...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
1999
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etd_bachelors/11028 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
Summary: | In the past few years, there has been much noteworthy advancement in artificial neural networks. One such classification of a neural network model was presented by Teuvo Kohonen, which produces what he calls self-organizing feature maps (SOFM) similar to how the brain works. The goal of the SOFM algorithm is to transform an incoming signal pattern of arbitrary dimensions into a discrete map, and to perform this transformatoin adaptively in a topologically ordered fashion.
This pattern classification ability of SOFM is explored for a practical speech recognition problem in this project. This thesis aims to develop a system, using Kohonen's SOFM algorithm, to recognize single word utterances independent of the speaker. With the proper algorithm and training, the SOFM forms a clustering of the inputs to perform word recognition.
The speaker-independent speech recognition system accepts as input isolated words stored as digital speech files. The speech files are preprocessed in order to extract the LPC coefficients of each file, which will serve as the input to the neural network. The SOFM is used to create a topological map of the commands in an unsupervised fashion. Once a topological map is generated, fine-turning is done using Optimum Learning Vector Quantization 1 (OLVQ1) algorithm. An architectural structure of the final map is designed using VHDL software. The design implements the Manhattan Distance computation using the IEEE format on real numbers.
The system achieved a recognition rate of 97.5%. |
---|