Speech recognition using joint time frequency analysis
Speech is the ultimate interface. As computer telephony continues to gain mainstream appeal, new demands emerge for speech recognition solutions. Many techniques are currently available and proven effective such as the Linear Predictive Coding analysis which is the popular choice among speech featur...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2002
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etd_bachelors/14224 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etd_bachelors-14866 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etd_bachelors-148662021-11-11T07:57:40Z Speech recognition using joint time frequency analysis Abelgas, Minette G. Pagsibigan, Romel S. Sin, Johannes Paul S. Wu, Jue-Yu T. Speech is the ultimate interface. As computer telephony continues to gain mainstream appeal, new demands emerge for speech recognition solutions. Many techniques are currently available and proven effective such as the Linear Predictive Coding analysis which is the popular choice among speech feature extraction techniques. However, new techniques have emerged such as the Joint-Time Frequency analysis which as the name implies, checks both the time and frequency elements of a signal. Gabor Transform is a feature extraction algorithm that performs the process mentioned above. Speech Recognition System Using Joint Time-Frequency Analysis (SR-JTFA) is a discrete isolated word recognition system that was designed to recognized ten words. It is for study purposes and will determine how effective Gabor Transform is as a feature extraction technique. These words are predefined in the system's library. The user utters a word, through a microphone connected to a computer, that is part of the library and the system outputs the word that it matches onto. The results will then be tabulated using a confusion matrix to show the efficiency of the system in recognizing the words. The interface was designed using Visual Basic while Turbo C++ was used in designing the speech processing modules of the system. 2002-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/14224 Bachelor's Theses English Animo Repository Speech processing systems--Evaluation Speech perception Computer Sciences |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Speech processing systems--Evaluation Speech perception Computer Sciences |
spellingShingle |
Speech processing systems--Evaluation Speech perception Computer Sciences Abelgas, Minette G. Pagsibigan, Romel S. Sin, Johannes Paul S. Wu, Jue-Yu T. Speech recognition using joint time frequency analysis |
description |
Speech is the ultimate interface. As computer telephony continues to gain mainstream appeal, new demands emerge for speech recognition solutions. Many techniques are currently available and proven effective such as the Linear Predictive Coding analysis which is the popular choice among speech feature extraction techniques. However, new techniques have emerged such as the Joint-Time Frequency analysis which as the name implies, checks both the time and frequency elements of a signal. Gabor Transform is a feature extraction algorithm that performs the process mentioned above.
Speech Recognition System Using Joint Time-Frequency Analysis (SR-JTFA) is a discrete isolated word recognition system that was designed to recognized ten words. It is for study purposes and will determine how effective Gabor Transform is as a feature extraction technique. These words are predefined in the system's library. The user utters a word, through a microphone connected to a computer, that is part of the library and the system outputs the word that it matches onto. The results will then be tabulated using a confusion matrix to show the efficiency of the system in recognizing the words. The interface was designed using Visual Basic while Turbo C++ was used in designing the speech processing modules of the system. |
format |
text |
author |
Abelgas, Minette G. Pagsibigan, Romel S. Sin, Johannes Paul S. Wu, Jue-Yu T. |
author_facet |
Abelgas, Minette G. Pagsibigan, Romel S. Sin, Johannes Paul S. Wu, Jue-Yu T. |
author_sort |
Abelgas, Minette G. |
title |
Speech recognition using joint time frequency analysis |
title_short |
Speech recognition using joint time frequency analysis |
title_full |
Speech recognition using joint time frequency analysis |
title_fullStr |
Speech recognition using joint time frequency analysis |
title_full_unstemmed |
Speech recognition using joint time frequency analysis |
title_sort |
speech recognition using joint time frequency analysis |
publisher |
Animo Repository |
publishDate |
2002 |
url |
https://animorepository.dlsu.edu.ph/etd_bachelors/14224 |
_version_ |
1718382635228594176 |