Speech recognition using joint time frequency analysis

Speech is the ultimate interface. As computer telephony continues to gain mainstream appeal, new demands emerge for speech recognition solutions. Many proven techniques are currently available, such as Linear Predictive Coding analysis, which is the popular choice among speech featur...

Bibliographic Details
Main Authors: Abelgas, Minette G., Pagsibigan, Romel S., Sin, Johannes Paul S., Wu, Jue-Yu T.
Format: text
Language: English
Published: Animo Repository 2002
Subjects: Speech processing systems--Evaluation; Speech perception; Computer Sciences
Online Access: https://animorepository.dlsu.edu.ph/etd_bachelors/14224
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:etd_bachelors-14866
record_format eprints
building De La Salle University Library
continent Asia
country Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Speech processing systems--Evaluation
Speech perception
Computer Sciences
description Speech is the ultimate interface. As computer telephony continues to gain mainstream appeal, new demands emerge for speech recognition solutions. Many proven techniques are currently available, such as Linear Predictive Coding (LPC) analysis, which is the popular choice among speech feature extraction techniques. However, newer techniques have emerged, such as Joint Time-Frequency Analysis (JTFA), which, as the name implies, examines the time and frequency content of a signal together. The Gabor Transform is a feature extraction algorithm that performs this kind of analysis. Speech Recognition System Using Joint Time-Frequency Analysis (SR-JTFA) is an isolated-word recognition system designed to recognize ten words. It was built for study purposes, to determine how effective the Gabor Transform is as a feature extraction technique. The ten words are predefined in the system's library. The user utters one of the library words into a microphone connected to a computer, and the system outputs the word it matches. The results are then tabulated in a confusion matrix to show how effectively the system recognizes the words. The interface was designed in Visual Basic, while the speech processing modules were written in Turbo C++.
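As a rough illustration of the Gabor Transform step named in the description, the sketch below computes a discrete Gabor transform as a Gaussian-windowed short-time DFT. It is written in Python for brevity and is not the thesis's Turbo C++ code, which this record does not reproduce. The function names, the frame length of 64 samples, the hop of 32 samples, and the window width are all assumptions chosen for the example.

```python
import cmath
import math


def gaussian_window(length, sigma):
    """Gaussian window centered on the middle of the frame."""
    center = (length - 1) / 2.0
    return [math.exp(-0.5 * ((n - center) / sigma) ** 2) for n in range(length)]


def gabor_transform(signal, window_len=64, hop=32, sigma=None):
    """Discrete Gabor transform as a Gaussian-windowed short-time DFT.

    Returns a list of frames; frames[m][k] is the magnitude at time
    frame m and frequency bin k (0 .. window_len // 2).
    """
    if sigma is None:
        sigma = window_len / 6.0  # assumed width: frame spans about +/- 3 sigma
    window = gaussian_window(window_len, sigma)
    n_bins = window_len // 2 + 1
    frames = []
    # Slide the window along the signal, one hop at a time.
    for start in range(0, len(signal) - window_len + 1, hop):
        segment = [signal[start + n] * window[n] for n in range(window_len)]
        row = []
        # Direct DFT of the windowed segment (slow but dependency-free).
        for k in range(n_bins):
            acc = sum(
                segment[n] * cmath.exp(-2j * math.pi * k * n / window_len)
                for n in range(window_len)
            )
            row.append(abs(acc))
        frames.append(row)
    return frames


def dominant_bin(frame):
    """Index of the frequency bin with the largest magnitude."""
    return max(range(len(frame)), key=lambda k: frame[k])


if __name__ == "__main__":
    # Hypothetical test signal: the tone jumps from bin 4 to bin 12 halfway.
    signal = [math.sin(2 * math.pi * 4 * n / 64) for n in range(256)]
    signal += [math.sin(2 * math.pi * 12 * n / 64) for n in range(256, 512)]
    frames = gabor_transform(signal)
    print(len(frames), "frames of", len(frames[0]), "bins")
    print("dominant bin, first frame:", dominant_bin(frames[0]))
    print("dominant bin, last frame:", dominant_bin(frames[-1]))
```

On this test signal the dominant bin moves from 4 in the first frame to 12 in the last, which is the point of a joint time-frequency view: a single DFT over the whole signal would show both tones but not when each one occurs. A recognizer along the lines the description outlines could use the per-frame magnitudes as features, though how SR-JTFA actually derives and matches its features is not stated in this record.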