Speech comparison using discrete wavelet transform
Speech recognition is a fascinating application of digital signal processing that is used to automate many tasks that in the past required hands-on human interaction. Speech comparison is one component of speech recognition. The speech comparison system identifies speech by its transitory characteri...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2003
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etd_bachelors/14222 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etd_bachelors-14864 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etd_bachelors-148642021-11-11T07:43:55Z Speech comparison using discrete wavelet transform Baello, Kimberly Anne D. Nopuente, Mica Rose P. Salgado, Paul Christian R. Tan, Leonilda Marie M. Speech recognition is a fascinating application of digital signal processing that is used to automate many tasks that in the past required hands-on human interaction. Speech comparison is one component of speech recognition. The speech comparison system identifies speech by its transitory characteristics. Comparison is done between 48 sets of minimal pairs. A minimal pair can be defined as words that differ in one segment and are most likely to be confused with each other in speech. The system recognizes these barely discernible differences and identifies them in a speech signal. In comparing the speech signals, the Discrete Wavelet Transform is used because of its features and advantages over other methods like Fourier analysis - improved representation of signals with fewer coefficients, more suitable for non-stationary signals, and the use of time-domain data based on changing frequency components. Other processes performed in the speech signals include endpoint detection, noise reduction, normalization and windowing. Downsampling and Dynamic Time Warping are optional processing methods that may also be applied. The system is completely speaker-independent. It is able to determine that speakers are saying the same word even though they may be speaking at different rates, pitches, and intensities. The program may aid people who have difficulty in pronouncing certain phonemes since he can easily playback the correct pronunciation of each word in the system. 2003-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/14222 Bachelor's Theses English Animo Repository Automatic speech recognition Human-computer interaction Speech processing systems Speech perception Speech synthesis Natural language processing (Computer science) Computer Sciences |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Automatic speech recognition Human-computer interaction Speech processing systems Speech perception Speech synthesis Natural language processing (Computer science) Computer Sciences |
spellingShingle |
Automatic speech recognition Human-computer interaction Speech processing systems Speech perception Speech synthesis Natural language processing (Computer science) Computer Sciences Baello, Kimberly Anne D. Nopuente, Mica Rose P. Salgado, Paul Christian R. Tan, Leonilda Marie M. Speech comparison using discrete wavelet transform |
description |
Speech recognition is a fascinating application of digital signal processing that is used to automate many tasks that in the past required hands-on human interaction. Speech comparison is one component of speech recognition.
The speech comparison system identifies speech by its transitory characteristics. Comparison is done between 48 sets of minimal pairs. A minimal pair can be defined as words that differ in one segment and are most likely to be confused with each other in speech. The system recognizes these barely discernible differences and identifies them in a speech signal.
In comparing the speech signals, the Discrete Wavelet Transform is used because of its features and advantages over other methods like Fourier analysis - improved representation of signals with fewer coefficients, more suitable for non-stationary signals, and the use of time-domain data based on changing frequency components. Other processes performed in the speech signals include endpoint detection, noise reduction, normalization and windowing. Downsampling and Dynamic Time Warping are optional processing methods that may also be applied.
The system is completely speaker-independent. It is able to determine that speakers are saying the same word even though they may be speaking at different rates, pitches, and intensities. The program may aid people who have difficulty in pronouncing certain phonemes since he can easily playback the correct pronunciation of each word in the system. |
format |
text |
author |
Baello, Kimberly Anne D. Nopuente, Mica Rose P. Salgado, Paul Christian R. Tan, Leonilda Marie M. |
author_facet |
Baello, Kimberly Anne D. Nopuente, Mica Rose P. Salgado, Paul Christian R. Tan, Leonilda Marie M. |
author_sort |
Baello, Kimberly Anne D. |
title |
Speech comparison using discrete wavelet transform |
title_short |
Speech comparison using discrete wavelet transform |
title_full |
Speech comparison using discrete wavelet transform |
title_fullStr |
Speech comparison using discrete wavelet transform |
title_full_unstemmed |
Speech comparison using discrete wavelet transform |
title_sort |
speech comparison using discrete wavelet transform |
publisher |
Animo Repository |
publishDate |
2003 |
url |
https://animorepository.dlsu.edu.ph/etd_bachelors/14222 |
_version_ |
1718382634869981184 |