Speech compression using the discrete wavelet transform

Wavelet analysis is a relatively new technology. Wavelets are waveforms that occur in a very short duration that has a mean value of zero. Wavelet transform can represent non-stationary signals more effectively than Fourier transform since it retains both the time and frequency aspect of the signal....

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Russel Lloyd C.
Format: text
Published: Animo Repository 2003
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/7725
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-8453
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-84532022-11-15T01:07:07Z Speech compression using the discrete wavelet transform Lim, Russel Lloyd C. Wavelet analysis is a relatively new technology. Wavelets are waveforms that occur in a very short duration that has a mean value of zero. Wavelet transform can represent non-stationary signals more effectively than Fourier transform since it retains both the time and frequency aspect of the signal. This thesis applies wavelet analysis to speech compression. A mother or basis wavelet is first chosen for the compression. The signal is then decomposed to a set of scaled and translated versions of the mother wavelet. The resulting wavelet coefficients that are insignificant or close to zero are truncated achieving signal compression. Additional compression is realized by encoding of the signal. Analysis of the compression process was performed by comparing the compressed-decompressed signal against the original. This was conducted to determine the effect of the choice of mother wavelet on the speech compression. The results however showed that regardless of bases wavelet used the compression ratio is relatively close to one another. In terms of signal quality, coiflet of order 5 has been seen to be the best basis wavelet. This is taken from the analysis of the Signal to Noise Ratio (SNR) value and its Mean Opinion Score (MOS) rating. Moreover, a comparative analysis of current technologies against these results was performed. The system developed proved to be very efficient in terms of compressing speech files since the system was able to compress better than the current technology, Motion Pictures Expert Group -1 Layer 3 (MP3). The system was able to reduce a specific speech file to 57.41 % while MP3 with 112 kbps bit rate rated at 69.27% against the original signal. Another speech signal was reduced to 58.31 % while the MP3 rated at 70.55%, against the original speech signal. The drawback however is that signal fidelity produced by the system was unable to match the signal quality produced by the current technology, MP3. 2003-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/7725 Faculty Research Work Animo Repository Data compression (Computer science) Wavelets (Mathematics) Numerical Analysis and Scientific Computing
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Data compression (Computer science)
Wavelets (Mathematics)
Numerical Analysis and Scientific Computing
spellingShingle Data compression (Computer science)
Wavelets (Mathematics)
Numerical Analysis and Scientific Computing
Lim, Russel Lloyd C.
Speech compression using the discrete wavelet transform
description Wavelet analysis is a relatively new technology. Wavelets are waveforms that occur in a very short duration that has a mean value of zero. Wavelet transform can represent non-stationary signals more effectively than Fourier transform since it retains both the time and frequency aspect of the signal. This thesis applies wavelet analysis to speech compression. A mother or basis wavelet is first chosen for the compression. The signal is then decomposed to a set of scaled and translated versions of the mother wavelet. The resulting wavelet coefficients that are insignificant or close to zero are truncated achieving signal compression. Additional compression is realized by encoding of the signal. Analysis of the compression process was performed by comparing the compressed-decompressed signal against the original. This was conducted to determine the effect of the choice of mother wavelet on the speech compression. The results however showed that regardless of bases wavelet used the compression ratio is relatively close to one another. In terms of signal quality, coiflet of order 5 has been seen to be the best basis wavelet. This is taken from the analysis of the Signal to Noise Ratio (SNR) value and its Mean Opinion Score (MOS) rating. Moreover, a comparative analysis of current technologies against these results was performed. The system developed proved to be very efficient in terms of compressing speech files since the system was able to compress better than the current technology, Motion Pictures Expert Group -1 Layer 3 (MP3). The system was able to reduce a specific speech file to 57.41 % while MP3 with 112 kbps bit rate rated at 69.27% against the original signal. Another speech signal was reduced to 58.31 % while the MP3 rated at 70.55%, against the original speech signal. The drawback however is that signal fidelity produced by the system was unable to match the signal quality produced by the current technology, MP3.
format text
author Lim, Russel Lloyd C.
author_facet Lim, Russel Lloyd C.
author_sort Lim, Russel Lloyd C.
title Speech compression using the discrete wavelet transform
title_short Speech compression using the discrete wavelet transform
title_full Speech compression using the discrete wavelet transform
title_fullStr Speech compression using the discrete wavelet transform
title_full_unstemmed Speech compression using the discrete wavelet transform
title_sort speech compression using the discrete wavelet transform
publisher Animo Repository
publishDate 2003
url https://animorepository.dlsu.edu.ph/faculty_research/7725
_version_ 1767196754457919488