Speech synthesis using HMM technique
This dissertation implements the speech recognition of letters of alphabet and digits with and accuracy of 96.15% and 100% respectively and also implements and HMM-based speech synthesis system in which the speech waveform is generated from HMMs themselves. The system is modeled by multispace probab...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Theses and Dissertations |
Language: | English |
Published: |
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/18814 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-18814 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-188142023-07-04T15:21:23Z Speech synthesis using HMM technique May, Thwe Khaing. Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing This dissertation implements the speech recognition of letters of alphabet and digits with and accuracy of 96.15% and 100% respectively and also implements and HMM-based speech synthesis system in which the speech waveform is generated from HMMs themselves. The system is modeled by multispace probability distribution HMMs and multi-dimensional Gaussian distributions respectively. The distributions for spectral parameter, pitch parameter and the state duration are clustered independently by using a decision-tree based vocoding technique. The proposed system has been confirmed successfully that it synthesized natural-sounding speech which resembles the speaker in the training database, this hidden Markov Model (HMM) has found widespread use in automatic speech recognition. And the system can change voice qualities of synthesized speech by transforming HMM parameters. Master of Science (Signal Processing) 2009-07-20T03:04:12Z 2009-07-20T03:04:12Z 2008 2008 Thesis http://hdl.handle.net/10356/18814 en 86 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing May, Thwe Khaing. Speech synthesis using HMM technique |
description |
This dissertation implements the speech recognition of letters of alphabet and digits with and accuracy of 96.15% and 100% respectively and also implements and HMM-based speech synthesis system in which the speech waveform is generated from HMMs themselves. The system is modeled by multispace probability distribution HMMs and multi-dimensional Gaussian distributions respectively. The distributions for spectral parameter, pitch parameter and the state duration are clustered independently by using a decision-tree based vocoding technique. The proposed system has been confirmed successfully that it synthesized natural-sounding speech which resembles the speaker in the training database, this hidden Markov Model (HMM) has found widespread use in automatic speech recognition. And the system can change voice qualities of synthesized speech by transforming HMM parameters. |
author2 |
Foo Say Wei |
author_facet |
Foo Say Wei May, Thwe Khaing. |
format |
Theses and Dissertations |
author |
May, Thwe Khaing. |
author_sort |
May, Thwe Khaing. |
title |
Speech synthesis using HMM technique |
title_short |
Speech synthesis using HMM technique |
title_full |
Speech synthesis using HMM technique |
title_fullStr |
Speech synthesis using HMM technique |
title_full_unstemmed |
Speech synthesis using HMM technique |
title_sort |
speech synthesis using hmm technique |
publishDate |
2009 |
url |
http://hdl.handle.net/10356/18814 |
_version_ |
1772826060818219008 |