Synthesis of speech
A text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions. The initial step is that articulatory speech synthesis attempts to mathematically model the human vocal system through area functions of its vocal o...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/65296 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-65296 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-652962023-07-07T15:47:56Z Synthesis of speech Yeo, Poh Cheng Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering A text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions. The initial step is that articulatory speech synthesis attempts to mathematically model the human vocal system through area functions of its vocal organs. In this project, the author has used a male vocal system and designed a large unit inventory practical. Next, the design of the units is acoustically driven but limited by a complex pronunciation model. Finally, concatenative speech synthesis links different length of prerecorded speech samples together that were obtained from natural speech. Fortunately, concatenative synthesis requires less computational complexity at the expense of larger memory space. A set of vocabularies and a prerecorded speech corpus containing 40 phonemes serves as the basic acoustical units were created by the author before testing and carrying out the experiment for the TIS system. Further research was done on how other methods like Pitch-Synchronous Overlap Add (PSOLA) can better concatenate the chain of sound elements and eventually produce a continuous speech in English. Bachelor of Engineering 2015-07-09T02:38:50Z 2015-07-09T02:38:50Z 2007 2007 Final Year Project (FYP) http://hdl.handle.net/10356/65296 en Nanyang Technological University 73 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Yeo, Poh Cheng Synthesis of speech |
description |
A text-to-speech (TTS) synthesis should contribute to the pleasantness,
intelligibility, and speed of speech-based human-machine interactions.
The initial step is that articulatory speech synthesis attempts to mathematically
model the human vocal system through area functions of its vocal organs. In this
project, the author has used a male vocal system and designed a large unit
inventory practical. Next, the design of the units is acoustically driven but limited
by a complex pronunciation model.
Finally, concatenative speech synthesis links different length of prerecorded
speech samples together that were obtained from natural speech. Fortunately,
concatenative synthesis requires less computational complexity at the expense
of larger memory space.
A set of vocabularies and a prerecorded speech corpus containing 40 phonemes
serves as the basic acoustical units were created by the author before testing
and carrying out the experiment for the TIS system. Further research was done
on how other methods like Pitch-Synchronous Overlap Add (PSOLA) can better
concatenate the chain of sound elements and eventually produce a continuous
speech in English. |
author2 |
Foo Say Wei |
author_facet |
Foo Say Wei Yeo, Poh Cheng |
format |
Final Year Project |
author |
Yeo, Poh Cheng |
author_sort |
Yeo, Poh Cheng |
title |
Synthesis of speech |
title_short |
Synthesis of speech |
title_full |
Synthesis of speech |
title_fullStr |
Synthesis of speech |
title_full_unstemmed |
Synthesis of speech |
title_sort |
synthesis of speech |
publishDate |
2015 |
url |
http://hdl.handle.net/10356/65296 |
_version_ |
1772825853716070400 |