Synthesis of speech

A text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions. The initial step is that articulatory speech synthesis attempts to mathematically model the human vocal system through area functions of its vocal o...

Full description

Saved in:
Bibliographic Details
Main Author: Yeo, Poh Cheng
Other Authors: Foo Say Wei
Format: Final Year Project
Language:English
Published: 2015
Subjects:
Online Access:http://hdl.handle.net/10356/65296
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-65296
record_format dspace
spelling sg-ntu-dr.10356-652962023-07-07T15:47:56Z Synthesis of speech Yeo, Poh Cheng Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering A text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions. The initial step is that articulatory speech synthesis attempts to mathematically model the human vocal system through area functions of its vocal organs. In this project, the author has used a male vocal system and designed a large unit inventory practical. Next, the design of the units is acoustically driven but limited by a complex pronunciation model. Finally, concatenative speech synthesis links different length of prerecorded speech samples together that were obtained from natural speech. Fortunately, concatenative synthesis requires less computational complexity at the expense of larger memory space. A set of vocabularies and a prerecorded speech corpus containing 40 phonemes serves as the basic acoustical units were created by the author before testing and carrying out the experiment for the TIS system. Further research was done on how other methods like Pitch-Synchronous Overlap Add (PSOLA) can better concatenate the chain of sound elements and eventually produce a continuous speech in English. Bachelor of Engineering 2015-07-09T02:38:50Z 2015-07-09T02:38:50Z 2007 2007 Final Year Project (FYP) http://hdl.handle.net/10356/65296 en Nanyang Technological University 73 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Yeo, Poh Cheng
Synthesis of speech
description A text-to-speech (TTS) synthesis should contribute to the pleasantness, intelligibility, and speed of speech-based human-machine interactions. The initial step is that articulatory speech synthesis attempts to mathematically model the human vocal system through area functions of its vocal organs. In this project, the author has used a male vocal system and designed a large unit inventory practical. Next, the design of the units is acoustically driven but limited by a complex pronunciation model. Finally, concatenative speech synthesis links different length of prerecorded speech samples together that were obtained from natural speech. Fortunately, concatenative synthesis requires less computational complexity at the expense of larger memory space. A set of vocabularies and a prerecorded speech corpus containing 40 phonemes serves as the basic acoustical units were created by the author before testing and carrying out the experiment for the TIS system. Further research was done on how other methods like Pitch-Synchronous Overlap Add (PSOLA) can better concatenate the chain of sound elements and eventually produce a continuous speech in English.
author2 Foo Say Wei
author_facet Foo Say Wei
Yeo, Poh Cheng
format Final Year Project
author Yeo, Poh Cheng
author_sort Yeo, Poh Cheng
title Synthesis of speech
title_short Synthesis of speech
title_full Synthesis of speech
title_fullStr Synthesis of speech
title_full_unstemmed Synthesis of speech
title_sort synthesis of speech
publishDate 2015
url http://hdl.handle.net/10356/65296
_version_ 1772825853716070400