Computer speaks Mandarin
The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative synthesis. Concatenative synthesis is a method that utilizes pre-recorded samples of natural syllables to generate any desired synthesized speech. In this project, Time Domain Pitch Synchronous Over...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2011
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/45847 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-45847 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-458472023-07-07T17:08:37Z Computer speaks Mandarin Chuah, Ree Gann. Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative synthesis. Concatenative synthesis is a method that utilizes pre-recorded samples of natural syllables to generate any desired synthesized speech. In this project, Time Domain Pitch Synchronous Overlap Add technique (TD-PSOLA) is adopted. This approach allows concatenation among pre-recorded syllable samples and provides flexibility in controlling the duration and pitch of each syllable. Chinese syllable is chosen as the basic synthesis unit in this project. More than 1300 Chinese syllables were pre-recorded for this project. A natural speech sentence is pre-recorded and serves as a guideline for pitch and duration of the concatenated speech. Subsequently, TD-PSOLA is used to generate the ideal synthesized speech by modifying the pitch and duration of the concatenated speech so that the speech quality of synthesized version remains as close as the natural speech. TD-PSOLA allows the sample syllables to be stretched or compressed so that it can fit the ideal time duration. The pitch can also be modified to generate the desired pitch contour using the natural speech as a guideline. Bachelor of Engineering 2011-06-22T06:42:34Z 2011-06-22T06:42:34Z 2011 2011 Final Year Project (FYP) http://hdl.handle.net/10356/45847 en Nanyang Technological University 58 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing Chuah, Ree Gann. Computer speaks Mandarin |
description |
The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative synthesis. Concatenative synthesis is a method that utilizes pre-recorded samples of natural syllables to generate any desired synthesized speech.
In this project, Time Domain Pitch Synchronous Overlap Add technique (TD-PSOLA) is adopted. This approach allows concatenation among pre-recorded syllable samples and provides flexibility in controlling the duration and pitch of each syllable. Chinese syllable is chosen as the basic synthesis unit in this project. More than 1300 Chinese syllables were pre-recorded for this project.
A natural speech sentence is pre-recorded and serves as a guideline for pitch and duration of the concatenated speech. Subsequently, TD-PSOLA is used to generate the ideal synthesized speech by modifying the pitch and duration of the concatenated speech so that the speech quality of synthesized version remains as close as the natural speech.
TD-PSOLA allows the sample syllables to be stretched or compressed so that it can fit the ideal time duration. The pitch can also be modified to generate the desired pitch contour using the natural speech as a guideline. |
author2 |
Foo Say Wei |
author_facet |
Foo Say Wei Chuah, Ree Gann. |
format |
Final Year Project |
author |
Chuah, Ree Gann. |
author_sort |
Chuah, Ree Gann. |
title |
Computer speaks Mandarin |
title_short |
Computer speaks Mandarin |
title_full |
Computer speaks Mandarin |
title_fullStr |
Computer speaks Mandarin |
title_full_unstemmed |
Computer speaks Mandarin |
title_sort |
computer speaks mandarin |
publishDate |
2011 |
url |
http://hdl.handle.net/10356/45847 |
_version_ |
1772828141634453504 |