Computer speaks Mandarin

The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative synthesis. Concatenative synthesis is a method that utilizes pre-recorded samples of natural syllables to generate any desired synthesized speech. In this project, Time Domain Pitch Synchronous Over...

Full description

Saved in:
Bibliographic Details
Main Author: Chuah, Ree Gann.
Other Authors: Foo Say Wei
Format: Final Year Project
Language:English
Published: 2011
Subjects:
Online Access:http://hdl.handle.net/10356/45847
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-45847
record_format dspace
spelling sg-ntu-dr.10356-458472023-07-07T17:08:37Z Computer speaks Mandarin Chuah, Ree Gann. Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative synthesis. Concatenative synthesis is a method that utilizes pre-recorded samples of natural syllables to generate any desired synthesized speech. In this project, Time Domain Pitch Synchronous Overlap Add technique (TD-PSOLA) is adopted. This approach allows concatenation among pre-recorded syllable samples and provides flexibility in controlling the duration and pitch of each syllable. Chinese syllable is chosen as the basic synthesis unit in this project. More than 1300 Chinese syllables were pre-recorded for this project. A natural speech sentence is pre-recorded and serves as a guideline for pitch and duration of the concatenated speech. Subsequently, TD-PSOLA is used to generate the ideal synthesized speech by modifying the pitch and duration of the concatenated speech so that the speech quality of synthesized version remains as close as the natural speech. TD-PSOLA allows the sample syllables to be stretched or compressed so that it can fit the ideal time duration. The pitch can also be modified to generate the desired pitch contour using the natural speech as a guideline. Bachelor of Engineering 2011-06-22T06:42:34Z 2011-06-22T06:42:34Z 2011 2011 Final Year Project (FYP) http://hdl.handle.net/10356/45847 en Nanyang Technological University 58 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
spellingShingle DRNTU::Engineering::Electrical and electronic engineering::Electronic systems::Signal processing
Chuah, Ree Gann.
Computer speaks Mandarin
description The objective of this project is to develop a Chinese Text-to-Speech technique using concatenative synthesis. Concatenative synthesis is a method that utilizes pre-recorded samples of natural syllables to generate any desired synthesized speech. In this project, Time Domain Pitch Synchronous Overlap Add technique (TD-PSOLA) is adopted. This approach allows concatenation among pre-recorded syllable samples and provides flexibility in controlling the duration and pitch of each syllable. Chinese syllable is chosen as the basic synthesis unit in this project. More than 1300 Chinese syllables were pre-recorded for this project. A natural speech sentence is pre-recorded and serves as a guideline for pitch and duration of the concatenated speech. Subsequently, TD-PSOLA is used to generate the ideal synthesized speech by modifying the pitch and duration of the concatenated speech so that the speech quality of synthesized version remains as close as the natural speech. TD-PSOLA allows the sample syllables to be stretched or compressed so that it can fit the ideal time duration. The pitch can also be modified to generate the desired pitch contour using the natural speech as a guideline.
author2 Foo Say Wei
author_facet Foo Say Wei
Chuah, Ree Gann.
format Final Year Project
author Chuah, Ree Gann.
author_sort Chuah, Ree Gann.
title Computer speaks Mandarin
title_short Computer speaks Mandarin
title_full Computer speaks Mandarin
title_fullStr Computer speaks Mandarin
title_full_unstemmed Computer speaks Mandarin
title_sort computer speaks mandarin
publishDate 2011
url http://hdl.handle.net/10356/45847
_version_ 1772828141634453504