Make the computer talk

Speech synthesis is an artificial intelligence technology that enables a computer to talk. Text-to-Speech (TTS) synthesis is the part of speech synthesis in which text is converted to speech through methods such as articulatory, formant and concatenative synthe...

Bibliographic Details
Main Author:	Chan, Tai Tat
Other Authors:	Foo Say Wei
Format:	Final Year Project
Language:	English
Published:	2016
Subjects:	DRNTU::Engineering::Electrical and electronic engineering
Online Access:	http://hdl.handle.net/10356/68994
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-68994
record_format	dspace
spelling	sg-ntu-dr.10356-68994 2023-07-07T15:42:27Z Make the computer talk Chan, Tai Tat Foo Say Wei School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering Speech synthesis is an artificial intelligence technology that enables a computer to talk. Text-to-Speech (TTS) synthesis is the part of speech synthesis in which text is converted to speech through methods such as articulatory, formant and concatenative synthesis. Concatenative synthesis is one of the most popular TTS methods because it produces more human-like sound. Pre-recorded speech is concatenated, and the pitch and duration of the output are modified according to its suprasegmental features. Suprasegmental features convey the emotion and meaning carried between words and sentences. With the help of a grammar model, the grammatical structure of a sentence can be determined, which greatly aids in applying suprasegmental features to speech signals. Finally, modifying the pitch and duration of a speech signal belongs to the field of speech processing. There are many algorithms for pitch-mark detection; an algorithm based on fundamental frequency and enveloping is developed and discussed. The PSOLA method and the intelligibility of its output are discussed, and a simple algorithm to improve the intelligibility of a speech signal undergoing pitch and duration modification is also developed and discussed. Bachelor of Engineering 2016-08-23T03:21:25Z 2016-08-23T03:21:25Z 2016 Final Year Project (FYP) http://hdl.handle.net/10356/68994 en Nanyang Technological University 72 p. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Electrical and electronic engineering
spellingShingle	DRNTU::Engineering::Electrical and electronic engineering Chan, Tai Tat Make the computer talk
description	Speech synthesis is an artificial intelligence technology that enables a computer to talk. Text-to-Speech (TTS) synthesis is the part of speech synthesis in which text is converted to speech through methods such as articulatory, formant and concatenative synthesis. Concatenative synthesis is one of the most popular TTS methods because it produces more human-like sound. Pre-recorded speech is concatenated, and the pitch and duration of the output are modified according to its suprasegmental features. Suprasegmental features convey the emotion and meaning carried between words and sentences. With the help of a grammar model, the grammatical structure of a sentence can be determined, which greatly aids in applying suprasegmental features to speech signals. Finally, modifying the pitch and duration of a speech signal belongs to the field of speech processing. There are many algorithms for pitch-mark detection; an algorithm based on fundamental frequency and enveloping is developed and discussed. The PSOLA method and the intelligibility of its output are discussed, and a simple algorithm to improve the intelligibility of a speech signal undergoing pitch and duration modification is also developed and discussed.
author2	Foo Say Wei
author_facet	Foo Say Wei Chan, Tai Tat
format	Final Year Project
author	Chan, Tai Tat
author_sort	Chan, Tai Tat
title	Make the computer talk
title_short	Make the computer talk
title_full	Make the computer talk
title_fullStr	Make the computer talk
title_full_unstemmed	Make the computer talk
title_sort	make the computer talk
publishDate	2016
url	http://hdl.handle.net/10356/68994
_version_	1772826270801854464

Similar Items