Filipino text-to-speech system: Tagapagsalita, 2

One of the main types of speech processing technologies today is Text-To-Speech (TTS) synthesis. This technology converts normal language text into speech. Many studies have been conducted to develop TTS systems for various languages. In this Filipino TTS, there are 327 diphones extracted from sets...

Full description

Saved in:

Bibliographic Details
Main Authors:	Jimenez, Jerick T., Juliano, Faye S., Silva, Elrick Jan P.
Format:	text
Language:	English
Published:	Animo Repository 2009
Subjects:	Filipino language Technological innovations
Online Access:	https://animorepository.dlsu.edu.ph/etd_bachelors/7657
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	De La Salle University
Language:	English

id	oai:animorepository.dlsu.edu.ph:etd_bachelors-8302
record_format	eprints
spelling	oai:animorepository.dlsu.edu.ph:etd_bachelors-83022021-07-30T04:17:42Z Filipino text-to-speech system: Tagapagsalita, 2 Jimenez, Jerick T. Juliano, Faye S. Silva, Elrick Jan P. One of the main types of speech processing technologies today is Text-To-Speech (TTS) synthesis. This technology converts normal language text into speech. Many studies have been conducted to develop TTS systems for various languages. In this Filipino TTS, there are 327 diphones extracted from sets of Filipino words, 234 are found valid. Diphones will undergo pre-processing and will be compressed using Linear Predictive Coding (LPC). Through inverse LPC, the diphones can be reproduce using the coefficients and excitations stored in the codebook. After the diphones are synthesized, its pitch, volume and duration are manipulated by a scaling factor depending on the accent mark assigned to it. Once the accent is applied to the diphone, it will be concatenated with the other diphones with the means of Overlap-Add Method (OLA) to form the output signal of the system. 25 respondents were asked to evaluate the system based on ease, syllabication, stress, articulation, and speed with the score of five being the highest and one being the lowest. The average of results for all uttered speech scored 4.453 for listening ease, 4.42 for syllabication, 3.83 for stress, 4.06 for articulation and 3.51 for speed. The linguist's average score are 3.86 for listening ease, 3.36 for syllabication, 2.3 for stress, 3 for articulation and 3.51 for speed. Also, the respondents were asked to do the accent mark test by listening to 15 Filipino words and identify the word that they heard based on the choices indicated in the survey sheet. An average score 11.21 out of 15 questions were achieved by the respondents in identifying the Filipino Heteronyms while the linguist's score was 13 out of 15. 2009-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/7657 Bachelor's Theses English Animo Repository Filipino language Technological innovations
institution	De La Salle University
building	De La Salle University Library
continent	Asia
country	Philippines Philippines
content_provider	De La Salle University Library
collection	DLSU Institutional Repository
language	English
topic	Filipino language Technological innovations
spellingShingle	Filipino language Technological innovations Jimenez, Jerick T. Juliano, Faye S. Silva, Elrick Jan P. Filipino text-to-speech system: Tagapagsalita, 2
description	One of the main types of speech processing technologies today is Text-To-Speech (TTS) synthesis. This technology converts normal language text into speech. Many studies have been conducted to develop TTS systems for various languages. In this Filipino TTS, there are 327 diphones extracted from sets of Filipino words, 234 are found valid. Diphones will undergo pre-processing and will be compressed using Linear Predictive Coding (LPC). Through inverse LPC, the diphones can be reproduce using the coefficients and excitations stored in the codebook. After the diphones are synthesized, its pitch, volume and duration are manipulated by a scaling factor depending on the accent mark assigned to it. Once the accent is applied to the diphone, it will be concatenated with the other diphones with the means of Overlap-Add Method (OLA) to form the output signal of the system. 25 respondents were asked to evaluate the system based on ease, syllabication, stress, articulation, and speed with the score of five being the highest and one being the lowest. The average of results for all uttered speech scored 4.453 for listening ease, 4.42 for syllabication, 3.83 for stress, 4.06 for articulation and 3.51 for speed. The linguist's average score are 3.86 for listening ease, 3.36 for syllabication, 2.3 for stress, 3 for articulation and 3.51 for speed. Also, the respondents were asked to do the accent mark test by listening to 15 Filipino words and identify the word that they heard based on the choices indicated in the survey sheet. An average score 11.21 out of 15 questions were achieved by the respondents in identifying the Filipino Heteronyms while the linguist's score was 13 out of 15.
format	text
author	Jimenez, Jerick T. Juliano, Faye S. Silva, Elrick Jan P.
author_facet	Jimenez, Jerick T. Juliano, Faye S. Silva, Elrick Jan P.
author_sort	Jimenez, Jerick T.
title	Filipino text-to-speech system: Tagapagsalita, 2
title_short	Filipino text-to-speech system: Tagapagsalita, 2
title_full	Filipino text-to-speech system: Tagapagsalita, 2
title_fullStr	Filipino text-to-speech system: Tagapagsalita, 2
title_full_unstemmed	Filipino text-to-speech system: Tagapagsalita, 2
title_sort	filipino text-to-speech system: tagapagsalita, 2
publisher	Animo Repository
publishDate	2009
url	https://animorepository.dlsu.edu.ph/etd_bachelors/7657
_version_	1712576759538384896

Filipino text-to-speech system: Tagapagsalita, 2

Similar Items