Filipino text-to-speech system: Tagapagsalita
Although computers can be made to speak like humans, the output is likely to sound artificial or synthetic. Such speech is normally produced by a Text-to-Speech (TTS) system. Few studies have been conducted to implement TTS systems in Tagalog. In this research a TTS system specifically designed for th...
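Main Authors: | Aralar, Kevin Romualdo A.; Coloso, Paolo Miguel H.; Moneda, Jerlyn R. |
---|---|
Format: | text |
Language: | English |
Published: | Animo Repository, 2006 |
Subjects: | Filipino language; Technological innovations |
Online Access: | https://animorepository.dlsu.edu.ph/etd_bachelors/7656 |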
Institution: | De La Salle University |
id |
oai:animorepository.dlsu.edu.ph:etd_bachelors-8301 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etd_bachelors-8301 2021-07-30T08:05:54Z Filipino text-to-speech system: Tagapagsalita Aralar, Kevin Romualdo A. Coloso, Paolo Miguel H. Moneda, Jerlyn R. 2006-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/7656 Bachelor's Theses English Animo Repository Filipino language Technological innovations |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Filipino language; Technological innovations |
description |
Although computers can be made to speak like humans, the output is likely to sound artificial or synthetic. Such speech is normally produced by a Text-to-Speech (TTS) system. Few studies have been conducted to implement TTS systems in Tagalog. In this research, a TTS system specifically designed for the Tagalog number words Isa to Isandaan was developed. The system works in three major stages. First, the diphones present in the words Isa to Isandaan were recorded, cut and denoised using a third-party program specialising in audio processing. Second, the pre-processed signals were compressed using Linear Predictive Coding (LPC): the signals were passed through a reversible filter that extracts the LPC coefficients, per-frame gains and excitation. Finally, these parameters were taken and the filter was reversed to produce a synthetic version of the original diphones. Using the Synchronous Overlap-Add (SOLA) technique, the reconstructed diphones were concatenated into whole words. In keeping with its purpose, the system was evaluated on intelligibility. Thirty-one persons were asked to rate listening effort, syllabication, stress, articulation and speed, with a score of 1 being the lowest and 5 the highest. The mean opinion score (MOS) of 30 persons averaged 4.30 for listening effort, 4.27 for syllabication, 4.16 for stress, 4.18 for articulation and 4.07 for speed in all significant words for male, and 4.25 for listening effort, 4.29 for syllabication, 4.16 for stress, 4.18 for articulation and 4.14 for speed in all significant words for female.
Discrepancies in speech intelligibility and quality are attributed largely to the preprocessing phase of the speech signal and to the subjective perception of the listener, which depends on prosodic parameters such as pitch, duration and amplitude, as seen in the MOS results for the synthetic utterance of the Tagalog word Isandaan. LPC is a useful tool for compression, since it can extract the information needed for speech synthesis without affecting the intelligibility of the speech. |
format |
text |
author |
Aralar, Kevin Romualdo A.; Coloso, Paolo Miguel H.; Moneda, Jerlyn R. |
author_facet |
Aralar, Kevin Romualdo A.; Coloso, Paolo Miguel H.; Moneda, Jerlyn R. |
author_sort |
Aralar, Kevin Romualdo A. |
title |
Filipino text-to-speech system: Tagapagsalita |
publisher |
Animo Repository |
publishDate |
2006 |
url |
https://animorepository.dlsu.edu.ph/etd_bachelors/7656 |
_version_ |
1712576759363272704 |
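
The abstract describes LPC as a reversible filter that yields coefficients, a per-frame gain and an excitation, which are then reversed to rebuild the diphone. The following is a minimal sketch of that round trip for a single frame, not the thesis implementation. It assumes the autocorrelation method, a filter order of 10, an 8 kHz sampling rate, a 240-sample frame and a synthetic test signal; the function names `lpc_analyze` and `lpc_synthesize` are hypothetical, and none of these choices are stated in the record.

```python
import numpy as np

def lpc_analyze(frame, order):
    """Analyse one frame: return (coefficients, gain, unit-RMS excitation)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    # Toeplitz normal equations R a = r[1:], lightly regularised for stability.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R = R + 1e-9 * r[0] * np.eye(order)
    a = np.linalg.solve(R, r[1:])
    # Prediction residual e[n] = s[n] - sum_k a[k] * s[n - k], zero initial state.
    residual = frame.copy()
    for k in range(1, order + 1):
        residual[k:] -= a[k - 1] * frame[:-k]
    gain = np.sqrt(np.mean(residual ** 2))
    excitation = residual / gain if gain > 0 else residual
    return a, gain, excitation

def lpc_synthesize(a, gain, excitation):
    """Reverse the analysis filter: s[n] = gain * e[n] + sum_k a[k] * s[n - k]."""
    order = len(a)
    out = np.zeros(len(excitation))
    for i in range(len(excitation)):
        # Only samples that already exist contribute to the prediction.
        past = sum(a[k] * out[i - 1 - k] for k in range(min(order, i)))
        out[i] = gain * excitation[i] + past
    return out

# Assumed test frame: two sinusoids plus a little noise, 30 ms at 8 kHz.
rng = np.random.default_rng(0)
t = np.arange(240) / 8000.0
frame = (np.sin(2 * np.pi * 200 * t)
         + 0.5 * np.sin(2 * np.pi * 900 * t)
         + 0.05 * rng.standard_normal(240))

a, gain, exc = lpc_analyze(frame, order=10)
rebuilt = lpc_synthesize(a, gain, exc)
max_error = np.max(np.abs(rebuilt - frame))
```

When the full excitation is kept, as here, the synthesis filter reproduces the frame up to floating-point error, so `max_error` should be very small. Compression in a real system comes from storing the excitation more coarsely than the original samples, a step this sketch does not model.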