Filipino text-to-speech system: Tagapagsalita

Although computers can be used to speak like humans, it is more likely to sound artificial or synthetic. Such a task is normally performed by a Text-to-Speech (TTS) system. Few studies have been conducted to implement TTS systems in Tagalog. In this research a TTS system specifically designed for th...

Full description

Saved in:
Bibliographic Details
Main Authors: Aralar, Kevin Romualdo A., Coloso, Paolo Miguel H., Moneda, Jerlyn R.
Format: text
Language:English
Published: Animo Repository 2006
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etd_bachelors/7656
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etd_bachelors-8301
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etd_bachelors-83012021-07-30T08:05:54Z Filipino text-to-speech system: Tagapagsalita Aralar, Kevin Romualdo A. Coloso, Paolo Miguel H. Moneda, Jerlyn R. Although computers can be used to speak like humans, it is more likely to sound artificial or synthetic. Such a task is normally performed by a Text-to-Speech (TTS) system. Few studies have been conducted to implement TTS systems in Tagalog. In this research a TTS system specifically designed for the Tagalog number words Isa to Isandaan was developed. This TTS system works in three major stages. Diphones present in the words Isa to Isandaan were first recorded, cut and denoised using a third party program specialising in audio processing. The pre-processed signals were compressed using Linear Predictive Coding the signals were passed to a reversible filter which extracts LPC Coeffecients, per frame gains and excitation. Finally, these parameters were taken and reversed to produced a synthethic version of the original diphones. Through the use of the Synchronous Overlap-Add (SOLA) technique, reconstructed diphones were concatenated into whole words. Based on its purpose, testing of the system was rated by intelligibility. Thirty-one persons were requested to articulation and speed with the score of 1 being the lowest and 5 being the highest score. Mean opinion score of 30 persons scored an average of 4.30 for listening effort, 4.27 for syllabication, 4.16 for stress, 4.18 for articulation, 4.07 for speed in all significant words for male and 4.25 for listening effort, 4.29 for syllabication, 4.16 for stress, 4.18 for articulation and 4.14 for speed in all significant words for female. Discrepancies of the speech intelligibility and quality are much attributed to the preprocessing phase of the speech signal and also to the subjective perception of the respondent listener based upon the prosodic parameters like pitch, duration and amplitude as seen from the result of the MOS of the synthetic uttered tagalog word Isandaan . Linear Predictive Coding technique is a useful tool for compression, since it can extract information for the synthesis of speech without affecting the intelligibility of the speech. 2006-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/etd_bachelors/7656 Bachelor's Theses English Animo Repository Filipino language Technological innovations
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Filipino language
Technological innovations
spellingShingle Filipino language
Technological innovations
Aralar, Kevin Romualdo A.
Coloso, Paolo Miguel H.
Moneda, Jerlyn R.
Filipino text-to-speech system: Tagapagsalita
description Although computers can be used to speak like humans, it is more likely to sound artificial or synthetic. Such a task is normally performed by a Text-to-Speech (TTS) system. Few studies have been conducted to implement TTS systems in Tagalog. In this research a TTS system specifically designed for the Tagalog number words Isa to Isandaan was developed. This TTS system works in three major stages. Diphones present in the words Isa to Isandaan were first recorded, cut and denoised using a third party program specialising in audio processing. The pre-processed signals were compressed using Linear Predictive Coding the signals were passed to a reversible filter which extracts LPC Coeffecients, per frame gains and excitation. Finally, these parameters were taken and reversed to produced a synthethic version of the original diphones. Through the use of the Synchronous Overlap-Add (SOLA) technique, reconstructed diphones were concatenated into whole words. Based on its purpose, testing of the system was rated by intelligibility. Thirty-one persons were requested to articulation and speed with the score of 1 being the lowest and 5 being the highest score. Mean opinion score of 30 persons scored an average of 4.30 for listening effort, 4.27 for syllabication, 4.16 for stress, 4.18 for articulation, 4.07 for speed in all significant words for male and 4.25 for listening effort, 4.29 for syllabication, 4.16 for stress, 4.18 for articulation and 4.14 for speed in all significant words for female. Discrepancies of the speech intelligibility and quality are much attributed to the preprocessing phase of the speech signal and also to the subjective perception of the respondent listener based upon the prosodic parameters like pitch, duration and amplitude as seen from the result of the MOS of the synthetic uttered tagalog word Isandaan . Linear Predictive Coding technique is a useful tool for compression, since it can extract information for the synthesis of speech without affecting the intelligibility of the speech.
format text
author Aralar, Kevin Romualdo A.
Coloso, Paolo Miguel H.
Moneda, Jerlyn R.
author_facet Aralar, Kevin Romualdo A.
Coloso, Paolo Miguel H.
Moneda, Jerlyn R.
author_sort Aralar, Kevin Romualdo A.
title Filipino text-to-speech system: Tagapagsalita
title_short Filipino text-to-speech system: Tagapagsalita
title_full Filipino text-to-speech system: Tagapagsalita
title_fullStr Filipino text-to-speech system: Tagapagsalita
title_full_unstemmed Filipino text-to-speech system: Tagapagsalita
title_sort filipino text-to-speech system: tagapagsalita
publisher Animo Repository
publishDate 2006
url https://animorepository.dlsu.edu.ph/etd_bachelors/7656
_version_ 1712576759363272704