HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers

The language learning system employs a speech corpus that contains selected phrases from an Ilokano phrasebook authored by Dr. Carl Rubino [7]. A survey was conducted in Narvacan, Ilocos Sur for the purpose of selecting Ilokano phrases rated as "commonly used". Using the survey results, th...

Full description

Saved in:
Bibliographic Details
Main Authors: Malaay, Emmanual M., Simora, Michael B., Cabatic, Ronald John O., Castillo, Anthony B., Cabotaje, Ayman P., Pascual, Ronald M.
Format: text
Published: Animo Repository 2015
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/13624
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-13455
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-134552024-12-09T08:44:03Z HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers Malaay, Emmanual M. Simora, Michael B. Cabatic, Ronald John O. Castillo, Anthony B. Cabotaje, Ayman P. Pascual, Ronald M. The language learning system employs a speech corpus that contains selected phrases from an Ilokano phrasebook authored by Dr. Carl Rubino [7]. A survey was conducted in Narvacan, Ilocos Sur for the purpose of selecting Ilokano phrases rated as "commonly used". Using the survey results, the speech corpus was generated from the recordings of 10 middle-age (ages 18-28) native Ilokano speakers which were transcribed to phoneme level. The system has a built-in assessment subsystem that evaluates the user's learning based on three categories: reading, listening, and comprehension. The assessment uses a Reading Miscue Detector (RMD) that basically employs force alignment method. An HMM- based Automated Speech Recognition (ASR) system was developed using the phoneme-level transcribed speech corpus as training data and the Viterbi Alignment as the main method for likelihood scoring. The 3-state HMMs were generated using the Hidden Markov Model Toolkit (HTK). A six-week pilot study was conducted to measure the extent of the users' learning of the language gained from the system. The offline test set contains 3 subsets of full system passages by 3 users wherein each phrase has one phoneme with wrong pronunciation. The test set was used to measure the False Alarm Rate (FAR) and Misdetection Rate (MdR) of the RMD, which have shown fairly low percentages or may be considered as a good RMD according to previous studies presented in the literature. The authors demonstrated that 65.75% of the reading miscues can be detected by the system at a false alarm rate of 25.16% which is good enough for similar systems presented in the literature. Another finding of this study is that the speech rhythm of the Ilokano language is mora-timed. 2015-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/13624 Faculty Research Work Animo Repository Iloko language—Computer-assisted instruction for Tagalog speakers Computer Sciences Language and Literacy Education
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Iloko language—Computer-assisted instruction for Tagalog speakers
Computer Sciences
Language and Literacy Education
spellingShingle Iloko language—Computer-assisted instruction for Tagalog speakers
Computer Sciences
Language and Literacy Education
Malaay, Emmanual M.
Simora, Michael B.
Cabatic, Ronald John O.
Castillo, Anthony B.
Cabotaje, Ayman P.
Pascual, Ronald M.
HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers
description The language learning system employs a speech corpus that contains selected phrases from an Ilokano phrasebook authored by Dr. Carl Rubino [7]. A survey was conducted in Narvacan, Ilocos Sur for the purpose of selecting Ilokano phrases rated as "commonly used". Using the survey results, the speech corpus was generated from the recordings of 10 middle-age (ages 18-28) native Ilokano speakers which were transcribed to phoneme level. The system has a built-in assessment subsystem that evaluates the user's learning based on three categories: reading, listening, and comprehension. The assessment uses a Reading Miscue Detector (RMD) that basically employs force alignment method. An HMM- based Automated Speech Recognition (ASR) system was developed using the phoneme-level transcribed speech corpus as training data and the Viterbi Alignment as the main method for likelihood scoring. The 3-state HMMs were generated using the Hidden Markov Model Toolkit (HTK). A six-week pilot study was conducted to measure the extent of the users' learning of the language gained from the system. The offline test set contains 3 subsets of full system passages by 3 users wherein each phrase has one phoneme with wrong pronunciation. The test set was used to measure the False Alarm Rate (FAR) and Misdetection Rate (MdR) of the RMD, which have shown fairly low percentages or may be considered as a good RMD according to previous studies presented in the literature. The authors demonstrated that 65.75% of the reading miscues can be detected by the system at a false alarm rate of 25.16% which is good enough for similar systems presented in the literature. Another finding of this study is that the speech rhythm of the Ilokano language is mora-timed.
format text
author Malaay, Emmanual M.
Simora, Michael B.
Cabatic, Ronald John O.
Castillo, Anthony B.
Cabotaje, Ayman P.
Pascual, Ronald M.
author_facet Malaay, Emmanual M.
Simora, Michael B.
Cabatic, Ronald John O.
Castillo, Anthony B.
Cabotaje, Ayman P.
Pascual, Ronald M.
author_sort Malaay, Emmanual M.
title HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers
title_short HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers
title_full HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers
title_fullStr HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers
title_full_unstemmed HMM-based computer-aided Ilokano language learning (CAILL) system for Tagalog speakers
title_sort hmm-based computer-aided ilokano language learning (caill) system for tagalog speakers
publisher Animo Repository
publishDate 2015
url https://animorepository.dlsu.edu.ph/faculty_research/13624
_version_ 1819113601908080640