Automatic assessment of oral reading fluency from children's read speech in the Filipino language

With the end view of helping the Philippine education system in its literacy initiatives, this study aims to develop methods for automatic assessment of oral reading fluency from children's read speech in the Filipino language. Thus, this study seeks to design methods of automatically extractin...

Full description

Saved in:
Bibliographic Details
Main Author: Dimzon, Francis
Format: text
Language:English
Published: Animo Repository 2023
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/etdd_softtech/1
https://animorepository.dlsu.edu.ph/context/etdd_softtech/article/1000/viewcontent/Automatic2_assessment_of_oral_reading_fluency_from_childrens_read_Redacted.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
Language: English
id oai:animorepository.dlsu.edu.ph:etdd_softtech-1000
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:etdd_softtech-10002023-10-02T06:56:40Z Automatic assessment of oral reading fluency from children's read speech in the Filipino language Dimzon, Francis With the end view of helping the Philippine education system in its literacy initiatives, this study aims to develop methods for automatic assessment of oral reading fluency from children's read speech in the Filipino language. Thus, this study seeks to design methods of automatically extracting and analyzing prosodic features of children's read speech in Filipino. To achieve this, the four-fold set of research activities was conducted to describe an automated oral reading fluency assessment system. It consisted of 1) building a children's Filipino speech corpus, 2) designing methods of extracting and analyzing prosodic features, 3) developing methods of automatically assessing oral reading fluency, and 4) evaluating the performance of developed methods. The dataset consisted of 192 audio files totaling 11 hours, 48 minutes, and 13 seconds. The audio files were recordings of children ages 6 to 11 years reading grade-appropriate passages in the Filipino language. Human raters manually annotated the files as fluent or nonfluent; and as independent, instructional, and frustration levels. Audio and prosodic features were extracted and used as predictor variables in the machine learning training and testing. The machine learning classification methods produced results indicating that the SVM had validation accuracies of 81.18% and 87.71% for the three-level fluency scheme and two-level fluency scheme, respectively. The predictor variables used for these classifications were different. For the three-level scheme, the variables were DSP- and ASR-computed speech rate and Levenshtein distance, while for the two-level scheme, they were total duration, Levenshtein distance, out-of-vocabulary words, DSP-computed articulation rate, and ASR-computed speech rate. The Mel-frequency and gammatone cepstrum coefficients, spectral audio, and wavelet features did not provide significant prediction performance results. On the other hand, the LSTM deep learning method resulted in validation accuracies of 55.08% and 79.61% for the three- and two-level fluency schemes, respectively. To further improve the prediction accuracy, it is recommended that more predictor features be identified, such as other types of reading miscues and pauses features. Also, more reading data may be gathered to balance the distribution of fluency classes in the dataset and to make deep-learning methods discover robust predictor features and improve performance. This study is relevant in addressing the issue of poor reading performance among Filipino children. The study has created a children's read speech corpus in Filipino language, which will eventually be a part of a larger dataset aimed at addressing the limited availability of children's Filipino speech corpus. The study has identified relevant and non-relevant predictor features that can be used to automatically classify oral reading fluency. These features were used as inputs to develop fluency classification methods. The speech corpus, fluency predictor features, and classification techniques based on DSP- and ASR-based feature extraction developed in this study will form as a framework for building an automated oral reading fluency assessment system. 2023-04-01T07:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdd_softtech/1 https://animorepository.dlsu.edu.ph/context/etdd_softtech/article/1000/viewcontent/Automatic2_assessment_of_oral_reading_fluency_from_childrens_read_Redacted.pdf Software Technology Dissertations English Animo Repository Speech processing systems Automatic speech recognition Oral reading—Evaluation Reading—Ability testing—Philippines Filipino language—Versification Computer Sciences Educational Technology Software Engineering
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
language English
topic Speech processing systems
Automatic speech recognition
Oral reading—Evaluation
Reading—Ability testing—Philippines
Filipino language—Versification
Computer Sciences
Educational Technology
Software Engineering
spellingShingle Speech processing systems
Automatic speech recognition
Oral reading—Evaluation
Reading—Ability testing—Philippines
Filipino language—Versification
Computer Sciences
Educational Technology
Software Engineering
Dimzon, Francis
Automatic assessment of oral reading fluency from children's read speech in the Filipino language
description With the end view of helping the Philippine education system in its literacy initiatives, this study aims to develop methods for automatic assessment of oral reading fluency from children's read speech in the Filipino language. Thus, this study seeks to design methods of automatically extracting and analyzing prosodic features of children's read speech in Filipino. To achieve this, the four-fold set of research activities was conducted to describe an automated oral reading fluency assessment system. It consisted of 1) building a children's Filipino speech corpus, 2) designing methods of extracting and analyzing prosodic features, 3) developing methods of automatically assessing oral reading fluency, and 4) evaluating the performance of developed methods. The dataset consisted of 192 audio files totaling 11 hours, 48 minutes, and 13 seconds. The audio files were recordings of children ages 6 to 11 years reading grade-appropriate passages in the Filipino language. Human raters manually annotated the files as fluent or nonfluent; and as independent, instructional, and frustration levels. Audio and prosodic features were extracted and used as predictor variables in the machine learning training and testing. The machine learning classification methods produced results indicating that the SVM had validation accuracies of 81.18% and 87.71% for the three-level fluency scheme and two-level fluency scheme, respectively. The predictor variables used for these classifications were different. For the three-level scheme, the variables were DSP- and ASR-computed speech rate and Levenshtein distance, while for the two-level scheme, they were total duration, Levenshtein distance, out-of-vocabulary words, DSP-computed articulation rate, and ASR-computed speech rate. The Mel-frequency and gammatone cepstrum coefficients, spectral audio, and wavelet features did not provide significant prediction performance results. On the other hand, the LSTM deep learning method resulted in validation accuracies of 55.08% and 79.61% for the three- and two-level fluency schemes, respectively. To further improve the prediction accuracy, it is recommended that more predictor features be identified, such as other types of reading miscues and pauses features. Also, more reading data may be gathered to balance the distribution of fluency classes in the dataset and to make deep-learning methods discover robust predictor features and improve performance. This study is relevant in addressing the issue of poor reading performance among Filipino children. The study has created a children's read speech corpus in Filipino language, which will eventually be a part of a larger dataset aimed at addressing the limited availability of children's Filipino speech corpus. The study has identified relevant and non-relevant predictor features that can be used to automatically classify oral reading fluency. These features were used as inputs to develop fluency classification methods. The speech corpus, fluency predictor features, and classification techniques based on DSP- and ASR-based feature extraction developed in this study will form as a framework for building an automated oral reading fluency assessment system.
format text
author Dimzon, Francis
author_facet Dimzon, Francis
author_sort Dimzon, Francis
title Automatic assessment of oral reading fluency from children's read speech in the Filipino language
title_short Automatic assessment of oral reading fluency from children's read speech in the Filipino language
title_full Automatic assessment of oral reading fluency from children's read speech in the Filipino language
title_fullStr Automatic assessment of oral reading fluency from children's read speech in the Filipino language
title_full_unstemmed Automatic assessment of oral reading fluency from children's read speech in the Filipino language
title_sort automatic assessment of oral reading fluency from children's read speech in the filipino language
publisher Animo Repository
publishDate 2023
url https://animorepository.dlsu.edu.ph/etdd_softtech/1
https://animorepository.dlsu.edu.ph/context/etdd_softtech/article/1000/viewcontent/Automatic2_assessment_of_oral_reading_fluency_from_childrens_read_Redacted.pdf
_version_ 1779260479581978624