Automatic assessment of oral reading fluency from children's read speech in the Filipino language
With the end view of helping the Philippine education system in its literacy initiatives, this study aims to develop methods for automatic assessment of oral reading fluency from children's read speech in the Filipino language. Thus, this study seeks to design methods of automatically extractin...
Saved in:
Main Author: | |
---|---|
Format: | text |
Language: | English |
Published: |
Animo Repository
2023
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/etdd_softtech/1 https://animorepository.dlsu.edu.ph/context/etdd_softtech/article/1000/viewcontent/Automatic2_assessment_of_oral_reading_fluency_from_childrens_read_Redacted.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
Language: | English |
id |
oai:animorepository.dlsu.edu.ph:etdd_softtech-1000 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:etdd_softtech-10002023-10-02T06:56:40Z Automatic assessment of oral reading fluency from children's read speech in the Filipino language Dimzon, Francis With the end view of helping the Philippine education system in its literacy initiatives, this study aims to develop methods for automatic assessment of oral reading fluency from children's read speech in the Filipino language. Thus, this study seeks to design methods of automatically extracting and analyzing prosodic features of children's read speech in Filipino. To achieve this, the four-fold set of research activities was conducted to describe an automated oral reading fluency assessment system. It consisted of 1) building a children's Filipino speech corpus, 2) designing methods of extracting and analyzing prosodic features, 3) developing methods of automatically assessing oral reading fluency, and 4) evaluating the performance of developed methods. The dataset consisted of 192 audio files totaling 11 hours, 48 minutes, and 13 seconds. The audio files were recordings of children ages 6 to 11 years reading grade-appropriate passages in the Filipino language. Human raters manually annotated the files as fluent or nonfluent; and as independent, instructional, and frustration levels. Audio and prosodic features were extracted and used as predictor variables in the machine learning training and testing. The machine learning classification methods produced results indicating that the SVM had validation accuracies of 81.18% and 87.71% for the three-level fluency scheme and two-level fluency scheme, respectively. The predictor variables used for these classifications were different. For the three-level scheme, the variables were DSP- and ASR-computed speech rate and Levenshtein distance, while for the two-level scheme, they were total duration, Levenshtein distance, out-of-vocabulary words, DSP-computed articulation rate, and ASR-computed speech rate. The Mel-frequency and gammatone cepstrum coefficients, spectral audio, and wavelet features did not provide significant prediction performance results. On the other hand, the LSTM deep learning method resulted in validation accuracies of 55.08% and 79.61% for the three- and two-level fluency schemes, respectively. To further improve the prediction accuracy, it is recommended that more predictor features be identified, such as other types of reading miscues and pauses features. Also, more reading data may be gathered to balance the distribution of fluency classes in the dataset and to make deep-learning methods discover robust predictor features and improve performance. This study is relevant in addressing the issue of poor reading performance among Filipino children. The study has created a children's read speech corpus in Filipino language, which will eventually be a part of a larger dataset aimed at addressing the limited availability of children's Filipino speech corpus. The study has identified relevant and non-relevant predictor features that can be used to automatically classify oral reading fluency. These features were used as inputs to develop fluency classification methods. The speech corpus, fluency predictor features, and classification techniques based on DSP- and ASR-based feature extraction developed in this study will form as a framework for building an automated oral reading fluency assessment system. 2023-04-01T07:00:00Z text application/pdf https://animorepository.dlsu.edu.ph/etdd_softtech/1 https://animorepository.dlsu.edu.ph/context/etdd_softtech/article/1000/viewcontent/Automatic2_assessment_of_oral_reading_fluency_from_childrens_read_Redacted.pdf Software Technology Dissertations English Animo Repository Speech processing systems Automatic speech recognition Oral reading—Evaluation Reading—Ability testing—Philippines Filipino language—Versification Computer Sciences Educational Technology Software Engineering |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
language |
English |
topic |
Speech processing systems Automatic speech recognition Oral reading—Evaluation Reading—Ability testing—Philippines Filipino language—Versification Computer Sciences Educational Technology Software Engineering |
spellingShingle |
Speech processing systems Automatic speech recognition Oral reading—Evaluation Reading—Ability testing—Philippines Filipino language—Versification Computer Sciences Educational Technology Software Engineering Dimzon, Francis Automatic assessment of oral reading fluency from children's read speech in the Filipino language |
description |
With the end view of helping the Philippine education system in its literacy initiatives, this study aims to develop methods for automatic assessment of oral reading fluency from children's read speech in the Filipino language. Thus, this study seeks to design methods of automatically extracting and analyzing prosodic features of children's read speech in Filipino. To achieve this, the four-fold set of research activities was conducted to describe an automated oral reading fluency assessment system. It consisted of 1) building a children's Filipino speech corpus, 2) designing methods of extracting and analyzing prosodic features, 3) developing methods of automatically assessing oral reading fluency, and 4) evaluating the performance of developed methods. The dataset consisted of 192 audio files totaling 11 hours, 48 minutes, and 13 seconds. The audio files were recordings of children ages 6 to 11 years reading grade-appropriate passages in the Filipino language. Human raters manually annotated the files as fluent or nonfluent; and as independent, instructional, and frustration levels. Audio and prosodic features were extracted and used as predictor variables in the machine learning training and testing. The machine learning classification methods produced results indicating that the SVM had validation accuracies of 81.18% and 87.71% for the three-level fluency scheme and two-level fluency scheme, respectively. The predictor variables used for these classifications were different. For the three-level scheme, the variables were DSP- and ASR-computed speech rate and Levenshtein distance, while for the two-level scheme, they were total duration, Levenshtein distance, out-of-vocabulary words, DSP-computed articulation rate, and ASR-computed speech rate. The Mel-frequency and gammatone cepstrum coefficients, spectral audio, and wavelet features did not provide significant prediction performance results. On the other hand, the LSTM deep learning method resulted in validation accuracies of 55.08% and 79.61% for the three- and two-level fluency schemes, respectively. To further improve the prediction accuracy, it is recommended that more predictor features be identified, such as other types of reading miscues and pauses features. Also, more reading data may be gathered to balance the distribution of fluency classes in the dataset and to make deep-learning methods discover robust predictor features and improve performance.
This study is relevant in addressing the issue of poor reading performance among Filipino children. The study has created a children's read speech corpus in Filipino language, which will eventually be a part of a larger dataset aimed at addressing the limited availability of children's Filipino speech corpus. The study has identified relevant and non-relevant predictor features that can be used to automatically classify oral reading fluency. These features were used as inputs to develop fluency classification methods. The speech corpus, fluency predictor features, and classification techniques based on DSP- and ASR-based feature extraction developed in this study will form as a framework for building an automated oral reading fluency assessment system. |
format |
text |
author |
Dimzon, Francis |
author_facet |
Dimzon, Francis |
author_sort |
Dimzon, Francis |
title |
Automatic assessment of oral reading fluency from children's read speech in the Filipino language |
title_short |
Automatic assessment of oral reading fluency from children's read speech in the Filipino language |
title_full |
Automatic assessment of oral reading fluency from children's read speech in the Filipino language |
title_fullStr |
Automatic assessment of oral reading fluency from children's read speech in the Filipino language |
title_full_unstemmed |
Automatic assessment of oral reading fluency from children's read speech in the Filipino language |
title_sort |
automatic assessment of oral reading fluency from children's read speech in the filipino language |
publisher |
Animo Repository |
publishDate |
2023 |
url |
https://animorepository.dlsu.edu.ph/etdd_softtech/1 https://animorepository.dlsu.edu.ph/context/etdd_softtech/article/1000/viewcontent/Automatic2_assessment_of_oral_reading_fluency_from_childrens_read_Redacted.pdf |
_version_ |
1779260479581978624 |