Development of a Mandarin learning tool for children using speech recognition model

This project report explores the evaluation performance of speech recognition and generation models specifically for short Mandarin phrases and children's voices. It introduces a Mandarin learning application prototype framework that leverages these models, which have been finetuned to recog...

Full description

Saved in:
Bibliographic Details
Main Author: Wang, Yilin
Other Authors: Tan Yap Peng
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/177146
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-177146
record_format dspace
spelling sg-ntu-dr.10356-1771462024-05-31T15:43:29Z Development of a Mandarin learning tool for children using speech recognition model Wang, Yilin Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering Electrical and electronic engineering This project report explores the evaluation performance of speech recognition and generation models specifically for short Mandarin phrases and children's voices. It introduces a Mandarin learning application prototype framework that leverages these models, which have been finetuned to recognize nuances in children’s voice and short Chinese phrases. The primary goal of this study was to forge a developmental pathway for a learning tool designed to significantly enhance the educational experience of children. Presenting a tool framework focuses on improving pronunciation, intonation, and understanding of Chinese characters (汉字) through a structured pedagogical approach. This project is the extensive adaptation of the Whisper Model, engineered to overcome the inherent variability in children's speech patterns and the tonal complexity of Mandarin. Our approach involved a systematic methodology comprising the assembly of a children audio dataset, model performance testing with a focus on children's voices, and fine-tuning to elevate the model's acuity for concise Mandarin phrases. The prototype framework serves as a proof of concept, demonstrating the capabilities of the model in a structured educational context. It outlines the envisioned interactive modules aimed at reinforcing pronunciation, intonation, and character recognition, fostering a comprehensive learning experience. The project successfully demonstrated the Whisper model's performance at recognising short phrases articulated by both adults and children. This success underpins the model's enhancements to better serve the unique needs of young learners and short phrase recognition, culminating in the introduction of an educational application prototype framework. This prototype harnesses speech technology to facilitate language learning, thereby showcasing the potential of integrating speech recognition and generation technologies into educational tools. The findings lay a crucial groundwork for future research and development in this field. Bachelor's degree 2024-05-27T05:48:34Z 2024-05-27T05:48:34Z 2024 Final Year Project (FYP) Wang, Y. (2024). Development of a Mandarin learning tool for children using speech recognition model. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177146 https://hdl.handle.net/10356/177146 en A3202-231 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Electrical and electronic engineering
spellingShingle Engineering
Electrical and electronic engineering
Wang, Yilin
Development of a Mandarin learning tool for children using speech recognition model
description This project report explores the evaluation performance of speech recognition and generation models specifically for short Mandarin phrases and children's voices. It introduces a Mandarin learning application prototype framework that leverages these models, which have been finetuned to recognize nuances in children’s voice and short Chinese phrases. The primary goal of this study was to forge a developmental pathway for a learning tool designed to significantly enhance the educational experience of children. Presenting a tool framework focuses on improving pronunciation, intonation, and understanding of Chinese characters (汉字) through a structured pedagogical approach. This project is the extensive adaptation of the Whisper Model, engineered to overcome the inherent variability in children's speech patterns and the tonal complexity of Mandarin. Our approach involved a systematic methodology comprising the assembly of a children audio dataset, model performance testing with a focus on children's voices, and fine-tuning to elevate the model's acuity for concise Mandarin phrases. The prototype framework serves as a proof of concept, demonstrating the capabilities of the model in a structured educational context. It outlines the envisioned interactive modules aimed at reinforcing pronunciation, intonation, and character recognition, fostering a comprehensive learning experience. The project successfully demonstrated the Whisper model's performance at recognising short phrases articulated by both adults and children. This success underpins the model's enhancements to better serve the unique needs of young learners and short phrase recognition, culminating in the introduction of an educational application prototype framework. This prototype harnesses speech technology to facilitate language learning, thereby showcasing the potential of integrating speech recognition and generation technologies into educational tools. The findings lay a crucial groundwork for future research and development in this field.
author2 Tan Yap Peng
author_facet Tan Yap Peng
Wang, Yilin
format Final Year Project
author Wang, Yilin
author_sort Wang, Yilin
title Development of a Mandarin learning tool for children using speech recognition model
title_short Development of a Mandarin learning tool for children using speech recognition model
title_full Development of a Mandarin learning tool for children using speech recognition model
title_fullStr Development of a Mandarin learning tool for children using speech recognition model
title_full_unstemmed Development of a Mandarin learning tool for children using speech recognition model
title_sort development of a mandarin learning tool for children using speech recognition model
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/177146
_version_ 1814047341150208000