Southeast Asian multi-language speech recognition engine
In the digital era, all kinds of technology advancements have reshaped human life to become easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology being propelled forward. The market prospects are in tu...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/158457 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-158457 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1584572023-07-07T19:01:02Z Southeast Asian multi-language speech recognition engine Zhang, Keke Ling Keck Voon School of Electrical and Electronic Engineering A*STAR Institute of Material Research and Engineering Tran Huy Dat EKVLING@ntu.edu.sg Engineering::Electrical and electronic engineering In the digital era, all kinds of technology advancements have reshaped human life to become easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology being propelled forward. The market prospects are in turn boosted as multiple applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI Assistants are gaining increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient since speech technology is one of the major enablers of contactless interaction. Moreover, driven by the advancements in artificial intelligence, speech technology has become more accessible to a wider range of users at a lower cost in recent years. As a result, more challenges will arise inevitably and accented speech with language mixing is one of them. This project aims to develop an Automatic Speech Recognition (ASR) engine that can be utilised in Singapore, with capabilities to process language mixing input (English mixed with Mandarin) and to produce useful output with low error rate. The focus of this project is on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR will be evaluated by Mixed Error Rate (MER). Bachelor of Engineering (Electrical and Electronic Engineering) 2022-06-03T08:19:18Z 2022-06-03T08:19:18Z 2022 Final Year Project (FYP) Zhang, K. (2022). Southeast Asian multi-language speech recognition engine. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158457 https://hdl.handle.net/10356/158457 en B1080-211 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Zhang, Keke Southeast Asian multi-language speech recognition engine |
description |
In the digital era, all kinds of technology advancements have reshaped human life to become easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology being propelled forward. The market prospects are in turn boosted as multiple applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI Assistants are gaining increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient since speech technology is one of the major enablers of contactless interaction. Moreover, driven by the advancements in artificial intelligence, speech technology has become more accessible to a wider range of users at a lower cost in recent years. As a result, more challenges will arise inevitably and accented speech with language mixing is one of them. This project aims to develop an Automatic Speech Recognition (ASR) engine that can be utilised in Singapore, with capabilities to process language mixing input (English mixed with Mandarin) and to produce useful output with low error rate. The focus of this project is on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR will be evaluated by Mixed Error Rate (MER). |
author2 |
Ling Keck Voon |
author_facet |
Ling Keck Voon Zhang, Keke |
format |
Final Year Project |
author |
Zhang, Keke |
author_sort |
Zhang, Keke |
title |
Southeast Asian multi-language speech recognition engine |
title_short |
Southeast Asian multi-language speech recognition engine |
title_full |
Southeast Asian multi-language speech recognition engine |
title_fullStr |
Southeast Asian multi-language speech recognition engine |
title_full_unstemmed |
Southeast Asian multi-language speech recognition engine |
title_sort |
southeast asian multi-language speech recognition engine |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/158457 |
_version_ |
1772827129027756032 |