Southeast Asian multi-language speech recognition engine

In the digital era, technological advancements of all kinds have made human life easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology has advanced. The market prospects are in tu...


Bibliographic Details
Main Author:	Zhang, Keke
Other Authors:	Ling Keck Voon
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/158457
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-158457
record_format	dspace
spelling	sg-ntu-dr.10356-158457 2023-07-07T19:01:02Z Southeast Asian multi-language speech recognition engine Zhang, Keke Ling Keck Voon School of Electrical and Electronic Engineering A*STAR Institute of Material Research and Engineering Tran Huy Dat EKVLING@ntu.edu.sg Engineering::Electrical and electronic engineering In the digital era, technological advancements of all kinds have made human life easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology has advanced. Market prospects have in turn been boosted as applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI Assistants gain increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient, since speech technology is one of the major enablers of contactless interaction. Moreover, driven by advancements in artificial intelligence, speech technology has become accessible to a wider range of users at a lower cost in recent years. As a result, more challenges will inevitably arise, and accented speech with language mixing is one of them. This project aims to develop an Automatic Speech Recognition (ASR) engine that can be utilised in Singapore, capable of processing language-mixed input (English mixed with Mandarin) and producing useful output with a low error rate. The focus of this project is on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR engine will be evaluated by Mixed Error Rate (MER). Bachelor of Engineering (Electrical and Electronic Engineering) 2022-06-03T08:19:18Z 2022-06-03T08:19:18Z 2022 Final Year Project (FYP) Zhang, K. (2022). Southeast Asian multi-language speech recognition engine. Final Year Project (FYP), Nanyang Technological University, Singapore.
https://hdl.handle.net/10356/158457 https://hdl.handle.net/10356/158457 en B1080-211 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Zhang, Keke Southeast Asian multi-language speech recognition engine
description	In the digital era, technological advancements of all kinds have made human life easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology has advanced. Market prospects have in turn been boosted as applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI Assistants gain increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient, since speech technology is one of the major enablers of contactless interaction. Moreover, driven by advancements in artificial intelligence, speech technology has become accessible to a wider range of users at a lower cost in recent years. As a result, more challenges will inevitably arise, and accented speech with language mixing is one of them. This project aims to develop an Automatic Speech Recognition (ASR) engine that can be utilised in Singapore, capable of processing language-mixed input (English mixed with Mandarin) and producing useful output with a low error rate. The focus of this project is on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR engine will be evaluated by Mixed Error Rate (MER).
author2	Ling Keck Voon
author_facet	Ling Keck Voon Zhang, Keke
format	Final Year Project
author	Zhang, Keke
author_sort	Zhang, Keke
title	Southeast Asian multi-language speech recognition engine
title_short	Southeast Asian multi-language speech recognition engine
title_full	Southeast Asian multi-language speech recognition engine
title_fullStr	Southeast Asian multi-language speech recognition engine
title_full_unstemmed	Southeast Asian multi-language speech recognition engine
title_sort	southeast asian multi-language speech recognition engine
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/158457
_version_	1772827129027756032


Similar Items