Southeast Asian multi-language speech recognition engine

In the digital era, all kinds of technology advancements have reshaped human life to become easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology being propelled forward. The market prospects are in tu...

Full description

Saved in:
Bibliographic Details
Main Author: Zhang, Keke
Other Authors: Ling Keck Voon
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158457
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-158457
record_format dspace
spelling sg-ntu-dr.10356-1584572023-07-07T19:01:02Z Southeast Asian multi-language speech recognition engine Zhang, Keke Ling Keck Voon School of Electrical and Electronic Engineering A*STAR Institute of Material Research and Engineering Tran Huy Dat EKVLING@ntu.edu.sg Engineering::Electrical and electronic engineering In the digital era, all kinds of technology advancements have reshaped human life to become easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology being propelled forward. The market prospects are in turn boosted as multiple applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI Assistants are gaining increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient since speech technology is one of the major enablers of contactless interaction. Moreover, driven by the advancements in artificial intelligence, speech technology has become more accessible to a wider range of users at a lower cost in recent years. As a result, more challenges will arise inevitably and accented speech with language mixing is one of them. This project aims to develop an Automatic Speech Recognition (ASR) engine that can be utilised in Singapore, with capabilities to process language mixing input (English mixed with Mandarin) and to produce useful output with low error rate. The focus of this project is on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR will be evaluated by Mixed Error Rate (MER). Bachelor of Engineering (Electrical and Electronic Engineering) 2022-06-03T08:19:18Z 2022-06-03T08:19:18Z 2022 Final Year Project (FYP) Zhang, K. (2022). Southeast Asian multi-language speech recognition engine. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158457 https://hdl.handle.net/10356/158457 en B1080-211 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Zhang, Keke
Southeast Asian multi-language speech recognition engine
description In the digital era, all kinds of technology advancements have reshaped human life to become easier, faster, and smarter than ever before. Over the past decade, voice services have been adopted across a variety of industries as speech technology being propelled forward. The market prospects are in turn boosted as multiple applications such as Automatic Speech Recognition (ASR), Text-To-Speech (TTS) and AI Assistants are gaining increasing awareness. Amid the Covid-19 crisis, the global speech technology market remained resilient since speech technology is one of the major enablers of contactless interaction. Moreover, driven by the advancements in artificial intelligence, speech technology has become more accessible to a wider range of users at a lower cost in recent years. As a result, more challenges will arise inevitably and accented speech with language mixing is one of them. This project aims to develop an Automatic Speech Recognition (ASR) engine that can be utilised in Singapore, with capabilities to process language mixing input (English mixed with Mandarin) and to produce useful output with low error rate. The focus of this project is on automated text corpus collection, language model training, ASR integration and testing. The performance of the ASR will be evaluated by Mixed Error Rate (MER).
author2 Ling Keck Voon
author_facet Ling Keck Voon
Zhang, Keke
format Final Year Project
author Zhang, Keke
author_sort Zhang, Keke
title Southeast Asian multi-language speech recognition engine
title_short Southeast Asian multi-language speech recognition engine
title_full Southeast Asian multi-language speech recognition engine
title_fullStr Southeast Asian multi-language speech recognition engine
title_full_unstemmed Southeast Asian multi-language speech recognition engine
title_sort southeast asian multi-language speech recognition engine
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/158457
_version_ 1772827129027756032