Multi-language Southeast Asian speech2text development

This project aims to improve on the current Code-Switch ASR technology for English- (Dravidian Language) further and help in the future development of an engine that allows speech to be detected in English and another Dravidian language. As a native speaker of English and Tamil, the two langua...

Full description

Saved in:

Bibliographic Details
Main Author:	Priya Kanakarajan
Other Authors:	Jiang Xudong
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/158284
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-158284
record_format	dspace
spelling	sg-ntu-dr.10356-1582842023-07-07T18:55:59Z Multi-language Southeast Asian speech2text development Priya Kanakarajan Jiang Xudong School of Electrical and Electronic Engineering A*STAR EXDJiang@ntu.edu.sg Engineering::Electrical and electronic engineering This project aims to improve on the current Code-Switch ASR technology for English- (Dravidian Language) further and help in the future development of an engine that allows speech to be detected in English and another Dravidian language. As a native speaker of English and Tamil, the two languages chosen for my project are English and Tamil. ASRs consist of the Acoustic Model, Language Model and Pronunciation Lexicon. This report will investigate how we can train better Language Models to improve the outputs. To solve this, we also aim to expand the text corpus by not only collecting data but also generating data for the text corpus Bachelor of Engineering (Information Engineering and Media) 2022-06-01T06:28:22Z 2022-06-01T06:28:22Z 2022 Final Year Project (FYP) Priya Kanakarajan (2022). Multi-language Southeast Asian speech2text development. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158284 https://hdl.handle.net/10356/158284 en B3092-211 Priya Kanakarajan application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering
spellingShingle	Engineering::Electrical and electronic engineering Priya Kanakarajan Multi-language Southeast Asian speech2text development
description	This project aims to improve on the current Code-Switch ASR technology for English- (Dravidian Language) further and help in the future development of an engine that allows speech to be detected in English and another Dravidian language. As a native speaker of English and Tamil, the two languages chosen for my project are English and Tamil. ASRs consist of the Acoustic Model, Language Model and Pronunciation Lexicon. This report will investigate how we can train better Language Models to improve the outputs. To solve this, we also aim to expand the text corpus by not only collecting data but also generating data for the text corpus
author2	Jiang Xudong
author_facet	Jiang Xudong Priya Kanakarajan
format	Final Year Project
author	Priya Kanakarajan
author_sort	Priya Kanakarajan
title	Multi-language Southeast Asian speech2text development
title_short	Multi-language Southeast Asian speech2text development
title_full	Multi-language Southeast Asian speech2text development
title_fullStr	Multi-language Southeast Asian speech2text development
title_full_unstemmed	Multi-language Southeast Asian speech2text development
title_sort	multi-language southeast asian speech2text development
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/158284
_version_	1772827634147786752

Multi-language Southeast Asian speech2text development

Similar Items