Multi-language Southeast Asian speech2text development

This project aims to improve on the current Code-Switch ASR technology for English- (Dravidian Language) further and help in the future development of an engine that allows speech to be detected in English and another Dravidian language. As a native speaker of English and Tamil, the two langua...

Full description

Saved in:
Bibliographic Details
Main Author: Priya Kanakarajan
Other Authors: Jiang Xudong
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158284
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-158284
record_format dspace
spelling sg-ntu-dr.10356-1582842023-07-07T18:55:59Z Multi-language Southeast Asian speech2text development Priya Kanakarajan Jiang Xudong School of Electrical and Electronic Engineering A*STAR EXDJiang@ntu.edu.sg Engineering::Electrical and electronic engineering This project aims to improve on the current Code-Switch ASR technology for English- (Dravidian Language) further and help in the future development of an engine that allows speech to be detected in English and another Dravidian language. As a native speaker of English and Tamil, the two languages chosen for my project are English and Tamil. ASRs consist of the Acoustic Model, Language Model and Pronunciation Lexicon. This report will investigate how we can train better Language Models to improve the outputs. To solve this, we also aim to expand the text corpus by not only collecting data but also generating data for the text corpus Bachelor of Engineering (Information Engineering and Media) 2022-06-01T06:28:22Z 2022-06-01T06:28:22Z 2022 Final Year Project (FYP) Priya Kanakarajan (2022). Multi-language Southeast Asian speech2text development. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158284 https://hdl.handle.net/10356/158284 en B3092-211 Priya Kanakarajan application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Priya Kanakarajan
Multi-language Southeast Asian speech2text development
description This project aims to improve on the current Code-Switch ASR technology for English- (Dravidian Language) further and help in the future development of an engine that allows speech to be detected in English and another Dravidian language. As a native speaker of English and Tamil, the two languages chosen for my project are English and Tamil. ASRs consist of the Acoustic Model, Language Model and Pronunciation Lexicon. This report will investigate how we can train better Language Models to improve the outputs. To solve this, we also aim to expand the text corpus by not only collecting data but also generating data for the text corpus
author2 Jiang Xudong
author_facet Jiang Xudong
Priya Kanakarajan
format Final Year Project
author Priya Kanakarajan
author_sort Priya Kanakarajan
title Multi-language Southeast Asian speech2text development
title_short Multi-language Southeast Asian speech2text development
title_full Multi-language Southeast Asian speech2text development
title_fullStr Multi-language Southeast Asian speech2text development
title_full_unstemmed Multi-language Southeast Asian speech2text development
title_sort multi-language southeast asian speech2text development
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/158284
_version_ 1772827634147786752