Multi-language Southeast Asian speech2text development
This project aims to further improve current code-switch ASR technology for English and Dravidian languages, and to support the future development of an engine that recognizes speech in English and a Dravidian language. As a native speaker of English and Tamil, the two langua...
Main Author: | Priya Kanakarajan |
---|---|
Other Authors: | Jiang Xudong |
Format: | Final Year Project |
Language: | English |
Published: | Nanyang Technological University, 2022 |
Subjects: | Engineering::Electrical and electronic engineering |
Online Access: | https://hdl.handle.net/10356/158284 |
Institution: | Nanyang Technological University |
id | sg-ntu-dr.10356-158284 |
---|---|
record_format | dspace |
spelling | sg-ntu-dr.10356-158284; 2023-07-07T18:55:59Z; Multi-language Southeast Asian speech2text development; Priya Kanakarajan; Jiang Xudong; School of Electrical and Electronic Engineering; A*STAR; EXDJiang@ntu.edu.sg; Engineering::Electrical and electronic engineering; This project aims to further improve current code-switch ASR technology for English and Dravidian languages, and to support the future development of an engine that recognizes speech in English and a Dravidian language. As the author is a native speaker of both English and Tamil, these are the two languages chosen for the project. An ASR system consists of an Acoustic Model, a Language Model and a Pronunciation Lexicon. This report investigates how better Language Models can be trained to improve the ASR output. To this end, we also aim to expand the text corpus by not only collecting data but also generating data for it; Bachelor of Engineering (Information Engineering and Media); 2022-06-01T06:28:22Z; 2022-06-01T06:28:22Z; 2022; Final Year Project (FYP); Priya Kanakarajan (2022). Multi-language Southeast Asian speech2text development. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158284; https://hdl.handle.net/10356/158284; en; B3092-211; Priya Kanakarajan; application/pdf; Nanyang Technological University |
institution | Nanyang Technological University |
building | NTU Library |
continent | Asia |
country | Singapore |
content_provider | NTU Library |
collection | DR-NTU |
language | English |
topic | Engineering::Electrical and electronic engineering |
spellingShingle | Engineering::Electrical and electronic engineering; Priya Kanakarajan; Multi-language Southeast Asian speech2text development |
description | This project aims to further improve current code-switch ASR technology for English and Dravidian languages, and to support the future development of an engine that recognizes speech in English and a Dravidian language. As the author is a native speaker of both English and Tamil, these are the two languages chosen for the project. An ASR system consists of an Acoustic Model, a Language Model and a Pronunciation Lexicon. This report investigates how better Language Models can be trained to improve the ASR output. To this end, we also aim to expand the text corpus by not only collecting data but also generating data for it. |
author2 | Jiang Xudong |
author_facet | Jiang Xudong; Priya Kanakarajan |
format | Final Year Project |
author | Priya Kanakarajan |
author_sort | Priya Kanakarajan |
title | Multi-language Southeast Asian speech2text development |
title_short | Multi-language Southeast Asian speech2text development |
title_full | Multi-language Southeast Asian speech2text development |
title_fullStr | Multi-language Southeast Asian speech2text development |
title_full_unstemmed | Multi-language Southeast Asian speech2text development |
title_sort | multi-language southeast asian speech2text development |
publisher | Nanyang Technological University |
publishDate | 2022 |
url | https://hdl.handle.net/10356/158284 |
_version_ | 1772827634147786752 |