Deep-learning for conversational speech using semantic textual analysis
Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especia...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156632 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156632 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1566322022-04-21T06:47:50Z Deep-learning for conversational speech using semantic textual analysis Suthakar, Shiny Gladdys Chng Eng Siong School of Computer Science and Engineering ASESChng@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especially potentially life-saving ones. However, most ASR systems today can only work effectively on formal speech input. They have a lot of room to fully understand speech of colloquial nature. Focusing on English speech in the Singaporean context, this project aims to provide a solution for generating formal semantic equivalents of conversational sentences derived from speech. Thus, acoustic and language models of existing ASR systems can be trained with these mappings from conversational to formal text, thus acquiring better comprehension and performance when receiving informal speech input. Furthermore, this project aims to analyse the semantic similarity performance of deep-learning models in terms of semantically similar formal sentence generation and their use of deep-learning techniques. The experimentation results show that the PEGASUS model performs better holistically. This report will present the proposed solution framework and lay out in detail the components of the project implementation. Bachelor of Engineering (Computer Science) 2022-04-21T06:47:50Z 2022-04-21T06:47:50Z 2022 Final Year Project (FYP) Suthakar, S. G. (2022). Deep-learning for conversational speech using semantic textual analysis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156632 https://hdl.handle.net/10356/156632 en SCSE21-0063 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing Suthakar, Shiny Gladdys Deep-learning for conversational speech using semantic textual analysis |
description |
Automatic Speech Recognition (ASR) systems today have a prominent and
widespread impact among software applications of different domains.
They are usually embedded in the applications to provide user input to
the main functionality, hence, acting as the cornerstone of these
applications, especially potentially life-saving ones. However, most ASR
systems today can only work effectively on formal speech input. They
have a lot of room to fully understand speech of colloquial nature.
Focusing on English speech in the Singaporean context, this project
aims to provide a solution for generating formal semantic equivalents of
conversational sentences derived from speech. Thus, acoustic and
language models of existing ASR systems can be trained with these
mappings from conversational to formal text, thus acquiring better
comprehension and performance when receiving informal speech input.
Furthermore, this project aims to analyse the semantic similarity
performance of deep-learning models in terms of semantically similar
formal sentence generation and their use of deep-learning techniques. The
experimentation results show that the PEGASUS model performs better
holistically. This report will present the proposed solution framework and
lay out in detail the components of the project implementation. |
author2 |
Chng Eng Siong |
author_facet |
Chng Eng Siong Suthakar, Shiny Gladdys |
format |
Final Year Project |
author |
Suthakar, Shiny Gladdys |
author_sort |
Suthakar, Shiny Gladdys |
title |
Deep-learning for conversational speech using semantic textual analysis |
title_short |
Deep-learning for conversational speech using semantic textual analysis |
title_full |
Deep-learning for conversational speech using semantic textual analysis |
title_fullStr |
Deep-learning for conversational speech using semantic textual analysis |
title_full_unstemmed |
Deep-learning for conversational speech using semantic textual analysis |
title_sort |
deep-learning for conversational speech using semantic textual analysis |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156632 |
_version_ |
1731235750491455488 |