Deep-learning for conversational speech using semantic textual analysis

Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especia...

Full description

Saved in:

Bibliographic Details
Main Author:	Suthakar, Shiny Gladdys
Other Authors:	Chng Eng Siong
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Online Access:	https://hdl.handle.net/10356/156632
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-156632
record_format	dspace
spelling	sg-ntu-dr.10356-1566322022-04-21T06:47:50Z Deep-learning for conversational speech using semantic textual analysis Suthakar, Shiny Gladdys Chng Eng Siong School of Computer Science and Engineering ASESChng@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especially potentially life-saving ones. However, most ASR systems today can only work effectively on formal speech input. They have a lot of room to fully understand speech of colloquial nature. Focusing on English speech in the Singaporean context, this project aims to provide a solution for generating formal semantic equivalents of conversational sentences derived from speech. Thus, acoustic and language models of existing ASR systems can be trained with these mappings from conversational to formal text, thus acquiring better comprehension and performance when receiving informal speech input. Furthermore, this project aims to analyse the semantic similarity performance of deep-learning models in terms of semantically similar formal sentence generation and their use of deep-learning techniques. The experimentation results show that the PEGASUS model performs better holistically. This report will present the proposed solution framework and lay out in detail the components of the project implementation. Bachelor of Engineering (Computer Science) 2022-04-21T06:47:50Z 2022-04-21T06:47:50Z 2022 Final Year Project (FYP) Suthakar, S. G. (2022). Deep-learning for conversational speech using semantic textual analysis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156632 https://hdl.handle.net/10356/156632 en SCSE21-0063 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing Suthakar, Shiny Gladdys Deep-learning for conversational speech using semantic textual analysis
description	Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especially potentially life-saving ones. However, most ASR systems today can only work effectively on formal speech input. They have a lot of room to fully understand speech of colloquial nature. Focusing on English speech in the Singaporean context, this project aims to provide a solution for generating formal semantic equivalents of conversational sentences derived from speech. Thus, acoustic and language models of existing ASR systems can be trained with these mappings from conversational to formal text, thus acquiring better comprehension and performance when receiving informal speech input. Furthermore, this project aims to analyse the semantic similarity performance of deep-learning models in terms of semantically similar formal sentence generation and their use of deep-learning techniques. The experimentation results show that the PEGASUS model performs better holistically. This report will present the proposed solution framework and lay out in detail the components of the project implementation.
author2	Chng Eng Siong
author_facet	Chng Eng Siong Suthakar, Shiny Gladdys
format	Final Year Project
author	Suthakar, Shiny Gladdys
author_sort	Suthakar, Shiny Gladdys
title	Deep-learning for conversational speech using semantic textual analysis
title_short	Deep-learning for conversational speech using semantic textual analysis
title_full	Deep-learning for conversational speech using semantic textual analysis
title_fullStr	Deep-learning for conversational speech using semantic textual analysis
title_full_unstemmed	Deep-learning for conversational speech using semantic textual analysis
title_sort	deep-learning for conversational speech using semantic textual analysis
publisher	Nanyang Technological University
publishDate	2022
url	https://hdl.handle.net/10356/156632
_version_	1731235750491455488

Deep-learning for conversational speech using semantic textual analysis

Similar Items