Deep-learning for conversational speech using semantic textual analysis

Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especia...

Full description

Saved in:
Bibliographic Details
Main Author: Suthakar, Shiny Gladdys
Other Authors: Chng Eng Siong
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156632
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156632
record_format dspace
spelling sg-ntu-dr.10356-1566322022-04-21T06:47:50Z Deep-learning for conversational speech using semantic textual analysis Suthakar, Shiny Gladdys Chng Eng Siong School of Computer Science and Engineering ASESChng@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Document and text processing Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especially potentially life-saving ones. However, most ASR systems today can only work effectively on formal speech input. They have a lot of room to fully understand speech of colloquial nature. Focusing on English speech in the Singaporean context, this project aims to provide a solution for generating formal semantic equivalents of conversational sentences derived from speech. Thus, acoustic and language models of existing ASR systems can be trained with these mappings from conversational to formal text, thus acquiring better comprehension and performance when receiving informal speech input. Furthermore, this project aims to analyse the semantic similarity performance of deep-learning models in terms of semantically similar formal sentence generation and their use of deep-learning techniques. The experimentation results show that the PEGASUS model performs better holistically. This report will present the proposed solution framework and lay out in detail the components of the project implementation. Bachelor of Engineering (Computer Science) 2022-04-21T06:47:50Z 2022-04-21T06:47:50Z 2022 Final Year Project (FYP) Suthakar, S. G. (2022). Deep-learning for conversational speech using semantic textual analysis. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156632 https://hdl.handle.net/10356/156632 en SCSE21-0063 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Document and text processing
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Engineering::Computer science and engineering::Computing methodologies::Document and text processing
Suthakar, Shiny Gladdys
Deep-learning for conversational speech using semantic textual analysis
description Automatic Speech Recognition (ASR) systems today have a prominent and widespread impact among software applications of different domains. They are usually embedded in the applications to provide user input to the main functionality, hence, acting as the cornerstone of these applications, especially potentially life-saving ones. However, most ASR systems today can only work effectively on formal speech input. They have a lot of room to fully understand speech of colloquial nature. Focusing on English speech in the Singaporean context, this project aims to provide a solution for generating formal semantic equivalents of conversational sentences derived from speech. Thus, acoustic and language models of existing ASR systems can be trained with these mappings from conversational to formal text, thus acquiring better comprehension and performance when receiving informal speech input. Furthermore, this project aims to analyse the semantic similarity performance of deep-learning models in terms of semantically similar formal sentence generation and their use of deep-learning techniques. The experimentation results show that the PEGASUS model performs better holistically. This report will present the proposed solution framework and lay out in detail the components of the project implementation.
author2 Chng Eng Siong
author_facet Chng Eng Siong
Suthakar, Shiny Gladdys
format Final Year Project
author Suthakar, Shiny Gladdys
author_sort Suthakar, Shiny Gladdys
title Deep-learning for conversational speech using semantic textual analysis
title_short Deep-learning for conversational speech using semantic textual analysis
title_full Deep-learning for conversational speech using semantic textual analysis
title_fullStr Deep-learning for conversational speech using semantic textual analysis
title_full_unstemmed Deep-learning for conversational speech using semantic textual analysis
title_sort deep-learning for conversational speech using semantic textual analysis
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156632
_version_ 1731235750491455488