Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre

To improve customer experience, businesses need a deeper understanding of their customers. Customer service call centres gather personalised information about customers, and hence help us to gain this understanding. However, manual data mining of this information introduces redundancy. This project...

Full description

Saved in:
Bibliographic Details
Main Author: Muhammad Rais Fawwazi
Other Authors: Chen Lihui
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/76363
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-76363
record_format dspace
spelling sg-ntu-dr.10356-763632023-07-07T16:57:28Z Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre Muhammad Rais Fawwazi Chen Lihui School of Electrical and Electronic Engineering Traveloka Services Pte. Ltd. Tan Freddy Kusnadi DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems To improve customer experience, businesses need a deeper understanding of their customers. Customer service call centres gather personalised information about customers, and hence help us to gain this understanding. However, manual data mining of this information introduces redundancy. This project aims to minimise this redundancy through speech recognition system. This project explores deep learning as a solution to speech recognition. Using Indonesian, we focus primarily on data-scarce language. However, we need to compensate the data scarcity to satisfy the need of abundant data required by deep learning-based models. We focus on transfer learning to help the model learn Indonesian by first learning other languages that have adequate data. Transfer learning improves the model performance significantly, while data collection effort is still required. We also discuss the feasibility of building a speech recognition system within a company for business use. As there are commercial solutions such as Google Cloud Speech API, we might consider them as appealing alternatives. However, by increasing efficiency in the pipeline and create higher-level products based on speech recognition, it might be feasible to build and maintain the system. Bachelor of Engineering (Electrical and Electronic Engineering) 2018-12-20T06:21:35Z 2018-12-20T06:21:35Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/76363 en Nanyang Technological University 52 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems
Muhammad Rais Fawwazi
Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
description To improve customer experience, businesses need a deeper understanding of their customers. Customer service call centres gather personalised information about customers, and hence help us to gain this understanding. However, manual data mining of this information introduces redundancy. This project aims to minimise this redundancy through speech recognition system. This project explores deep learning as a solution to speech recognition. Using Indonesian, we focus primarily on data-scarce language. However, we need to compensate the data scarcity to satisfy the need of abundant data required by deep learning-based models. We focus on transfer learning to help the model learn Indonesian by first learning other languages that have adequate data. Transfer learning improves the model performance significantly, while data collection effort is still required. We also discuss the feasibility of building a speech recognition system within a company for business use. As there are commercial solutions such as Google Cloud Speech API, we might consider them as appealing alternatives. However, by increasing efficiency in the pipeline and create higher-level products based on speech recognition, it might be feasible to build and maintain the system.
author2 Chen Lihui
author_facet Chen Lihui
Muhammad Rais Fawwazi
format Final Year Project
author Muhammad Rais Fawwazi
author_sort Muhammad Rais Fawwazi
title Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
title_short Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
title_full Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
title_fullStr Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
title_full_unstemmed Deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
title_sort deep learning-based speech recognition system for data-scarce language: a feasibility study for a call centre
publishDate 2018
url http://hdl.handle.net/10356/76363
_version_ 1772827897058295808