INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH

A task-oriented dialogue system (ToDS) is a conversational agent designed to communicate with users in natural language and help them complete tasks such as making restaurant reservations or purchasing tickets. Recently, many researchers have become interested in using the end-to-end app...

Bibliographic Details
Main Author: Dehan Al Kautsar, Muhammad
Format: Final Project
Language: Indonesia
Online Access: https://digilib.itb.ac.id/gdl/view/73923
Institution: Institut Teknologi Bandung
id id-itb.:73923
spelling id-itb.:73923 2023-06-25T09:29:16Z
keywords task-oriented dialogue system, end-to-end, low-resource language
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description A task-oriented dialogue system (ToDS) is a conversational agent designed to communicate with users in natural language and help them complete tasks such as making restaurant reservations or purchasing tickets. Recently, many researchers have become interested in building ToDS frameworks with the end-to-end approach because it is simpler than the alternative approach. However, no ToDS corpus exists for Indonesian, an under-represented language, so an Indonesian ToDS benchmark has never been established. The main focus of this thesis is the construction of two ToDS training corpora designed specifically for the Indonesian language. The thesis also reports experiments with two distinct ToDS frameworks trained on these manually created corpora. The Indonesian ToDS corpus constructed in this thesis contains 999 dialogues in total, derived from two English ToDS corpora, CamRest and SMD. The frameworks are evaluated in three types of experiments: monolingual, bilingual, and cross-lingual transfer learning. The end-to-end frameworks tested are Sequicity and MinTL, which differ in the type of word embedding each employs. The results show that language adaptation, in which both the Indonesian and the English corpora are used during training, improves the model's ability to complete user-given tasks. Bilingual experiments consistently yield the best metric values across all experiments. However, both frameworks struggle to achieve satisfactory results in cross-lingual experiments because of the language disparity between the training and testing corpora.
format Final Project
author Dehan Al Kautsar, Muhammad
title INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH
url https://digilib.itb.ac.id/gdl/view/73923
_version_ 1822007249480974336