INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH
A Task-oriented dialogue system (ToDS) is a conversational agent designed to communicate with users in natural language, assisting them in completing various user tasks i.e. making restaurant reservations or purchasing tickets. Recently, many researchers are interested in using the end-to-end app...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/73923 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:73923 |
---|---|
spelling |
id-itb.:739232023-06-25T09:29:16ZINDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH Dehan Al Kautsar, Muhammad Indonesia Final Project task-oriented dialogue system, end-to-end, low-resource language INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/73923 A Task-oriented dialogue system (ToDS) is a conversational agent designed to communicate with users in natural language, assisting them in completing various user tasks i.e. making restaurant reservations or purchasing tickets. Recently, many researchers are interested in using the end-to-end approach to build the ToDS framework, as it offers simplicity compared to the other approach. However, since there is no ToDS corpus available in Indonesia, which is an under-represented language, an Indonesian ToDS benchmark has never been conducted. The main focus of this thesis involves the construction of two ToDS training corpora specifically designed for the Indonesian language. Additionally, the thesis includes conducting experiments using two distinct ToDS frameworks, while utilizing the manually created corpora. The total number of constructed dialogues for the Indonesian ToDS corpus in this thesis amounts to 999 dialogues, originating from two English ToDS corpora named CamRest and SMD. The experiments conducted in this study evaluate the frameworks in three types of experiments: monolingual, bilingual, and cross-lingual transfer learning. The end-to-end frameworks tested in this thesis are Sequicity and MinTL, which differ in the type of word embedding employed by each framework. The experiment results reveal that the language adaptation process incorporating two sets of Indonesian and English corpora during the training phase of the ToDS framework has a positive impact on the model's capability to successfully complete user-given tasks. Furthermore, bilingual experiments consistently yield the best metric values across all experiments. However, both frameworks struggle to achieve satisfactory results in cross-lingual experiments due to the language disparity between the training and testing corpora. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
A Task-oriented dialogue system (ToDS) is a conversational agent designed to communicate
with users in natural language, assisting them in completing various user tasks i.e. making
restaurant reservations or purchasing tickets. Recently, many researchers are interested in
using the end-to-end approach to build the ToDS framework, as it offers simplicity compared
to the other approach. However, since there is no ToDS corpus available in Indonesia, which
is an under-represented language, an Indonesian ToDS benchmark has never been conducted.
The main focus of this thesis involves the construction of two ToDS training corpora
specifically designed for the Indonesian language. Additionally, the thesis includes
conducting experiments using two distinct ToDS frameworks, while utilizing the manually
created corpora. The total number of constructed dialogues for the Indonesian ToDS corpus
in this thesis amounts to 999 dialogues, originating from two English ToDS corpora named
CamRest and SMD. The experiments conducted in this study evaluate the frameworks in
three types of experiments: monolingual, bilingual, and cross-lingual transfer learning. The
end-to-end frameworks tested in this thesis are Sequicity and MinTL, which differ in the type
of word embedding employed by each framework.
The experiment results reveal that the language adaptation process incorporating two sets of
Indonesian and English corpora during the training phase of the ToDS framework has a
positive impact on the model's capability to successfully complete user-given tasks.
Furthermore, bilingual experiments consistently yield the best metric values across all
experiments. However, both frameworks struggle to achieve satisfactory results in
cross-lingual experiments due to the language disparity between the training and testing
corpora. |
format |
Final Project |
author |
Dehan Al Kautsar, Muhammad |
spellingShingle |
Dehan Al Kautsar, Muhammad INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH |
author_facet |
Dehan Al Kautsar, Muhammad |
author_sort |
Dehan Al Kautsar, Muhammad |
title |
INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH |
title_short |
INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH |
title_full |
INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH |
title_fullStr |
INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH |
title_full_unstemmed |
INDONESIAN TASK-ORIENTED DIALOGUE SYSTEM USING END-TO-END APPROACH |
title_sort |
indonesian task-oriented dialogue system using end-to-end approach |
url |
https://digilib.itb.ac.id/gdl/view/73923 |
_version_ |
1822007249480974336 |