A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS

The recent development of language models has shown promising results by achieving state-of-the-art performance on various natural language tasks by finetuning pre-trained models. In task-oriented dialogue (ToD) systems, language models can be used for end-to-end training without relying on dialog...

Full description

Saved in:
Bibliographic Details
Main Author: Marselino Andreas, Vinsen
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/56938
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:56938
spelling id-itb.:569382021-07-22T19:24:51ZA COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS Marselino Andreas, Vinsen Indonesia Final Project language model, end-to-end, task-oriented dialogue system INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/56938 The recent development of language models has shown promising results by achieving state-of-the-art performance on various natural language tasks by finetuning pre-trained models. In task-oriented dialogue (ToD) systems, language models can be used for end-to-end training without relying on dialogue state tracking to track the dialogue history but allowing the language models to generate responses according to the context given as input. This paper conducts a comparative study to show the effectiveness and strength of using recent pre-trained models for finetuning, such as BART and T5, on end-to-end ToD systems. The experimental results show substantial performance improvements after language model fine-tuning. The models produce more fluent responses after adding knowledge to the context that guides the model to avoid hallucination and generate accurate entities in the generated responses. Furthermore, we found that BART and T5 outperform GPT-based models in BLEU and F1 scores and achieve state-of-the-art performance in a ToD system. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description The recent development of language models has shown promising results by achieving state-of-the-art performance on various natural language tasks by finetuning pre-trained models. In task-oriented dialogue (ToD) systems, language models can be used for end-to-end training without relying on dialogue state tracking to track the dialogue history but allowing the language models to generate responses according to the context given as input. This paper conducts a comparative study to show the effectiveness and strength of using recent pre-trained models for finetuning, such as BART and T5, on end-to-end ToD systems. The experimental results show substantial performance improvements after language model fine-tuning. The models produce more fluent responses after adding knowledge to the context that guides the model to avoid hallucination and generate accurate entities in the generated responses. Furthermore, we found that BART and T5 outperform GPT-based models in BLEU and F1 scores and achieve state-of-the-art performance in a ToD system.
format Final Project
author Marselino Andreas, Vinsen
spellingShingle Marselino Andreas, Vinsen
A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS
author_facet Marselino Andreas, Vinsen
author_sort Marselino Andreas, Vinsen
title A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS
title_short A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS
title_full A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS
title_fullStr A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS
title_full_unstemmed A COMPARATIVE STUDY ON LANGUAGE MODELS FOR TASK-ORIENTED DIALOGUE SYSTEMS
title_sort comparative study on language models for task-oriented dialogue systems
url https://digilib.itb.ac.id/gdl/view/56938
_version_ 1822274742793535488