AI for human-computer interaction
The use of chatbots to have conversational interactions with users is growing in popularity. They are used for a variety of purposes, such as information retrieval, customer service, and entertainment. In this project, we analyze how to create a chatbot that can converse with users in a natural an...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/166586 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-166586 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1665862023-05-05T15:41:57Z AI for human-computer interaction Kyaw, Htet Aung Erik Cambria School of Computer Science and Engineering Rui Mao cambria@ntu.edu.sg, rui.mao@ntu.edu.sg Engineering::Computer science and engineering The use of chatbots to have conversational interactions with users is growing in popularity. They are used for a variety of purposes, such as information retrieval, customer service, and entertainment. In this project, we analyze how to create a chatbot that can converse with users in a natural and the appropriate manner by utilizing the PyTorch deep learning library. We start by gathering a large amount of conversational data and pre-processing it by cleaning, filtering, and tokenizing it. The PyTorch library is then used to train a neural network model based on a sequence-to-sequence architecture with an attention mechanism. The model takes a sequence of tokens representing a user's message as input and produces a sequence of tokens representing the chatbot's response. We use both automatic metrics and human evaluation through user studies to evaluate the performance of our chatbot. Our results show that our chatbot can respond to user input in a coherent and contextually appropriate manner, outperforming several baseline models. Our model can also handle long and complex conversations, demonstrating resilience in maintaining context across multiple turns. In addition to evaluating our model's performance, we explore the effect of various model configurations, such as the number of layers, hidden size, and attention mechanism. We discovered that increasing the model's complexity can lead to better performance, but it also increases training time and demands more data to avoid overfitting. Overall, our findings show that PyTorch can be used to build conversational agents and highlight the potential for further improvements in chatbot performance using advanced deep learning techniques. Our work also adds to the growing research on building chatbots with deep learning methods, laying the groundwork for future work in this area. Bachelor of Engineering (Computer Science) 2023-05-05T08:20:05Z 2023-05-05T08:20:05Z 2023 Final Year Project (FYP) Kyaw, H. A. (2023). AI for human-computer interaction. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/166586 https://hdl.handle.net/10356/166586 en PSCSE21-0006 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Kyaw, Htet Aung AI for human-computer interaction |
description |
The use of chatbots to have conversational interactions with users is growing in popularity. They are
used for a variety of purposes, such as information retrieval, customer service, and entertainment.
In this project, we analyze how to create a chatbot that can converse with users in a natural and
the appropriate manner by utilizing the PyTorch deep learning library.
We start by gathering a large amount of conversational data and pre-processing it by cleaning,
filtering, and tokenizing it. The PyTorch library is then used to train a neural network model based on
a sequence-to-sequence architecture with an attention mechanism. The model takes a sequence of
tokens representing a user's message as input and produces a sequence of tokens representing the
chatbot's response.
We use both automatic metrics and human evaluation through user studies to evaluate the
performance of our chatbot. Our results show that our chatbot can respond to user input in a
coherent and contextually appropriate manner, outperforming several baseline models. Our model
can also handle long and complex conversations, demonstrating resilience in maintaining context
across multiple turns.
In addition to evaluating our model's performance, we explore the effect of various model
configurations, such as the number of layers, hidden size, and attention mechanism. We discovered
that increasing the model's complexity can lead to better performance, but it also increases training
time and demands more data to avoid overfitting.
Overall, our findings show that PyTorch can be used to build conversational agents and highlight the
potential for further improvements in chatbot performance using advanced deep learning
techniques. Our work also adds to the growing research on building chatbots with deep
learning methods, laying the groundwork for future work in this area. |
author2 |
Erik Cambria |
author_facet |
Erik Cambria Kyaw, Htet Aung |
format |
Final Year Project |
author |
Kyaw, Htet Aung |
author_sort |
Kyaw, Htet Aung |
title |
AI for human-computer interaction |
title_short |
AI for human-computer interaction |
title_full |
AI for human-computer interaction |
title_fullStr |
AI for human-computer interaction |
title_full_unstemmed |
AI for human-computer interaction |
title_sort |
ai for human-computer interaction |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/166586 |
_version_ |
1770564030884216832 |