Federated learning for natural language processing in medical domain
Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/175336 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-175336 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1753362024-04-26T15:42:35Z Federated learning for natural language processing in medical domain Saraf, Ishita Anupam Chattopadhyay School of Computer Science and Engineering anupam@ntu.edu.sg Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep learning algorithms have become the norm in the field of NLP. One of the biggest challenges faced by such data-dependent NLP applications is related to data scarcity and privacy. Federated Learning (FL) is a convincing solution to overcome the issues regarding confidentiality of training data and constraints imposed by inadequate data in specific domains. The main objective of this study is to build a medical assistance chatbot using federated learning to overcome the limitations posed by the private nature data in the medical domain. The study compares the performance of centralized training and federated learning for fine-tuning a BERT-based conversational agent on a medical conversation dataset, serving as a stepping-stone for future research into federated learning for LLM-led NLP applications. Bachelor's degree 2024-04-23T02:35:49Z 2024-04-23T02:35:49Z 2024 Final Year Project (FYP) Saraf, I. (2024). Federated learning for natural language processing in medical domain. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175336 https://hdl.handle.net/10356/175336 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent |
spellingShingle |
Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent Saraf, Ishita Federated learning for natural language processing in medical domain |
description |
Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep learning algorithms have become the norm in the field of NLP. One of the biggest challenges faced by such data-dependent NLP applications is related to data scarcity and privacy. Federated Learning (FL) is a convincing solution to overcome the issues regarding confidentiality of training data and constraints imposed by inadequate data in specific domains. The main objective of this study is to build a medical assistance chatbot using federated learning to overcome the limitations posed by the private nature data in the medical domain. The study compares the performance of centralized training and federated learning for fine-tuning a BERT-based conversational agent on a medical conversation dataset, serving as a stepping-stone for future research into federated learning for LLM-led NLP applications. |
author2 |
Anupam Chattopadhyay |
author_facet |
Anupam Chattopadhyay Saraf, Ishita |
format |
Final Year Project |
author |
Saraf, Ishita |
author_sort |
Saraf, Ishita |
title |
Federated learning for natural language processing in medical domain |
title_short |
Federated learning for natural language processing in medical domain |
title_full |
Federated learning for natural language processing in medical domain |
title_fullStr |
Federated learning for natural language processing in medical domain |
title_full_unstemmed |
Federated learning for natural language processing in medical domain |
title_sort |
federated learning for natural language processing in medical domain |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/175336 |
_version_ |
1814047216684236800 |