Federated learning for natural language processing in medical domain

Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep...

Full description

Saved in:
Bibliographic Details
Main Author: Saraf, Ishita
Other Authors: Anupam Chattopadhyay
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175336
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175336
record_format dspace
spelling sg-ntu-dr.10356-1753362024-04-26T15:42:35Z Federated learning for natural language processing in medical domain Saraf, Ishita Anupam Chattopadhyay School of Computer Science and Engineering anupam@ntu.edu.sg Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep learning algorithms have become the norm in the field of NLP. One of the biggest challenges faced by such data-dependent NLP applications is related to data scarcity and privacy. Federated Learning (FL) is a convincing solution to overcome the issues regarding confidentiality of training data and constraints imposed by inadequate data in specific domains. The main objective of this study is to build a medical assistance chatbot using federated learning to overcome the limitations posed by the private nature data in the medical domain. The study compares the performance of centralized training and federated learning for fine-tuning a BERT-based conversational agent on a medical conversation dataset, serving as a stepping-stone for future research into federated learning for LLM-led NLP applications. Bachelor's degree 2024-04-23T02:35:49Z 2024-04-23T02:35:49Z 2024 Final Year Project (FYP) Saraf, I. (2024). Federated learning for natural language processing in medical domain. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175336 https://hdl.handle.net/10356/175336 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Federated learning
Natural language processing
Chatbot
Medical
Conversational agent
spellingShingle Computer and Information Science
Federated learning
Natural language processing
Chatbot
Medical
Conversational agent
Saraf, Ishita
Federated learning for natural language processing in medical domain
description Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep learning algorithms have become the norm in the field of NLP. One of the biggest challenges faced by such data-dependent NLP applications is related to data scarcity and privacy. Federated Learning (FL) is a convincing solution to overcome the issues regarding confidentiality of training data and constraints imposed by inadequate data in specific domains. The main objective of this study is to build a medical assistance chatbot using federated learning to overcome the limitations posed by the private nature data in the medical domain. The study compares the performance of centralized training and federated learning for fine-tuning a BERT-based conversational agent on a medical conversation dataset, serving as a stepping-stone for future research into federated learning for LLM-led NLP applications.
author2 Anupam Chattopadhyay
author_facet Anupam Chattopadhyay
Saraf, Ishita
format Final Year Project
author Saraf, Ishita
author_sort Saraf, Ishita
title Federated learning for natural language processing in medical domain
title_short Federated learning for natural language processing in medical domain
title_full Federated learning for natural language processing in medical domain
title_fullStr Federated learning for natural language processing in medical domain
title_full_unstemmed Federated learning for natural language processing in medical domain
title_sort federated learning for natural language processing in medical domain
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175336
_version_ 1814047216684236800