Federated learning for natural language processing in medical domain

Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep...

Full description

Saved in:

Bibliographic Details
Main Author:	Saraf, Ishita
Other Authors:	Anupam Chattopadhyay
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent
Online Access:	https://hdl.handle.net/10356/175336
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-175336
record_format	dspace
spelling	sg-ntu-dr.10356-1753362024-04-26T15:42:35Z Federated learning for natural language processing in medical domain Saraf, Ishita Anupam Chattopadhyay School of Computer Science and Engineering anupam@ntu.edu.sg Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep learning algorithms have become the norm in the field of NLP. One of the biggest challenges faced by such data-dependent NLP applications is related to data scarcity and privacy. Federated Learning (FL) is a convincing solution to overcome the issues regarding confidentiality of training data and constraints imposed by inadequate data in specific domains. The main objective of this study is to build a medical assistance chatbot using federated learning to overcome the limitations posed by the private nature data in the medical domain. The study compares the performance of centralized training and federated learning for fine-tuning a BERT-based conversational agent on a medical conversation dataset, serving as a stepping-stone for future research into federated learning for LLM-led NLP applications. Bachelor's degree 2024-04-23T02:35:49Z 2024-04-23T02:35:49Z 2024 Final Year Project (FYP) Saraf, I. (2024). Federated learning for natural language processing in medical domain. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175336 https://hdl.handle.net/10356/175336 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent
spellingShingle	Computer and Information Science Federated learning Natural language processing Chatbot Medical Conversational agent Saraf, Ishita Federated learning for natural language processing in medical domain
description	Recent years have witnessed an unprecedented surge in interest and innovation in the field of Natural Language Processing (NLP), largely due to ground-breaking developments such as the creation of ChatGPT and other Large Language Model (LLM) applications. In today's landscape, data-driven deep learning algorithms have become the norm in the field of NLP. One of the biggest challenges faced by such data-dependent NLP applications is related to data scarcity and privacy. Federated Learning (FL) is a convincing solution to overcome the issues regarding confidentiality of training data and constraints imposed by inadequate data in specific domains. The main objective of this study is to build a medical assistance chatbot using federated learning to overcome the limitations posed by the private nature data in the medical domain. The study compares the performance of centralized training and federated learning for fine-tuning a BERT-based conversational agent on a medical conversation dataset, serving as a stepping-stone for future research into federated learning for LLM-led NLP applications.
author2	Anupam Chattopadhyay
author_facet	Anupam Chattopadhyay Saraf, Ishita
format	Final Year Project
author	Saraf, Ishita
author_sort	Saraf, Ishita
title	Federated learning for natural language processing in medical domain
title_short	Federated learning for natural language processing in medical domain
title_full	Federated learning for natural language processing in medical domain
title_fullStr	Federated learning for natural language processing in medical domain
title_full_unstemmed	Federated learning for natural language processing in medical domain
title_sort	federated learning for natural language processing in medical domain
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/175336
_version_	1814047216684236800

Federated learning for natural language processing in medical domain

Similar Items