Heuristic development in the use of large language models for materials science

In the ever-expanding field of Artificial Intelligence (AI) tools, Generative Pre-trained Transformer (GPT) is one of many Large Language Models (LLMs) that has revolutionized how academia can interact with data. Simple sentences can be fed into GPTs to return long paragraphs of explanation, informa...

Full description

Saved in:
Bibliographic Details
Main Author: Chye, Vincent Zhen Guang
Other Authors: Leonard Ng Wei Tat
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/176081
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-176081
record_format dspace
spelling sg-ntu-dr.10356-1760812024-05-18T16:46:25Z Heuristic development in the use of large language models for materials science Chye, Vincent Zhen Guang Leonard Ng Wei Tat School of Materials Science and Engineering Maung Thway leonard.ngwt@ntu.edu.sg, maung.thway@ntu.edu.sg Computer and Information Science Engineering Large language models Domain specific transformer Chatbot In the ever-expanding field of Artificial Intelligence (AI) tools, Generative Pre-trained Transformer (GPT) is one of many Large Language Models (LLMs) that has revolutionized how academia can interact with data. Simple sentences can be fed into GPTs to return long paragraphs of explanation, informative search results and even form discussions. This brought a spike in how information is utilised and shifted the entire tech sector in ways comparable to the birth of the internet. This feasibility study, we will harness GPT’s learning capabilities in the form of a chatbot that can bridge knowledge gaps within the field of Material Science. The method proposed is to enrich GPTs with context, through papers selected by a set of specialised search algorithm. By distilling essential research papers and textbook resources, the chatbot is tailored to generate detailed conversations within the user’s specified domain. The experimentations have used AI tools from the following: open-source models from Hugging Face, free literature database tool from Litmaps and paid model of GPT-4 by OpenAI. Bachelor's degree 2024-05-13T11:38:01Z 2024-05-13T11:38:01Z 2024 Final Year Project (FYP) Chye, V. Z. G. (2024). Heuristic development in the use of large language models for materials science. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176081 https://hdl.handle.net/10356/176081 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Engineering
Large language models
Domain specific transformer
Chatbot
spellingShingle Computer and Information Science
Engineering
Large language models
Domain specific transformer
Chatbot
Chye, Vincent Zhen Guang
Heuristic development in the use of large language models for materials science
description In the ever-expanding field of Artificial Intelligence (AI) tools, Generative Pre-trained Transformer (GPT) is one of many Large Language Models (LLMs) that has revolutionized how academia can interact with data. Simple sentences can be fed into GPTs to return long paragraphs of explanation, informative search results and even form discussions. This brought a spike in how information is utilised and shifted the entire tech sector in ways comparable to the birth of the internet. This feasibility study, we will harness GPT’s learning capabilities in the form of a chatbot that can bridge knowledge gaps within the field of Material Science. The method proposed is to enrich GPTs with context, through papers selected by a set of specialised search algorithm. By distilling essential research papers and textbook resources, the chatbot is tailored to generate detailed conversations within the user’s specified domain. The experimentations have used AI tools from the following: open-source models from Hugging Face, free literature database tool from Litmaps and paid model of GPT-4 by OpenAI.
author2 Leonard Ng Wei Tat
author_facet Leonard Ng Wei Tat
Chye, Vincent Zhen Guang
format Final Year Project
author Chye, Vincent Zhen Guang
author_sort Chye, Vincent Zhen Guang
title Heuristic development in the use of large language models for materials science
title_short Heuristic development in the use of large language models for materials science
title_full Heuristic development in the use of large language models for materials science
title_fullStr Heuristic development in the use of large language models for materials science
title_full_unstemmed Heuristic development in the use of large language models for materials science
title_sort heuristic development in the use of large language models for materials science
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/176081
_version_ 1800916248262868992