Heuristic development in the use of large language models for materials science
This project seeks to make use of the benefits of Artificial Intelligence, specifically large language models (LLMs), within the field of materials science and engineering. The overarching objective is to enhance the toolkit available to materials researchers, aiming to optimize research efficien...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/176095 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-176095 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1760952024-05-18T16:46:14Z Heuristic development in the use of large language models for materials science Cho, Zen Han Leonard Ng Wei Tat School of Materials Science and Engineering leonard.ngwt@ntu.edu.sg Engineering This project seeks to make use of the benefits of Artificial Intelligence, specifically large language models (LLMs), within the field of materials science and engineering. The overarching objective is to enhance the toolkit available to materials researchers, aiming to optimize research efficiency. The project achieves its objective by developing a specialized chatbot designed to respond to inquiries related to fabrication of graphene. The project leverages on graphene as it is a material of significant interest and importance within the discipline. Grounded in the principles of retrieval augmented generation (RAG) and prompt engineering, this chatbot ensures reliability and accuracy in its responses. Furthermore, the project conducts a comparative analysis of multiple Large Language Models (LLMs) to identify the optimal model to be used. The project crafted 10 questions pertaining to graphene and devised a scoring matrix to assess the performance of the employed LLMs. To minimize external variables, consistent model parameters were maintained across experiments, ensuring that obtained scores were primarily indicative of LLM performance. The software platforms utilized included Amazon Web Services and Google Colab. The project conducted a comparative analysis of model outputs with and without the integration of RAG and prompt engineering techniques. The findings indicated that outputs generated using these techniques demonstrated significantly improved accuracy and informativeness. Furthermore, an analytical evaluation was undertaken to assess the performance of four Large Language Models (LLMs) in addressing graphene-related queries, revealing that Google's Gemini-pro model outperformed the others. Bachelor's degree 2024-05-13T12:23:40Z 2024-05-13T12:23:40Z 2024 Final Year Project (FYP) Cho, Z. H. (2024). Heuristic development in the use of large language models for materials science. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176095 https://hdl.handle.net/10356/176095 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering |
spellingShingle |
Engineering Cho, Zen Han Heuristic development in the use of large language models for materials science |
description |
This project seeks to make use of the benefits of Artificial Intelligence, specifically
large language models (LLMs), within the field of materials science and engineering.
The overarching objective is to enhance the toolkit available to materials researchers,
aiming to optimize research efficiency.
The project achieves its objective by developing a specialized chatbot designed to
respond to inquiries related to fabrication of graphene. The project leverages on
graphene as it is a material of significant interest and importance within the discipline.
Grounded in the principles of retrieval augmented generation (RAG) and prompt
engineering, this chatbot ensures reliability and accuracy in its responses. Furthermore,
the project conducts a comparative analysis of multiple Large Language Models
(LLMs) to identify the optimal model to be used.
The project crafted 10 questions pertaining to graphene and devised a scoring matrix
to assess the performance of the employed LLMs. To minimize external variables,
consistent model parameters were maintained across experiments, ensuring that
obtained scores were primarily indicative of LLM performance. The software
platforms utilized included Amazon Web Services and Google Colab.
The project conducted a comparative analysis of model outputs with and without the
integration of RAG and prompt engineering techniques. The findings indicated that
outputs generated using these techniques demonstrated significantly improved
accuracy and informativeness. Furthermore, an analytical evaluation was undertaken
to assess the performance of four Large Language Models (LLMs) in addressing
graphene-related queries, revealing that Google's Gemini-pro model outperformed the
others. |
author2 |
Leonard Ng Wei Tat |
author_facet |
Leonard Ng Wei Tat Cho, Zen Han |
format |
Final Year Project |
author |
Cho, Zen Han |
author_sort |
Cho, Zen Han |
title |
Heuristic development in the use of large language models for materials science |
title_short |
Heuristic development in the use of large language models for materials science |
title_full |
Heuristic development in the use of large language models for materials science |
title_fullStr |
Heuristic development in the use of large language models for materials science |
title_full_unstemmed |
Heuristic development in the use of large language models for materials science |
title_sort |
heuristic development in the use of large language models for materials science |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/176095 |
_version_ |
1800916207839215616 |