Exploring machine learning methods on molecular data

Understanding molecular data can be useful for various fields of science, including biology, chemistry, and material science to name a few. In this report, XRD (X-Ray Diffraction) data and protein ligand binding affinity data is looked at and machine learning techniques a...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Chen Hui
Other Authors: Xia Kelin
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148528
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148528
record_format dspace
spelling sg-ntu-dr.10356-1485282023-02-28T23:13:40Z Exploring machine learning methods on molecular data Tan, Chen Hui Xia Kelin School of Physical and Mathematical Sciences xiakelin@ntu.edu.sg Science::Mathematics::Applied mathematics Understanding molecular data can be useful for various fields of science, including biology, chemistry, and material science to name a few. In this report, XRD (X-Ray Diffraction) data and protein ligand binding affinity data is looked at and machine learning techniques are used to tackle classification and regression problems. Working with molecular data can be tricky. For machine learning models to work, all input data must be of the same shape. However, each protein-ligand complex consists of varying number and types of elements, which makes it challenging to come up with models that can capture the signals and information encoded in these different atoms, with their coordinates and physical properties.Similarly, it is difficult to fit XRD data into a machine learning model since different datasets can have different array sizes which makes it a challenge to come up with a method to consistently classify these inconsistent data. Bachelor of Science in Mathematical Sciences 2021-05-04T05:41:51Z 2021-05-04T05:41:51Z 2021 Final Year Project (FYP) Tan, C. H. (2021). Exploring machine learning methods on molecular data. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148528 https://hdl.handle.net/10356/148528 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics::Applied mathematics
spellingShingle Science::Mathematics::Applied mathematics
Tan, Chen Hui
Exploring machine learning methods on molecular data
description Understanding molecular data can be useful for various fields of science, including biology, chemistry, and material science to name a few. In this report, XRD (X-Ray Diffraction) data and protein ligand binding affinity data is looked at and machine learning techniques are used to tackle classification and regression problems. Working with molecular data can be tricky. For machine learning models to work, all input data must be of the same shape. However, each protein-ligand complex consists of varying number and types of elements, which makes it challenging to come up with models that can capture the signals and information encoded in these different atoms, with their coordinates and physical properties.Similarly, it is difficult to fit XRD data into a machine learning model since different datasets can have different array sizes which makes it a challenge to come up with a method to consistently classify these inconsistent data.
author2 Xia Kelin
author_facet Xia Kelin
Tan, Chen Hui
format Final Year Project
author Tan, Chen Hui
author_sort Tan, Chen Hui
title Exploring machine learning methods on molecular data
title_short Exploring machine learning methods on molecular data
title_full Exploring machine learning methods on molecular data
title_fullStr Exploring machine learning methods on molecular data
title_full_unstemmed Exploring machine learning methods on molecular data
title_sort exploring machine learning methods on molecular data
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/148528
_version_ 1759854686765056000