Molecular generation using gated graph convolutional neural networks and reinforcement learning

The design of molecules with bespoke chemical properties has wide-ranging applications in materials science, chemistry and drug-discovery. This can be formulated as a supervised learning problem, where we first seek to encode discrete molecular graphs to continuous latent representations, and then u...

Full description

Saved in:
Bibliographic Details
Main Author: Divyansh, Gupta
Other Authors: Xavier Bresson
Format: Final Year Project
Language:English
Published: 2019
Subjects:
Online Access:http://hdl.handle.net/10356/76936
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-76936
record_format dspace
spelling sg-ntu-dr.10356-769362023-03-03T20:55:41Z Molecular generation using gated graph convolutional neural networks and reinforcement learning Divyansh, Gupta Xavier Bresson School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering The design of molecules with bespoke chemical properties has wide-ranging applications in materials science, chemistry and drug-discovery. This can be formulated as a supervised learning problem, where we first seek to encode discrete molecular graphs to continuous latent representations, and then use gradient-based optimization methods on these representations to optimize for the desired chemical properties. Recently, techniques such as Graph Convolutional Neural Networks (G-CNNs) and Message Passing Neural Networks (MPNNs) have been developed, which use deep learning methods to encode discrete graphs to continuous latent representations. In this project, we seek to attack the problem of molecular optimization with a twin- pronged approach of improving both the encoding technique and the optimization method. For this purpose, we build upon an existing state-of-the-art architecture called Junction Tree Variational Autoencoder (JT-VAE), which learns continuous latent vector representations for molecular graphs. These latent vector representations are then indirectly used for gradient-based optimization methods to improve chemical properties. JT-VAE makes use of MPNNs in its architecture. For the first part of our approach, we make use of a powerful variant of G-CNNs called, Gated-Graph Convolutional Neural Networks (GG-CNNs). We objectively demonstrate the efficacy of GG-CNNs over existing MPNN architectures in producing smooth and meaningful representations for molecular graphs. This is accomplished by replacing MPNN with GG-CNN in the JT-VAE architecture and then benchmarking their performance on various tasks. For the second part of our approach, we incorporate the reinforcement learning approach of Deep Deterministic Policy Gradient (DDPG) in the combined JT-VAE and GG-CNN architecture. Thus, we present a novel architecture incorporating GG-CNNs and DDPG on top of the JT-VAE architecture for purposes of molecular optimization. We perform all our experiments on the benchmark QM9 dataset, which contains 133,885 organic compounds having up to 9 heavy atoms. Bachelor of Engineering (Computer Science) 2019-04-24T14:15:09Z 2019-04-24T14:15:09Z 2019 Final Year Project (FYP) http://hdl.handle.net/10356/76936 en Nanyang Technological University 72 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Divyansh, Gupta
Molecular generation using gated graph convolutional neural networks and reinforcement learning
description The design of molecules with bespoke chemical properties has wide-ranging applications in materials science, chemistry and drug-discovery. This can be formulated as a supervised learning problem, where we first seek to encode discrete molecular graphs to continuous latent representations, and then use gradient-based optimization methods on these representations to optimize for the desired chemical properties. Recently, techniques such as Graph Convolutional Neural Networks (G-CNNs) and Message Passing Neural Networks (MPNNs) have been developed, which use deep learning methods to encode discrete graphs to continuous latent representations. In this project, we seek to attack the problem of molecular optimization with a twin- pronged approach of improving both the encoding technique and the optimization method. For this purpose, we build upon an existing state-of-the-art architecture called Junction Tree Variational Autoencoder (JT-VAE), which learns continuous latent vector representations for molecular graphs. These latent vector representations are then indirectly used for gradient-based optimization methods to improve chemical properties. JT-VAE makes use of MPNNs in its architecture. For the first part of our approach, we make use of a powerful variant of G-CNNs called, Gated-Graph Convolutional Neural Networks (GG-CNNs). We objectively demonstrate the efficacy of GG-CNNs over existing MPNN architectures in producing smooth and meaningful representations for molecular graphs. This is accomplished by replacing MPNN with GG-CNN in the JT-VAE architecture and then benchmarking their performance on various tasks. For the second part of our approach, we incorporate the reinforcement learning approach of Deep Deterministic Policy Gradient (DDPG) in the combined JT-VAE and GG-CNN architecture. Thus, we present a novel architecture incorporating GG-CNNs and DDPG on top of the JT-VAE architecture for purposes of molecular optimization. We perform all our experiments on the benchmark QM9 dataset, which contains 133,885 organic compounds having up to 9 heavy atoms.
author2 Xavier Bresson
author_facet Xavier Bresson
Divyansh, Gupta
format Final Year Project
author Divyansh, Gupta
author_sort Divyansh, Gupta
title Molecular generation using gated graph convolutional neural networks and reinforcement learning
title_short Molecular generation using gated graph convolutional neural networks and reinforcement learning
title_full Molecular generation using gated graph convolutional neural networks and reinforcement learning
title_fullStr Molecular generation using gated graph convolutional neural networks and reinforcement learning
title_full_unstemmed Molecular generation using gated graph convolutional neural networks and reinforcement learning
title_sort molecular generation using gated graph convolutional neural networks and reinforcement learning
publishDate 2019
url http://hdl.handle.net/10356/76936
_version_ 1759854841450987520