Improving the performance of models for one-step retrosynthesis through re-ranking

Bibliographic Details
Main Authors:	Lin, Min Htoo; Tu, Zhengkai; Coley, Connor W.
Other Authors:	School of Physical and Mathematical Sciences
Format:	Article
Language:	English
Published:	2022
Subjects:	Science::Chemistry; Computer-Aided Synthesis Planning; Energy-Based Model
Online Access:	https://hdl.handle.net/10356/163080
Institution:	Nanyang Technological University

Description
Summary:	Retrosynthesis is at the core of organic chemistry. Recently, the rapid growth of artificial intelligence (AI) has spurred a variety of novel machine learning approaches for data-driven synthesis planning. These methods learn complex patterns from reaction databases in order to predict, for a given product, sets of reactants that can be used to synthesise that product. However, their performance, as measured by the top-N accuracy in matching published reaction precedents, still leaves room for improvement. This work aims to enhance these models by learning to re-rank their reactant predictions. Specifically, we design and train an energy-based model that, for each product, ranks the published reaction as the top suggestion and the remaining reactant predictions below it. On the standard USPTO-50k benchmark dataset, we show that re-ranking can significantly improve one-step models: RetroSim, a similarity-based method, improves from 35.7% to 51.8% top-1 accuracy, and NeuralSym, a deep learning method, from 45.7% to 51.3%. We also show that re-ranking the union of two models' suggestions can lead to better performance than either model alone. However, the state-of-the-art top-1 accuracy is not improved by this method.
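
The re-ranking idea in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation: the energy function is a hypothetical stand-in for a trained energy-based model, the candidate labels are placeholders rather than real reactant sets, and all function names are assumptions made for illustration. The sketch assumes the usual convention that lower energy means a more plausible candidate.

```python
import math
from typing import Callable, List, Sequence


def union_candidates(list_a: Sequence[str], list_b: Sequence[str]) -> List[str]:
    """Merge two models' suggestions, dropping duplicates but keeping first-seen order."""
    return list(dict.fromkeys(list(list_a) + list(list_b)))


def rerank(candidates: Sequence[str], energy: Callable[[str], float]) -> List[str]:
    """Sort candidates by ascending energy (assumed: lower energy = better)."""
    return sorted(candidates, key=energy)


def boltzmann_probs(energies: Sequence[float]) -> List[float]:
    """Softmax over negative energies, shifted by the minimum for numerical stability."""
    lowest = min(energies)
    weights = [math.exp(-(e - lowest)) for e in energies]
    total = sum(weights)
    return [w / total for w in weights]


def top_n_accuracy(ranked_lists: Sequence[Sequence[str]],
                   truths: Sequence[str], n: int) -> float:
    """Fraction of products whose published reactant set appears in the top n."""
    hits = sum(1 for ranked, truth in zip(ranked_lists, truths) if truth in ranked[:n])
    return hits / len(truths)


# Toy energies standing in for a trained model's output (hypothetical values).
toy_energy = {"cand_a": 2.0, "cand_b": 0.5, "cand_c": 1.0}

# Union of two hypothetical one-step models' suggestions, then re-ranking.
merged = union_candidates(["cand_a", "cand_b"], ["cand_b", "cand_c"])
ranked = rerank(merged, toy_energy.get)
```

In this toy setup, a model that originally placed the published reaction second can have it promoted to first if the energy function scores it lowest, which is the mechanism by which re-ranking could raise top-1 accuracy.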

Similar Items