OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs

Motivation: Deep graph learning (DGL) has been widely employed in the realm of ligand-based virtual screening. Within this field, a key hurdle is the existence of activity cliffs (ACs), where minor chemical alterations can lead to significant changes in bioactivity. In response, several DGL models h...

Full description

Saved in:
Bibliographic Details
Main Authors: Yin, Yueming, Hu, Haifeng, Yang, Jitao, Ye, Chun, Goh, Wilson Wen Bin, Kong, Adams Wai Kin, Wu, Jiansheng
Other Authors: College of Computing and Data Science
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181726
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181726
record_format dspace
spelling sg-ntu-dr.10356-1817262024-12-16T04:30:54Z OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs Yin, Yueming Hu, Haifeng Yang, Jitao Ye, Chun Goh, Wilson Wen Bin Kong, Adams Wai Kin Wu, Jiansheng College of Computing and Data Science Lee Kong Chian School of Medicine (LKCMedicine) School of Biological Sciences Center for Biomedical Informatics Center for AI in Medicine Computer and Information Science Artificial neural network Deep learning Motivation: Deep graph learning (DGL) has been widely employed in the realm of ligand-based virtual screening. Within this field, a key hurdle is the existence of activity cliffs (ACs), where minor chemical alterations can lead to significant changes in bioactivity. In response, several DGL models have been developed to enhance ligand bioactivity prediction in the presence of ACs. Yet, there remains a largely unexplored opportunity within ACs for optimizing ligand bioactivity, making it an area ripe for further investigation. Results: We present a novel approach to simultaneously predict and optimize ligand bioactivities through DGL and ACs (OLB-AC). OLB-AC possesses the capability to optimize ligand molecules located near ACs, providing a direct reference for optimizing ligand bioactivities with the matching of original ligands. To accomplish this, a novel attentive graph reconstruction neural network and ligand optimization scheme are proposed. Attentive graph reconstruction neural network reconstructs original ligands and optimizes them through adversarial representations derived from their bioactivity prediction process. Experimental results on nine drug targets reveal that out of the 667 molecules generated through OLB-AC optimization on datasets comprising 974 low-activity, noninhibitor, or highly toxic ligands, 49 are recognized as known highly active, inhibitor, or nontoxic ligands beyond the datasets’ scope. The 27 out of 49 matched molecular pairs generated by OLB-AC reveal novel transformations not present in their training sets. The adversarial representations employed for ligand optimization originate from the gradients of bioactivity predictions. Therefore, we also assess OLB-AC’s prediction accuracy across 33 different bioactivity datasets. Results show that OLB-AC achieves the best Pearson correlation coefficient (r2) on 27/33 datasets, with an average improvement of 7.2%–22.9% against the state-of-the-art bioactivity prediction methods. Availability and implementation: The code and dataset developed in this work are available at github.com/Yueming-Yin/OLB-AC. Published version This work was supported in part by the National Natural Science Foundation of China under grant 62371245. 2024-12-16T04:30:54Z 2024-12-16T04:30:54Z 2024 Journal Article Yin, Y., Hu, H., Yang, J., Ye, C., Goh, W. W. B., Kong, A. W. K. & Wu, J. (2024). OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs. Bioinformatics, 40(6), e365-. https://dx.doi.org/10.1093/bioinformatics/btae365 1367-4803 https://hdl.handle.net/10356/181726 10.1093/bioinformatics/btae365 38889277 2-s2.0-85197387662 6 40 e365 en Bioinformatics © 2024 The Author(s). Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Artificial neural network
Deep learning
spellingShingle Computer and Information Science
Artificial neural network
Deep learning
Yin, Yueming
Hu, Haifeng
Yang, Jitao
Ye, Chun
Goh, Wilson Wen Bin
Kong, Adams Wai Kin
Wu, Jiansheng
OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
description Motivation: Deep graph learning (DGL) has been widely employed in the realm of ligand-based virtual screening. Within this field, a key hurdle is the existence of activity cliffs (ACs), where minor chemical alterations can lead to significant changes in bioactivity. In response, several DGL models have been developed to enhance ligand bioactivity prediction in the presence of ACs. Yet, there remains a largely unexplored opportunity within ACs for optimizing ligand bioactivity, making it an area ripe for further investigation. Results: We present a novel approach to simultaneously predict and optimize ligand bioactivities through DGL and ACs (OLB-AC). OLB-AC possesses the capability to optimize ligand molecules located near ACs, providing a direct reference for optimizing ligand bioactivities with the matching of original ligands. To accomplish this, a novel attentive graph reconstruction neural network and ligand optimization scheme are proposed. Attentive graph reconstruction neural network reconstructs original ligands and optimizes them through adversarial representations derived from their bioactivity prediction process. Experimental results on nine drug targets reveal that out of the 667 molecules generated through OLB-AC optimization on datasets comprising 974 low-activity, noninhibitor, or highly toxic ligands, 49 are recognized as known highly active, inhibitor, or nontoxic ligands beyond the datasets’ scope. The 27 out of 49 matched molecular pairs generated by OLB-AC reveal novel transformations not present in their training sets. The adversarial representations employed for ligand optimization originate from the gradients of bioactivity predictions. Therefore, we also assess OLB-AC’s prediction accuracy across 33 different bioactivity datasets. Results show that OLB-AC achieves the best Pearson correlation coefficient (r2) on 27/33 datasets, with an average improvement of 7.2%–22.9% against the state-of-the-art bioactivity prediction methods. Availability and implementation: The code and dataset developed in this work are available at github.com/Yueming-Yin/OLB-AC.
author2 College of Computing and Data Science
author_facet College of Computing and Data Science
Yin, Yueming
Hu, Haifeng
Yang, Jitao
Ye, Chun
Goh, Wilson Wen Bin
Kong, Adams Wai Kin
Wu, Jiansheng
format Article
author Yin, Yueming
Hu, Haifeng
Yang, Jitao
Ye, Chun
Goh, Wilson Wen Bin
Kong, Adams Wai Kin
Wu, Jiansheng
author_sort Yin, Yueming
title OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
title_short OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
title_full OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
title_fullStr OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
title_full_unstemmed OLB-AC: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
title_sort olb-ac: toward optimizing ligand bioactivities through deep graph learning and activity cliffs
publishDate 2024
url https://hdl.handle.net/10356/181726
_version_ 1819112989951787008