Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/160380 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-160380 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1603802022-07-20T06:25:00Z Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction Liu, Xiang Feng, Huitao Wu, Jie Xia, Kelin School of Physical and Mathematical Sciences Science::Mathematics Persistent Spectral Hypergraph Machine Learning Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know. Ministry of Education (MOE) Nanyang Technological University This work was supported in part by Nanyang Technological University Startup Grant M4081842 and Singapore Ministry of Education Academic Research fund Tier 1 RG109/19, Tier 2 MOE2018-T2-1-033. The third author was supported by Natural Science Foundation of China (NSFC grant no. 11971144) and High-level Scientific Research Foundation of Hebei Province. 2022-07-20T06:25:00Z 2022-07-20T06:25:00Z 2021 Journal Article Liu, X., Feng, H., Wu, J. & Xia, K. (2021). Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction. Briefings in Bioinformatics, 22(5), bbab127-. https://dx.doi.org/10.1093/bib/bbab127 1467-5463 https://hdl.handle.net/10356/160380 10.1093/bib/bbab127 33837771 2-s2.0-85111209696 5 22 bbab127 en M4081842 RG109/19 MOE2018-T2-1-033 Briefings in Bioinformatics © 2021 The authors. Published by Oxford University Press. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Mathematics Persistent Spectral Hypergraph Machine Learning |
spellingShingle |
Science::Mathematics Persistent Spectral Hypergraph Machine Learning Liu, Xiang Feng, Huitao Wu, Jie Xia, Kelin Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction |
description |
Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know. |
author2 |
School of Physical and Mathematical Sciences |
author_facet |
School of Physical and Mathematical Sciences Liu, Xiang Feng, Huitao Wu, Jie Xia, Kelin |
format |
Article |
author |
Liu, Xiang Feng, Huitao Wu, Jie Xia, Kelin |
author_sort |
Liu, Xiang |
title |
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction |
title_short |
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction |
title_full |
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction |
title_fullStr |
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction |
title_full_unstemmed |
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction |
title_sort |
persistent spectral hypergraph based machine learning (psh-ml) for protein-ligand binding affinity prediction |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/160380 |
_version_ |
1739837463394779136 |