Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction

Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints...

Full description

Saved in:
Bibliographic Details
Main Authors: Liu, Xiang, Feng, Huitao, Wu, Jie, Xia, Kelin
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/160380
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-160380
record_format dspace
spelling sg-ntu-dr.10356-1603802022-07-20T06:25:00Z Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction Liu, Xiang Feng, Huitao Wu, Jie Xia, Kelin School of Physical and Mathematical Sciences Science::Mathematics Persistent Spectral Hypergraph Machine Learning Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know. Ministry of Education (MOE) Nanyang Technological University This work was supported in part by Nanyang Technological University Startup Grant M4081842 and Singapore Ministry of Education Academic Research fund Tier 1 RG109/19, Tier 2 MOE2018-T2-1-033. The third author was supported by Natural Science Foundation of China (NSFC grant no. 11971144) and High-level Scientific Research Foundation of Hebei Province. 2022-07-20T06:25:00Z 2022-07-20T06:25:00Z 2021 Journal Article Liu, X., Feng, H., Wu, J. & Xia, K. (2021). Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction. Briefings in Bioinformatics, 22(5), bbab127-. https://dx.doi.org/10.1093/bib/bbab127 1467-5463 https://hdl.handle.net/10356/160380 10.1093/bib/bbab127 33837771 2-s2.0-85111209696 5 22 bbab127 en M4081842 RG109/19 MOE2018-T2-1-033 Briefings in Bioinformatics © 2021 The authors. Published by Oxford University Press. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics
Persistent Spectral Hypergraph
Machine Learning
spellingShingle Science::Mathematics
Persistent Spectral Hypergraph
Machine Learning
Liu, Xiang
Feng, Huitao
Wu, Jie
Xia, Kelin
Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
description Molecular descriptors are essential to not only quantitative structure activity/property relationship (QSAR/QSPR) models, but also machine learning based chemical and biological data analysis. In this paper, we propose persistent spectral hypergraph (PSH) based molecular descriptors or fingerprints for the first time. Our PSH-based molecular descriptors are used in the characterization of molecular structures and interactions, and further combined with machine learning models, in particular gradient boosting tree (GBT), for protein-ligand binding affinity prediction. Different from traditional molecular descriptors, which are usually based on molecular graph models, a hypergraph-based topological representation is proposed for protein-ligand interaction characterization. Moreover, a filtration process is introduced to generate a series of nested hypergraphs in different scales. For each of these hypergraphs, its eigen spectrum information can be obtained from the corresponding (Hodge) Laplacain matrix. PSH studies the persistence and variation of the eigen spectrum of the nested hypergraphs during the filtration process. Molecular descriptors or fingerprints can be generated from persistent attributes, which are statistical or combinatorial functions of PSH, and combined with machine learning models, in particular, GBT. We test our PSH-GBT model on three most commonly used datasets, including PDBbind-2007, PDBbind-2013 and PDBbind-2016. Our results, for all these databases, are better than all existing machine learning models with traditional molecular descriptors, as far as we know.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Liu, Xiang
Feng, Huitao
Wu, Jie
Xia, Kelin
format Article
author Liu, Xiang
Feng, Huitao
Wu, Jie
Xia, Kelin
author_sort Liu, Xiang
title Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
title_short Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
title_full Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
title_fullStr Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
title_full_unstemmed Persistent spectral hypergraph based machine learning (PSH-ML) for protein-ligand binding affinity prediction
title_sort persistent spectral hypergraph based machine learning (psh-ml) for protein-ligand binding affinity prediction
publishDate 2022
url https://hdl.handle.net/10356/160380
_version_ 1739837463394779136