Multiscale persistent functions for biomolecular structure characterization

In this paper, we introduce multiscale persistent functions for biomolecular structure characterization. The essential idea is to combine our multiscale rigidity functions (MRFs) with persistent homology analysis, so as to construct a series of multiscale persistent functions, particularly multiscal...

Full description

Saved in:
Bibliographic Details
Main Authors: Xia, Kelin, Li, Zhiming, Mu, Lin
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/141285
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-141285
record_format dspace
spelling sg-ntu-dr.10356-1412852020-06-05T08:26:00Z Multiscale persistent functions for biomolecular structure characterization Xia, Kelin Li, Zhiming Mu, Lin School of Biological Sciences School of Physical and Mathematical Sciences Science::Chemistry Conformational Entropy Persistent Entropy In this paper, we introduce multiscale persistent functions for biomolecular structure characterization. The essential idea is to combine our multiscale rigidity functions (MRFs) with persistent homology analysis, so as to construct a series of multiscale persistent functions, particularly multiscale persistent entropies, for structure characterization. To clarify the fundamental idea of our method, the multiscale persistent entropy (MPE) model is discussed in great detail. Mathematically, unlike the previous persistent entropy (Chintakunta et al. in Pattern Recognit 48(2):391-401, 2015; Merelli et al. in Entropy 17(10):6872-6892, 2015; Rucco et al. in: Proceedings of ECCS 2014, Springer, pp 117-128, 2016), a special resolution parameter is incorporated into our model. Various scales can be achieved by tuning its value. Physically, our MPE can be used in conformational entropy evaluation. More specifically, it is found that our method incorporates in it a natural classification scheme. This is achieved through a density filtration of an MRF built from angular distributions. To further validate our model, a systematical comparison with the traditional entropy evaluation model is done. It is found that our model is able to preserve the intrinsic topological features of biomolecular data much better than traditional approaches, particularly for resolutions in the intermediate range. Moreover, by comparing with traditional entropies from various grid sizes, bond angle-based methods and a persistent homology-based support vector machine method (Cang et al. in Mol Based Math Biol 3:140-162, 2015), we find that our MPE method gives the best results in terms of average true positive rate in a classic protein structure classification test. More interestingly, all-alpha and all-beta protein classes can be clearly separated from each other with zero error only in our model. Finally, a special protein structure index (PSI) is proposed, for the first time, to describe the "regularity" of protein structures. Basically, a protein structure is deemed as regular if it has a consistent and orderly configuration. Our PSI model is tested on a database of 110 proteins; we find that structures with larger portions of loops and intrinsically disorder regions are always associated with larger PSI, meaning an irregular configuration, while proteins with larger portions of secondary structures, i.e., alpha-helix or beta-sheet, have smaller PSI. Essentially, PSI can be used to describe the "regularity" information in any systems. MOE (Min. of Education, S’pore) 2020-06-05T08:26:00Z 2020-06-05T08:26:00Z 2018 Journal Article Xia, K., Li, Z., & Mu, L. (2018). Multiscale persistent functions for biomolecular structure characterization. Bulletin of mathematical biology, 80(1), 1–31. doi:10.1007/s11538-017-0362-6 0092-8240 https://hdl.handle.net/10356/141285 10.1007/s11538-017-0362-6 29098540 2-s2.0-85032945288 1 80 1 31 en Bulletin of mathematical biology © 2017 Society for Mathematical Biology. All rights reserved.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Science::Chemistry
Conformational Entropy
Persistent Entropy
spellingShingle Science::Chemistry
Conformational Entropy
Persistent Entropy
Xia, Kelin
Li, Zhiming
Mu, Lin
Multiscale persistent functions for biomolecular structure characterization
description In this paper, we introduce multiscale persistent functions for biomolecular structure characterization. The essential idea is to combine our multiscale rigidity functions (MRFs) with persistent homology analysis, so as to construct a series of multiscale persistent functions, particularly multiscale persistent entropies, for structure characterization. To clarify the fundamental idea of our method, the multiscale persistent entropy (MPE) model is discussed in great detail. Mathematically, unlike the previous persistent entropy (Chintakunta et al. in Pattern Recognit 48(2):391-401, 2015; Merelli et al. in Entropy 17(10):6872-6892, 2015; Rucco et al. in: Proceedings of ECCS 2014, Springer, pp 117-128, 2016), a special resolution parameter is incorporated into our model. Various scales can be achieved by tuning its value. Physically, our MPE can be used in conformational entropy evaluation. More specifically, it is found that our method incorporates in it a natural classification scheme. This is achieved through a density filtration of an MRF built from angular distributions. To further validate our model, a systematical comparison with the traditional entropy evaluation model is done. It is found that our model is able to preserve the intrinsic topological features of biomolecular data much better than traditional approaches, particularly for resolutions in the intermediate range. Moreover, by comparing with traditional entropies from various grid sizes, bond angle-based methods and a persistent homology-based support vector machine method (Cang et al. in Mol Based Math Biol 3:140-162, 2015), we find that our MPE method gives the best results in terms of average true positive rate in a classic protein structure classification test. More interestingly, all-alpha and all-beta protein classes can be clearly separated from each other with zero error only in our model. Finally, a special protein structure index (PSI) is proposed, for the first time, to describe the "regularity" of protein structures. Basically, a protein structure is deemed as regular if it has a consistent and orderly configuration. Our PSI model is tested on a database of 110 proteins; we find that structures with larger portions of loops and intrinsically disorder regions are always associated with larger PSI, meaning an irregular configuration, while proteins with larger portions of secondary structures, i.e., alpha-helix or beta-sheet, have smaller PSI. Essentially, PSI can be used to describe the "regularity" information in any systems.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Xia, Kelin
Li, Zhiming
Mu, Lin
format Article
author Xia, Kelin
Li, Zhiming
Mu, Lin
author_sort Xia, Kelin
title Multiscale persistent functions for biomolecular structure characterization
title_short Multiscale persistent functions for biomolecular structure characterization
title_full Multiscale persistent functions for biomolecular structure characterization
title_fullStr Multiscale persistent functions for biomolecular structure characterization
title_full_unstemmed Multiscale persistent functions for biomolecular structure characterization
title_sort multiscale persistent functions for biomolecular structure characterization
publishDate 2020
url https://hdl.handle.net/10356/141285
_version_ 1681058617072025600