Conformational ensemble modeling of multi-domain proteins with experimental data

The structural variation of multi-domain proteins with flexible components mediates many biological processes. Their structure determination is crucial for understanding the mechanism of their interactions and functions, but it remains a challenge to derive the structure ensemble of flexible biomole...

Full description

Saved in:
Bibliographic Details
Main Author: Zhu, Guanhua
Other Authors: Lu Lanyuan
Format: Theses and Dissertations
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/72807
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-72807
record_format dspace
spelling sg-ntu-dr.10356-728072023-02-28T18:48:53Z Conformational ensemble modeling of multi-domain proteins with experimental data Zhu, Guanhua Lu Lanyuan School of Biological Sciences DRNTU::Science::Biological sciences The structural variation of multi-domain proteins with flexible components mediates many biological processes. Their structure determination is crucial for understanding the mechanism of their interactions and functions, but it remains a challenge to derive the structure ensemble of flexible biomolecules solely from experimental data. Computational methods assist in the interpretation of the time- and spatial-averaged experimental data for a correct structural characterization. A conformational ensemble can be determined by selecting a weighted combination of representative structures from a given structure pool, yielding the best fit to experimental measurements. However, a raw numerical data fitting causes the over-interpretation of the experimental data, which would recover the structure ensemble of wrong conformations. In our study, we have employed the idea of hybrid models to simulate multi-domain proteins. The intra-domain structures are modeled with the structure-based potential and the inter-domain interactions are characterized with the physics-based energies. For ensemble construction with experimental data, structure energy is proposed as a physics-based regularization to minimize the problem of data over-fitting. The absence of energy regularization exposes ensemble construction to the noise from high-energy structures. We have demonstrated the importance of the physics-based regularization to interpret experimental data to avoid arbitrary and spurious conformational representations. We have implemented our computational model and ensemble optimization to two systems of multi-domain proteins on the coarse-grained and atomistic levels respectively. Firstly, we simulate the dengue virus 2 (DENV2) non-structural protein 5 (NS5) at the coarse-grained level and construct its conformational ensemble with experimental small-angle X-ray scattering (SAXS) data. The energy regularization assists to recover a conformational ensemble from the SAXS profile, which successfully reveals the domain-domain orientation and domain contacting interface of NS5. Moreover, we applied the computational modeling scheme to a di-domain complex of a poly(U)-binding protein (Pub1), followed by ensemble construction with its experimental paramagnetic relaxation enhancement (PRE) data that correspond to inter-atomic distance restraints. A conformational ensemble of Pub1 has been recovered from the low-energy structures and with the ensemble-size restraint. This ensemble is in good agreement with experimental PRE rates and also supported by the experimental data of chemical shift perturbations. ​Doctor of Philosophy (SBS) 2017-11-23T04:14:22Z 2017-11-23T04:14:22Z 2017 Thesis Zhu, G. (2017). Conformational ensemble modeling of multi-domain proteins with experimental data. Doctoral thesis, Nanyang Technological University, Singapore. http://hdl.handle.net/10356/72807 10.32657/10356/72807 en 153 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Science::Biological sciences
spellingShingle DRNTU::Science::Biological sciences
Zhu, Guanhua
Conformational ensemble modeling of multi-domain proteins with experimental data
description The structural variation of multi-domain proteins with flexible components mediates many biological processes. Their structure determination is crucial for understanding the mechanism of their interactions and functions, but it remains a challenge to derive the structure ensemble of flexible biomolecules solely from experimental data. Computational methods assist in the interpretation of the time- and spatial-averaged experimental data for a correct structural characterization. A conformational ensemble can be determined by selecting a weighted combination of representative structures from a given structure pool, yielding the best fit to experimental measurements. However, a raw numerical data fitting causes the over-interpretation of the experimental data, which would recover the structure ensemble of wrong conformations. In our study, we have employed the idea of hybrid models to simulate multi-domain proteins. The intra-domain structures are modeled with the structure-based potential and the inter-domain interactions are characterized with the physics-based energies. For ensemble construction with experimental data, structure energy is proposed as a physics-based regularization to minimize the problem of data over-fitting. The absence of energy regularization exposes ensemble construction to the noise from high-energy structures. We have demonstrated the importance of the physics-based regularization to interpret experimental data to avoid arbitrary and spurious conformational representations. We have implemented our computational model and ensemble optimization to two systems of multi-domain proteins on the coarse-grained and atomistic levels respectively. Firstly, we simulate the dengue virus 2 (DENV2) non-structural protein 5 (NS5) at the coarse-grained level and construct its conformational ensemble with experimental small-angle X-ray scattering (SAXS) data. The energy regularization assists to recover a conformational ensemble from the SAXS profile, which successfully reveals the domain-domain orientation and domain contacting interface of NS5. Moreover, we applied the computational modeling scheme to a di-domain complex of a poly(U)-binding protein (Pub1), followed by ensemble construction with its experimental paramagnetic relaxation enhancement (PRE) data that correspond to inter-atomic distance restraints. A conformational ensemble of Pub1 has been recovered from the low-energy structures and with the ensemble-size restraint. This ensemble is in good agreement with experimental PRE rates and also supported by the experimental data of chemical shift perturbations.
author2 Lu Lanyuan
author_facet Lu Lanyuan
Zhu, Guanhua
format Theses and Dissertations
author Zhu, Guanhua
author_sort Zhu, Guanhua
title Conformational ensemble modeling of multi-domain proteins with experimental data
title_short Conformational ensemble modeling of multi-domain proteins with experimental data
title_full Conformational ensemble modeling of multi-domain proteins with experimental data
title_fullStr Conformational ensemble modeling of multi-domain proteins with experimental data
title_full_unstemmed Conformational ensemble modeling of multi-domain proteins with experimental data
title_sort conformational ensemble modeling of multi-domain proteins with experimental data
publishDate 2017
url http://hdl.handle.net/10356/72807
_version_ 1759857708240994304