Prediction of G4 formation in live cells with epigenetic data: a deep learning approach

G-quadruplexes (G4s) are secondary structures abundant in DNA that may play regulatory roles in cells. Despite the ubiquity of the putative G-quadruplex-forming sequences (PQS) in the human genome, only a small fraction forms G4 structures in cells. Folded G4, histone methylation and chromatin acces...

Full description

Saved in:
Bibliographic Details
Main Authors: Korsakova, Anna, Phan, Anh Tuân
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/170987
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-170987
record_format dspace
spelling sg-ntu-dr.10356-1709872023-10-16T15:35:49Z Prediction of G4 formation in live cells with epigenetic data: a deep learning approach Korsakova, Anna Phan, Anh Tuân School of Physical and Mathematical Sciences NTU Institute of Structural Biology Science::Biological sciences Deep Learning Epigenetics G-quadruplexes (G4s) are secondary structures abundant in DNA that may play regulatory roles in cells. Despite the ubiquity of the putative G-quadruplex-forming sequences (PQS) in the human genome, only a small fraction forms G4 structures in cells. Folded G4, histone methylation and chromatin accessibility are all parts of the complex cis regulatory landscape. We propose an approach for prediction of G4 formation in cells that incorporates epigenetic and chromatin accessibility data. The novel approach termed epiG4NN efficiently predicts cell-specific G4 formation in live cells based on a local epigenomic snapshot. Our results confirm the close relationship between H3K4me3 histone methylation, chromatin accessibility and G4 structure formation. Trained on A549 cell data, epiG4NN was then able to predict G4 formation in HEK293T and K562 cell lines. We observe the dependency of model performance with different epigenetic features on the underlying experimental condition of G4 detection. We expect that this approach will contribute to the systematic understanding of correlations between structural and epigenomic feature landscape. Nanyang Technological University Published version Funding: Nanyang Technological University (NTU Singapore) grants (to A.T.P.). Funding for open access charge: Nanyang Technological University. 2023-10-10T01:54:16Z 2023-10-10T01:54:16Z 2023 Journal Article Korsakova, A. & Phan, A. T. (2023). Prediction of G4 formation in live cells with epigenetic data: a deep learning approach. NAR Genomics and Bioinformatics, 5(3), 1-12. https://dx.doi.org/10.1093/nargab/lqad071 2631-9268 https://hdl.handle.net/10356/170987 10.1093/nargab/lqad071 37636021 2-s2.0-85170433719 3 5 1 12 en NAR Genomics and Bioinformatics © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
Deep Learning
Epigenetics
spellingShingle Science::Biological sciences
Deep Learning
Epigenetics
Korsakova, Anna
Phan, Anh Tuân
Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
description G-quadruplexes (G4s) are secondary structures abundant in DNA that may play regulatory roles in cells. Despite the ubiquity of the putative G-quadruplex-forming sequences (PQS) in the human genome, only a small fraction forms G4 structures in cells. Folded G4, histone methylation and chromatin accessibility are all parts of the complex cis regulatory landscape. We propose an approach for prediction of G4 formation in cells that incorporates epigenetic and chromatin accessibility data. The novel approach termed epiG4NN efficiently predicts cell-specific G4 formation in live cells based on a local epigenomic snapshot. Our results confirm the close relationship between H3K4me3 histone methylation, chromatin accessibility and G4 structure formation. Trained on A549 cell data, epiG4NN was then able to predict G4 formation in HEK293T and K562 cell lines. We observe the dependency of model performance with different epigenetic features on the underlying experimental condition of G4 detection. We expect that this approach will contribute to the systematic understanding of correlations between structural and epigenomic feature landscape.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Korsakova, Anna
Phan, Anh Tuân
format Article
author Korsakova, Anna
Phan, Anh Tuân
author_sort Korsakova, Anna
title Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_short Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_full Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_fullStr Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_full_unstemmed Prediction of G4 formation in live cells with epigenetic data: a deep learning approach
title_sort prediction of g4 formation in live cells with epigenetic data: a deep learning approach
publishDate 2023
url https://hdl.handle.net/10356/170987
_version_ 1781793728148537344