MCI-frcnn: a deep learning method for topological micro-domain boundary detection

Chromatin structural domains, or topologically associated domains (TADs), are a general organizing principle in chromatin biology. RNA polymerase II (RNAPII) mediates multiple chromatin interactive loops, tethering together as RNAPII-associated chromatin interaction domains (RAIDs) to offer a framew...

Full description

Saved in:
Bibliographic Details
Main Authors: Tian, Simon Zhongyuan, Yin, Pengfei, Jing, Kai, Yang, Yang, Xu, Yewen, Huang, Guangyu, Ning, Duo, Fullwood, Melissa Jane, Zheng, Meizhen
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/164518
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-164518
record_format dspace
spelling sg-ntu-dr.10356-1645182023-02-28T17:13:35Z MCI-frcnn: a deep learning method for topological micro-domain boundary detection Tian, Simon Zhongyuan Yin, Pengfei Jing, Kai Yang, Yang Xu, Yewen Huang, Guangyu Ning, Duo Fullwood, Melissa Jane Zheng, Meizhen School of Biological Sciences Cancer Science Institute of Singapore, NUS Institute of Molecular and Cell Biology, A*STAR Science::Biological sciences Deep Learning Topological Micro-Domain Chromatin structural domains, or topologically associated domains (TADs), are a general organizing principle in chromatin biology. RNA polymerase II (RNAPII) mediates multiple chromatin interactive loops, tethering together as RNAPII-associated chromatin interaction domains (RAIDs) to offer a framework for gene regulation. RAID and TAD alterations have been found to be associated with diseases. They can be further dissected as micro-domains (micro-TADs and micro-RAIDs) by clustering single-molecule chromatin-interactive complexes from next-generation three-dimensional (3D) genome techniques, such as ChIA-Drop. Currently, there are few tools available for micro-domain boundary identification. In this work, we developed the MCI-frcnn deep learning method to train a Faster Region-based Convolutional Neural Network (Faster R-CNN) for micro-domain boundary detection. At the training phase in MCI-frcnn, 50 images of RAIDs from Drosophila RNAPII ChIA-Drop data, containing 261 micro-RAIDs with ground truth boundaries, were trained for 7 days. Using this well-trained MCI-frcnn, we detected micro-RAID boundaries for the input new images, with a fast speed (5.26 fps), high recognition accuracy (AUROC = 0.85, mAP = 0.69), and high boundary region quantification (genomic IoU = 76%). We further applied MCI-frcnn to detect human micro-TADs boundaries using human GM12878 SPRITE data and obtained a high region quantification score (mean gIoU = 85%). In all, the MCI-frcnn deep learning method which we developed in this work is a general tool for micro-domain boundary detection. Ministry of Education (MOE) National Research Foundation (NRF) Published version This work was supported by grants from the National Natural Science Foundation of China (32170644), the National Key R&D Program of China (20222YFC3400400), the Shenzhen Fundamental Research Programme (JCYJ20220530115211026), and the Shenzhen Innovation Committee of Science and Technology (ZDSYS20200811144002008). MF is supported by the National Research Foundation Singapore and the Singapore Ministry of Education under its Research Centres of Excellence initiative and by a Ministry of Education Tier II grant awarded to MF (T2EP30120- 0020). 2023-01-30T06:50:21Z 2023-01-30T06:50:21Z 2022 Journal Article Tian, S. Z., Yin, P., Jing, K., Yang, Y., Xu, Y., Huang, G., Ning, D., Fullwood, M. J. & Zheng, M. (2022). MCI-frcnn: a deep learning method for topological micro-domain boundary detection. Frontiers in Cell and Developmental Biology, 10, 1050769-. https://dx.doi.org/10.3389/fcell.2022.1050769 2296-634X https://hdl.handle.net/10356/164518 10.3389/fcell.2022.1050769 36531953 2-s2.0-85144088390 10 1050769 en T2EP30120- 0020 Frontiers in Cell and Developmental Biology © 2022 Tian, Yin, Jing, Yang, Xu, Huang, Ning, Fullwood and Zheng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
Deep Learning
Topological Micro-Domain
spellingShingle Science::Biological sciences
Deep Learning
Topological Micro-Domain
Tian, Simon Zhongyuan
Yin, Pengfei
Jing, Kai
Yang, Yang
Xu, Yewen
Huang, Guangyu
Ning, Duo
Fullwood, Melissa Jane
Zheng, Meizhen
MCI-frcnn: a deep learning method for topological micro-domain boundary detection
description Chromatin structural domains, or topologically associated domains (TADs), are a general organizing principle in chromatin biology. RNA polymerase II (RNAPII) mediates multiple chromatin interactive loops, tethering together as RNAPII-associated chromatin interaction domains (RAIDs) to offer a framework for gene regulation. RAID and TAD alterations have been found to be associated with diseases. They can be further dissected as micro-domains (micro-TADs and micro-RAIDs) by clustering single-molecule chromatin-interactive complexes from next-generation three-dimensional (3D) genome techniques, such as ChIA-Drop. Currently, there are few tools available for micro-domain boundary identification. In this work, we developed the MCI-frcnn deep learning method to train a Faster Region-based Convolutional Neural Network (Faster R-CNN) for micro-domain boundary detection. At the training phase in MCI-frcnn, 50 images of RAIDs from Drosophila RNAPII ChIA-Drop data, containing 261 micro-RAIDs with ground truth boundaries, were trained for 7 days. Using this well-trained MCI-frcnn, we detected micro-RAID boundaries for the input new images, with a fast speed (5.26 fps), high recognition accuracy (AUROC = 0.85, mAP = 0.69), and high boundary region quantification (genomic IoU = 76%). We further applied MCI-frcnn to detect human micro-TADs boundaries using human GM12878 SPRITE data and obtained a high region quantification score (mean gIoU = 85%). In all, the MCI-frcnn deep learning method which we developed in this work is a general tool for micro-domain boundary detection.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Tian, Simon Zhongyuan
Yin, Pengfei
Jing, Kai
Yang, Yang
Xu, Yewen
Huang, Guangyu
Ning, Duo
Fullwood, Melissa Jane
Zheng, Meizhen
format Article
author Tian, Simon Zhongyuan
Yin, Pengfei
Jing, Kai
Yang, Yang
Xu, Yewen
Huang, Guangyu
Ning, Duo
Fullwood, Melissa Jane
Zheng, Meizhen
author_sort Tian, Simon Zhongyuan
title MCI-frcnn: a deep learning method for topological micro-domain boundary detection
title_short MCI-frcnn: a deep learning method for topological micro-domain boundary detection
title_full MCI-frcnn: a deep learning method for topological micro-domain boundary detection
title_fullStr MCI-frcnn: a deep learning method for topological micro-domain boundary detection
title_full_unstemmed MCI-frcnn: a deep learning method for topological micro-domain boundary detection
title_sort mci-frcnn: a deep learning method for topological micro-domain boundary detection
publishDate 2023
url https://hdl.handle.net/10356/164518
_version_ 1759858386492456960