RNA alternative splicing prediction with discrete compositional energy network
A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's prima...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/155091 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-155091 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1550912023-02-28T19:17:54Z RNA alternative splicing prediction with discrete compositional energy network Chan, Alvin Korsakova, Anna Ong, Yew-Soon Winnerdy, Fernaldo Richtia Lim, Kah Wai Phan, Anh Tuan School of Physical and Mathematical Sciences Proceedings of the Conference on Health, Inference, and Learning (CHIL '21) Computer and Information Science Medicine, Health and Life Sciences RNA Splicing Prediction RNA Binding Protein A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose discrete compositional energy network (DCEN) which leverages the hierarchical relationships between splice sites, junctions and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through its constituent splice junctions' energy values. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD1, we show that DCEN outperforms baselines and ablation variants. Nanyang Technological University National Research Foundation (NRF) Published version This work is supported by the Data Science and Artificial Intelligence Research Center (DSAIR), the School of Computer Science and Engineering at Nanyang Technological University and the Singapore National Research Foundation Investigatorship (NRFNRFI2017-09). 2022-02-11T05:34:35Z 2022-02-11T05:34:35Z 2021 Conference Paper Chan, A., Korsakova, A., Ong, Y., Winnerdy, F. R., Lim, K. W. & Phan, A. T. (2021). RNA alternative splicing prediction with discrete compositional energy network. Proceedings of the Conference on Health, Inference, and Learning (CHIL '21), 193-203. https://dx.doi.org/10.1145/3450439.3451857 9781450383592 https://hdl.handle.net/10356/155091 10.1145/3450439.3451857 2-s2.0-85104095601 193 203 en NRFNRFI2017-09 10.21979/N9/FFN0XH © 2021 Copyright held by the owner/author(s). Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Medicine, Health and Life Sciences RNA Splicing Prediction RNA Binding Protein |
spellingShingle |
Computer and Information Science Medicine, Health and Life Sciences RNA Splicing Prediction RNA Binding Protein Chan, Alvin Korsakova, Anna Ong, Yew-Soon Winnerdy, Fernaldo Richtia Lim, Kah Wai Phan, Anh Tuan RNA alternative splicing prediction with discrete compositional energy network |
description |
A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose discrete compositional energy network (DCEN) which leverages the hierarchical relationships between splice sites, junctions and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through its constituent splice junctions' energy values. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD1, we show that DCEN outperforms baselines and ablation variants. |
author2 |
School of Physical and Mathematical Sciences |
author_facet |
School of Physical and Mathematical Sciences Chan, Alvin Korsakova, Anna Ong, Yew-Soon Winnerdy, Fernaldo Richtia Lim, Kah Wai Phan, Anh Tuan |
format |
Conference or Workshop Item |
author |
Chan, Alvin Korsakova, Anna Ong, Yew-Soon Winnerdy, Fernaldo Richtia Lim, Kah Wai Phan, Anh Tuan |
author_sort |
Chan, Alvin |
title |
RNA alternative splicing prediction with discrete compositional energy network |
title_short |
RNA alternative splicing prediction with discrete compositional energy network |
title_full |
RNA alternative splicing prediction with discrete compositional energy network |
title_fullStr |
RNA alternative splicing prediction with discrete compositional energy network |
title_full_unstemmed |
RNA alternative splicing prediction with discrete compositional energy network |
title_sort |
rna alternative splicing prediction with discrete compositional energy network |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/155091 |
_version_ |
1759857233184686080 |