RNA alternative splicing prediction with discrete compositional energy network

A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's prima...

Full description

Saved in:
Bibliographic Details
Main Authors: Chan, Alvin, Korsakova, Anna, Ong, Yew-Soon, Winnerdy, Fernaldo Richtia, Lim, Kah Wai, Phan, Anh Tuan
Other Authors: School of Physical and Mathematical Sciences
Format: Conference or Workshop Item
Language:English
Published: 2022
Subjects:
Online Access:https://hdl.handle.net/10356/155091
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-155091
record_format dspace
spelling sg-ntu-dr.10356-1550912023-02-28T19:17:54Z RNA alternative splicing prediction with discrete compositional energy network Chan, Alvin Korsakova, Anna Ong, Yew-Soon Winnerdy, Fernaldo Richtia Lim, Kah Wai Phan, Anh Tuan School of Physical and Mathematical Sciences Proceedings of the Conference on Health, Inference, and Learning (CHIL '21) Computer and Information Science Medicine, Health and Life Sciences RNA Splicing Prediction RNA Binding Protein A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose discrete compositional energy network (DCEN) which leverages the hierarchical relationships between splice sites, junctions and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through its constituent splice junctions' energy values. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD1, we show that DCEN outperforms baselines and ablation variants. Nanyang Technological University National Research Foundation (NRF) Published version This work is supported by the Data Science and Artificial Intelligence Research Center (DSAIR), the School of Computer Science and Engineering at Nanyang Technological University and the Singapore National Research Foundation Investigatorship (NRFNRFI2017-09). 2022-02-11T05:34:35Z 2022-02-11T05:34:35Z 2021 Conference Paper Chan, A., Korsakova, A., Ong, Y., Winnerdy, F. R., Lim, K. W. & Phan, A. T. (2021). RNA alternative splicing prediction with discrete compositional energy network. Proceedings of the Conference on Health, Inference, and Learning (CHIL '21), 193-203. https://dx.doi.org/10.1145/3450439.3451857 9781450383592 https://hdl.handle.net/10356/155091 10.1145/3450439.3451857 2-s2.0-85104095601 193 203 en NRFNRFI2017-09 10.21979/N9/FFN0XH © 2021 Copyright held by the owner/author(s). Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Medicine, Health and Life Sciences
RNA Splicing Prediction
RNA Binding Protein
spellingShingle Computer and Information Science
Medicine, Health and Life Sciences
RNA Splicing Prediction
RNA Binding Protein
Chan, Alvin
Korsakova, Anna
Ong, Yew-Soon
Winnerdy, Fernaldo Richtia
Lim, Kah Wai
Phan, Anh Tuan
RNA alternative splicing prediction with discrete compositional energy network
description A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose discrete compositional energy network (DCEN) which leverages the hierarchical relationships between splice sites, junctions and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through its constituent splice junctions' energy values. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD1, we show that DCEN outperforms baselines and ablation variants.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Chan, Alvin
Korsakova, Anna
Ong, Yew-Soon
Winnerdy, Fernaldo Richtia
Lim, Kah Wai
Phan, Anh Tuan
format Conference or Workshop Item
author Chan, Alvin
Korsakova, Anna
Ong, Yew-Soon
Winnerdy, Fernaldo Richtia
Lim, Kah Wai
Phan, Anh Tuan
author_sort Chan, Alvin
title RNA alternative splicing prediction with discrete compositional energy network
title_short RNA alternative splicing prediction with discrete compositional energy network
title_full RNA alternative splicing prediction with discrete compositional energy network
title_fullStr RNA alternative splicing prediction with discrete compositional energy network
title_full_unstemmed RNA alternative splicing prediction with discrete compositional energy network
title_sort rna alternative splicing prediction with discrete compositional energy network
publishDate 2022
url https://hdl.handle.net/10356/155091
_version_ 1759857233184686080