THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
N7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized spec...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Article |
Published: |
2023
|
Subjects: | |
Online Access: | https://repository.li.mahidol.ac.th/handle/123456789/83697 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Mahidol University |
id |
th-mahidol.83697 |
---|---|
record_format |
dspace |
spelling |
th-mahidol.836972023-06-18T23:46:46Z THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites Shoombuatong W. Mahidol University Biochemistry, Genetics and Molecular Biology N7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A three-layer ensemble predictor for identifying human RNA N7-methylguanosine sites) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three-step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were created using the new feature vector and their predicted probability was yet again considered as novel features. Finally, random forest was deemed as the best super classifier learner for the final prediction using a systematic approach incorporated with novel features. Interestingly, THRONE outperformed other existing methods in the prediction of m7G sites on both cross-validation analysis and independent evaluation. The proposed method is publicly accessible at: http://thegleelab.org/THRONE/ and expects to help the scientific community identify the putative m7G sites and formulate a novel testable biological hypothesis. 2023-06-18T16:46:46Z 2023-06-18T16:46:46Z 2022-06-15 Article Journal of Molecular Biology Vol.434 No.11 (2022) 10.1016/j.jmb.2022.167549 10898638 00222836 35662472 2-s2.0-85127865689 https://repository.li.mahidol.ac.th/handle/123456789/83697 SCOPUS |
institution |
Mahidol University |
building |
Mahidol University Library |
continent |
Asia |
country |
Thailand Thailand |
content_provider |
Mahidol University Library |
collection |
Mahidol University Institutional Repository |
topic |
Biochemistry, Genetics and Molecular Biology |
spellingShingle |
Biochemistry, Genetics and Molecular Biology Shoombuatong W. THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites |
description |
N7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A three-layer ensemble predictor for identifying human RNA N7-methylguanosine sites) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three-step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were created using the new feature vector and their predicted probability was yet again considered as novel features. Finally, random forest was deemed as the best super classifier learner for the final prediction using a systematic approach incorporated with novel features. Interestingly, THRONE outperformed other existing methods in the prediction of m7G sites on both cross-validation analysis and independent evaluation. The proposed method is publicly accessible at: http://thegleelab.org/THRONE/ and expects to help the scientific community identify the putative m7G sites and formulate a novel testable biological hypothesis. |
author2 |
Mahidol University |
author_facet |
Mahidol University Shoombuatong W. |
format |
Article |
author |
Shoombuatong W. |
author_sort |
Shoombuatong W. |
title |
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites |
title_short |
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites |
title_full |
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites |
title_fullStr |
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites |
title_full_unstemmed |
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites |
title_sort |
throne: a new approach for accurate prediction of human rna n7-methylguanosine sites |
publishDate |
2023 |
url |
https://repository.li.mahidol.ac.th/handle/123456789/83697 |
_version_ |
1781415872818053120 |