THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites

N7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized spec...

Full description

Saved in:
Bibliographic Details
Main Author: Shoombuatong W.
Other Authors: Mahidol University
Format: Article
Published: 2023
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/83697
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.83697
record_format dspace
spelling th-mahidol.836972023-06-18T23:46:46Z THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites Shoombuatong W. Mahidol University Biochemistry, Genetics and Molecular Biology N7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A three-layer ensemble predictor for identifying human RNA N7-methylguanosine sites) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three-step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were created using the new feature vector and their predicted probability was yet again considered as novel features. Finally, random forest was deemed as the best super classifier learner for the final prediction using a systematic approach incorporated with novel features. Interestingly, THRONE outperformed other existing methods in the prediction of m7G sites on both cross-validation analysis and independent evaluation. The proposed method is publicly accessible at: http://thegleelab.org/THRONE/ and expects to help the scientific community identify the putative m7G sites and formulate a novel testable biological hypothesis. 2023-06-18T16:46:46Z 2023-06-18T16:46:46Z 2022-06-15 Article Journal of Molecular Biology Vol.434 No.11 (2022) 10.1016/j.jmb.2022.167549 10898638 00222836 35662472 2-s2.0-85127865689 https://repository.li.mahidol.ac.th/handle/123456789/83697 SCOPUS
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Biochemistry, Genetics and Molecular Biology
spellingShingle Biochemistry, Genetics and Molecular Biology
Shoombuatong W.
THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
description N7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 5′ cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A three-layer ensemble predictor for identifying human RNA N7-methylguanosine sites) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three-step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were created using the new feature vector and their predicted probability was yet again considered as novel features. Finally, random forest was deemed as the best super classifier learner for the final prediction using a systematic approach incorporated with novel features. Interestingly, THRONE outperformed other existing methods in the prediction of m7G sites on both cross-validation analysis and independent evaluation. The proposed method is publicly accessible at: http://thegleelab.org/THRONE/ and expects to help the scientific community identify the putative m7G sites and formulate a novel testable biological hypothesis.
author2 Mahidol University
author_facet Mahidol University
Shoombuatong W.
format Article
author Shoombuatong W.
author_sort Shoombuatong W.
title THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
title_short THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
title_full THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
title_fullStr THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
title_full_unstemmed THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methylguanosine Sites
title_sort throne: a new approach for accurate prediction of human rna n7-methylguanosine sites
publishDate 2023
url https://repository.li.mahidol.ac.th/handle/123456789/83697
_version_ 1781415872818053120