Topology based machine learning model for the prediction of anticancer peptides

In recent years, interest in the use of therapeutic peptides for treating cancer has grown vastly. A variety of approaches based on machine learning have been explored for anticancer peptide identification while the featurization of these peptides is also critical to attaining any reasonable predict...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Joshua Zhi En
Other Authors: Xia Kelin
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156895
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In recent years, interest in the use of therapeutic peptides for treating cancer has grown vastly. A variety of approaches based on machine learning have been explored for anticancer peptide identification while the featurization of these peptides is also critical to attaining any reasonable predictive efficacy using machine learning algorithms. In this paper, we propose three topological-based featurization encodings. Machine learning models were developed using these features on two datasets: main and alternative datasets which were subsequently benchmarked with existing machine learning models. The independent testing results demonstrated that the models developed in this study had marked improvements in accuracy, specificity, and sensitivity to that of the baseline model AntiCP2.0 on both datasets. There is great potential in leveraging topological-based featurization alongside existing feature encoding techniques to accelerate the reliable identification of anticancer peptides for clinical usage.