Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture

Protein function prediction is one of the most well-studied topics, attracting attention from countless researchers in the field of computational biology. Implementing deep neural networks that help improve the prediction of protein function, however, is still a major challenge. In this research, we...

Full description

Saved in:
Bibliographic Details
Main Authors: Le, Nguyen Quoc Khanh, Yapp, Edward Kien Yee, Nagasundaram, Nagarajan, Chua, Matthew Chin Heng, Yeh, Hui-Yuan
Other Authors: School of Humanities
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/142239
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-142239
record_format dspace
spelling sg-ntu-dr.10356-1422392020-06-17T09:05:22Z Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture Le, Nguyen Quoc Khanh Yapp, Edward Kien Yee Nagasundaram, Nagarajan Chua, Matthew Chin Heng Yeh, Hui-Yuan School of Humanities Humanities::General Vesicular Trafficking Model Protein Function Prediction Protein function prediction is one of the most well-studied topics, attracting attention from countless researchers in the field of computational biology. Implementing deep neural networks that help improve the prediction of protein function, however, is still a major challenge. In this research, we suggested a new strategy that includes gated recurrent units and position-specific scoring matrix profiles to predict vesicular transportation proteins, a biological function of great importance. Although it is difficult to discover its function, our model is able to achieve accuracies of 82.3% and 85.8% in the cross-validation and independent dataset, respectively. We also solve the problem of imbalance in the dataset via tuning class weight in the deep learning model. The results generated showed sensitivity, specificity, MCC, and AUC to have values of 79.2%, 82.9%, 0.52, and 0.861, respectively. Our strategy shows superiority in results on the same dataset against all other state-of-the-art algorithms. In our suggested research, we have suggested a technique for the discovery of more proteins, particularly proteins connected with vesicular transport. In addition, our accomplishment could encourage the use of gated recurrent units architecture in protein function prediction. Published version 2020-06-17T09:05:22Z 2020-06-17T09:05:22Z 2019 Journal Article Le, N. Q. K., Yapp, E. K. Y., Nagasundaram, N., Chua, M. C. H., & Yeh, H.-Y. (2019). Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture. Computational and Structural Biotechnology Journal, 17, 1245-1254. doi:10.1016/j.csbj.2019.09.005 2001-0370 https://hdl.handle.net/10356/142239 10.1016/j.csbj.2019.09.005 31921391 2-s2.0-85074447970 17 1245 1254 en Computational and structural biotechnology journal © 2019 The Authors. Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY license (http://creativecommons. org/licenses/by/4.0/). application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Humanities::General
Vesicular Trafficking Model
Protein Function Prediction
spellingShingle Humanities::General
Vesicular Trafficking Model
Protein Function Prediction
Le, Nguyen Quoc Khanh
Yapp, Edward Kien Yee
Nagasundaram, Nagarajan
Chua, Matthew Chin Heng
Yeh, Hui-Yuan
Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
description Protein function prediction is one of the most well-studied topics, attracting attention from countless researchers in the field of computational biology. Implementing deep neural networks that help improve the prediction of protein function, however, is still a major challenge. In this research, we suggested a new strategy that includes gated recurrent units and position-specific scoring matrix profiles to predict vesicular transportation proteins, a biological function of great importance. Although it is difficult to discover its function, our model is able to achieve accuracies of 82.3% and 85.8% in the cross-validation and independent dataset, respectively. We also solve the problem of imbalance in the dataset via tuning class weight in the deep learning model. The results generated showed sensitivity, specificity, MCC, and AUC to have values of 79.2%, 82.9%, 0.52, and 0.861, respectively. Our strategy shows superiority in results on the same dataset against all other state-of-the-art algorithms. In our suggested research, we have suggested a technique for the discovery of more proteins, particularly proteins connected with vesicular transport. In addition, our accomplishment could encourage the use of gated recurrent units architecture in protein function prediction.
author2 School of Humanities
author_facet School of Humanities
Le, Nguyen Quoc Khanh
Yapp, Edward Kien Yee
Nagasundaram, Nagarajan
Chua, Matthew Chin Heng
Yeh, Hui-Yuan
format Article
author Le, Nguyen Quoc Khanh
Yapp, Edward Kien Yee
Nagasundaram, Nagarajan
Chua, Matthew Chin Heng
Yeh, Hui-Yuan
author_sort Le, Nguyen Quoc Khanh
title Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
title_short Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
title_full Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
title_fullStr Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
title_full_unstemmed Computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
title_sort computational identification of vesicular transport proteins from sequences using deep gated recurrent units architecture
publishDate 2020
url https://hdl.handle.net/10356/142239
_version_ 1681058438609633280