An Improved gSVM-SCADL2 with Firefly Algorithm for Identification of Informative Genes and Pathways

Incorporation of pathway knowledge into microarray analysis has been favoured by researchers owing to the improved biological interpretation of the analysis outcome. However, most of the pathway data are manually curated without specific biological context. Inclusion of non-informative genes in the...

Full description

Saved in:
Bibliographic Details
Main Authors: Chan, Weng Howe, Mohd Saberi , Mohamad, Safaai , Deris, Corchado, Juan Manuel, Omatu, Sigeru, Zuwairie, Ibrahim, Shahreen, Kasim
Format: Article
Language:English
Published: Inderscience Enterprises Ltd. 2016
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/13631/1/An%20Improved%20gSVM-SCADL2%20with%20Firefly%20Algorithm%20for%20Identification%20of%20Informative%20Genes%20and%20Pathways.pdf
http://umpir.ump.edu.my/id/eprint/13631/
http://www.inderscienceonline.com/doi/pdf/10.1504/IJBRA.2016.075404#d1429e145
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang
Language: English
Description
Summary:Incorporation of pathway knowledge into microarray analysis has been favoured by researchers owing to the improved biological interpretation of the analysis outcome. However, most of the pathway data are manually curated without specific biological context. Inclusion of non-informative genes in the analysis of context specific microarray data could lead to classifier with poor discriminative power. Thus, one of the main challenges is how to effectively identify informative genes from the pathway data. This paper proposes a firefly optimised penalised support vector machine with SCADL2 penalty function (SVM-SCADL2-FFA) in optimising tuning parameters for each pathway for efficient identification of informative genes and pathways. Experiments are done on lung cancer and gender data sets. Tenfold CV is used to evaluate the performance in terms of accuracy, specificity, sensitivity and F-score. The identified informative genes are validated through online databases. Our proposed method shows consistent improvements compared to previous works.