A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets
Identification of informative genes is essential for the disease and cancer studies. Metaheuristic algorithms have been widely used for this purpose. However, their performance on various high-dimensional datasets of genomic studies has not been fully addressed. This work was intended to perform a c...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Springer Science and Business Media Deutschland GmbH
2021
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/94946/ http://dx.doi.org/10.1007/s00500-021-05726-0 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.94946 |
---|---|
record_format |
eprints |
spelling |
my.utm.949462022-04-29T22:32:30Z http://eprints.utm.my/id/eprint/94946/ A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets Hameed, Shilan S. Hassan, Wan Haslina Abdul Latiff, Liza Muhammad, Fahmi F. T Technology (General) Identification of informative genes is essential for the disease and cancer studies. Metaheuristic algorithms have been widely used for this purpose. However, their performance on various high-dimensional datasets of genomic studies has not been fully addressed. This work was intended to perform a comprehensive comparative analysis on three well-known nature-inspired metaheuristic algorithms, namely binary particle swarm optimization (BPSO), genetic algorithm (GA) and cuckoo search algorithm (CS) when they are used in gene selection and classification in twelve high-dimensional cancer datasets. The methodology was carried out through the utilization of a three-phase hybrid approach, considering a pre-processing filtration using Pearson product-moment correlation coefficient (PPMCC) followed by the metaheuristic and classification algorithms. Comparably, five different classification algorithms were used in each phase of analysis. It was seen that the application of PCCMA filter has acted upon reducing the computational complexity of overall analysis. The comparative study showed that BPSO outperformed GA and CS in terms of accuracy. However, CS was able to select fewer attributed genes and was less computationally complex compared to that of GA and BPSO. Springer Science and Business Media Deutschland GmbH 2021 Article PeerReviewed Hameed, Shilan S. and Hassan, Wan Haslina and Abdul Latiff, Liza and Muhammad, Fahmi F. (2021) A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets. Soft Computing, 25 (13). pp. 8683-8701. ISSN 1432-7643 http://dx.doi.org/10.1007/s00500-021-05726-0 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
T Technology (General) |
spellingShingle |
T Technology (General) Hameed, Shilan S. Hassan, Wan Haslina Abdul Latiff, Liza Muhammad, Fahmi F. A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
description |
Identification of informative genes is essential for the disease and cancer studies. Metaheuristic algorithms have been widely used for this purpose. However, their performance on various high-dimensional datasets of genomic studies has not been fully addressed. This work was intended to perform a comprehensive comparative analysis on three well-known nature-inspired metaheuristic algorithms, namely binary particle swarm optimization (BPSO), genetic algorithm (GA) and cuckoo search algorithm (CS) when they are used in gene selection and classification in twelve high-dimensional cancer datasets. The methodology was carried out through the utilization of a three-phase hybrid approach, considering a pre-processing filtration using Pearson product-moment correlation coefficient (PPMCC) followed by the metaheuristic and classification algorithms. Comparably, five different classification algorithms were used in each phase of analysis. It was seen that the application of PCCMA filter has acted upon reducing the computational complexity of overall analysis. The comparative study showed that BPSO outperformed GA and CS in terms of accuracy. However, CS was able to select fewer attributed genes and was less computationally complex compared to that of GA and BPSO. |
format |
Article |
author |
Hameed, Shilan S. Hassan, Wan Haslina Abdul Latiff, Liza Muhammad, Fahmi F. |
author_facet |
Hameed, Shilan S. Hassan, Wan Haslina Abdul Latiff, Liza Muhammad, Fahmi F. |
author_sort |
Hameed, Shilan S. |
title |
A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
title_short |
A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
title_full |
A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
title_fullStr |
A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
title_full_unstemmed |
A comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
title_sort |
comparative study of nature-inspired metaheuristic algorithms using a three-phase hybrid approach for gene selection and classification in high-dimensional cancer datasets |
publisher |
Springer Science and Business Media Deutschland GmbH |
publishDate |
2021 |
url |
http://eprints.utm.my/id/eprint/94946/ http://dx.doi.org/10.1007/s00500-021-05726-0 |
_version_ |
1732945413388894208 |