A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression

Advancement in gene expression technology offers the ability to measure the expression levels of thousand of genes in parallel. Gene expression microarray data is expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. Key issues that need to be a...

Full description

Saved in:
Bibliographic Details
Main Author: Tan Ah Chik @ Mohamad, Mohd. Saberi
Format: Thesis
Language:English
Published: 2005
Subjects:
Online Access:http://eprints.utm.my/id/eprint/34718/1/MohdSaberiBinTanAhChik%40MohamadMFC2005.pdf
http://eprints.utm.my/id/eprint/34718/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:61261?queryType=vitalDismax&query=+A+hybrid+genetic+algorithm+and+support+vector+machine+classifier+for+feature+selection+and+classification+of+gene+expression&public=true
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
Language: English
id my.utm.34718
record_format eprints
spelling my.utm.347182017-10-11T06:44:24Z http://eprints.utm.my/id/eprint/34718/ A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression Tan Ah Chik @ Mohamad, Mohd. Saberi Unspecified Advancement in gene expression technology offers the ability to measure the expression levels of thousand of genes in parallel. Gene expression microarray data is expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. Key issues that need to be addressed under such circumstances are the efficient selection of a small subset of genes that might profoundly contribute to disease identification from the thousand of genes measured on microarrays that are inherently noisy. This research deals with finding a small subset of informative genes from gene expression data which maximizes the classification accuracy. This research proposed a hybrid between Genetic Algorithm and Support Vector Machine classifier for selecting an optimal small subset of informative genes and classifying the optimal subset. Two benchmark data sets were used to evaluate the usefulness of the approach for small and high dimension data. Although, the experimental results showed that the hybrid method performed better than some of the best previous methods on small dimensional data, its performance deteriorated significantly on the higher dimensional data. An improved version of the hybrid method was designed by introducing a new algorithm for features selection based on improved chromosome representation to replace the original algorithm on the hybrid method which appeared to perform poorly on high dimensional data. The results of the gene expression microarray classification demonstrated that the proposed method performed better than the original and the previous methods. The informative genes from the experiment results proved to be biologically plausible when compared with the biological results produced from biologist and computer scientist researches. 2005 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/34718/1/MohdSaberiBinTanAhChik%40MohamadMFC2005.pdf Tan Ah Chik @ Mohamad, Mohd. Saberi (2005) A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information System. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:61261?queryType=vitalDismax&query=+A+hybrid+genetic+algorithm+and+support+vector+machine+classifier+for+feature+selection+and+classification+of+gene+expression&public=true
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic Unspecified
spellingShingle Unspecified
Tan Ah Chik @ Mohamad, Mohd. Saberi
A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
description Advancement in gene expression technology offers the ability to measure the expression levels of thousand of genes in parallel. Gene expression microarray data is expected to significantly aid in the development of efficient cancer diagnosis and classification platforms. Key issues that need to be addressed under such circumstances are the efficient selection of a small subset of genes that might profoundly contribute to disease identification from the thousand of genes measured on microarrays that are inherently noisy. This research deals with finding a small subset of informative genes from gene expression data which maximizes the classification accuracy. This research proposed a hybrid between Genetic Algorithm and Support Vector Machine classifier for selecting an optimal small subset of informative genes and classifying the optimal subset. Two benchmark data sets were used to evaluate the usefulness of the approach for small and high dimension data. Although, the experimental results showed that the hybrid method performed better than some of the best previous methods on small dimensional data, its performance deteriorated significantly on the higher dimensional data. An improved version of the hybrid method was designed by introducing a new algorithm for features selection based on improved chromosome representation to replace the original algorithm on the hybrid method which appeared to perform poorly on high dimensional data. The results of the gene expression microarray classification demonstrated that the proposed method performed better than the original and the previous methods. The informative genes from the experiment results proved to be biologically plausible when compared with the biological results produced from biologist and computer scientist researches.
format Thesis
author Tan Ah Chik @ Mohamad, Mohd. Saberi
author_facet Tan Ah Chik @ Mohamad, Mohd. Saberi
author_sort Tan Ah Chik @ Mohamad, Mohd. Saberi
title A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_short A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_full A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_fullStr A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_full_unstemmed A hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
title_sort hybrid genetic algorithm and support vector machine classifier for feature selection and classification of gene expression
publishDate 2005
url http://eprints.utm.my/id/eprint/34718/1/MohdSaberiBinTanAhChik%40MohamadMFC2005.pdf
http://eprints.utm.my/id/eprint/34718/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:61261?queryType=vitalDismax&query=+A+hybrid+genetic+algorithm+and+support+vector+machine+classifier+for+feature+selection+and+classification+of+gene+expression&public=true
_version_ 1643649654589489152