Comparative study of feature selection method of microarray data for gene classification

Recent advances in biotechnology, such as microarrays, offer the ability to measure the expression levels of thousands of genes in parallel. Analysis of microarray data can provide understanding and insight into gene function and regulatory mechanisms. This analysis is crucial to identify and class...

Bibliographic Details
Main Author: Ghazali, Nurulhuda
Format: Thesis
Language: English
Published: 2009
Subjects: QA75 Electronic computers. Computer science; RC0254 Neoplasms. Tumors. Oncology (including Cancer)
Online Access: http://eprints.utm.my/id/eprint/11502/6/NurulhudaGhazaliMFSKSM2009.pdf
http://eprints.utm.my/id/eprint/11502/
Institution: Universiti Teknologi Malaysia
id my.utm.11502
record_format eprints
spelling my.utm.11502 2017-09-20T10:00:12Z http://eprints.utm.my/id/eprint/11502/ Comparative study of feature selection method of microarray data for gene classification Ghazali, Nurulhuda QA75 Electronic computers. Computer science RC0254 Neoplasms. Tumors. Oncology (including Cancer) 2009-10 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/11502/6/NurulhudaGhazaliMFSKSM2009.pdf Ghazali, Nurulhuda (2009) Comparative study of feature selection method of microarray data for gene classification. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information Systems.
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA75 Electronic computers. Computer science
RC0254 Neoplasms. Tumors. Oncology (including Cancer)
description Recent advances in biotechnology, such as microarrays, offer the ability to measure the expression levels of thousands of genes in parallel. Analysis of microarray data can provide understanding and insight into gene function and regulatory mechanisms. This analysis is crucial for identifying and classifying cancers. Recent cancer classification technology is based on gene expression profiles rather than on the morphological appearance of the tumor. However, this task is made more difficult by the noisy nature of microarray data and the overwhelming number of genes. It is therefore important to select a small subset of genes, referred to as informative genes, to represent the thousands of genes in microarray data. These informative genes are then classified according to their appropriate classes. To address the classification problem, we propose an approach that combines the minimum Redundancy-Maximum Relevance (mRMR) feature selection method with a Probabilistic Neural Network (PNN) classifier. The mRMR method selects the informative genes, while the PNN acts as the classifier. The approach was tested on a well-known cancer dataset, Leukemia. The results show that the selected genes give high classification accuracy. Reducing the number of genes eases the burden on biologists, and the better classification accuracy could be used widely to detect cancer at an early stage.
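The pipeline named in the description (mRMR gene selection followed by a probabilistic neural network) can be illustrated with a minimal sketch. This is not the thesis's implementation: the function names, the toy data, the use of the mutual-information difference criterion, the Gaussian kernel, and the `sigma` value are all assumptions made for illustration, and expression values are assumed to be already discretized.

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(x)
    joint, px, py = Counter(zip(x, y)), Counter(x), Counter(y)
    return sum((c / n) * math.log(c * n / (px[a] * py[b]))
               for (a, b), c in joint.items())


def mrmr_select(columns, labels, k):
    """Greedy mRMR: maximize relevance minus mean redundancy with chosen genes."""
    relevance = [mutual_information(col, labels) for col in columns]
    selected = [max(range(len(columns)), key=relevance.__getitem__)]
    redundancy = [0.0] * len(columns)  # running sum of MI with selected genes
    while len(selected) < min(k, len(columns)):
        last = columns[selected[-1]]
        best, best_score = None, -math.inf
        for j, col in enumerate(columns):
            if j in selected:
                continue
            redundancy[j] += mutual_information(col, last)
            score = relevance[j] - redundancy[j] / len(selected)
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
    return selected


def pnn_predict(train_x, train_y, sample, sigma=1.0):
    """PNN decision: average a Gaussian kernel per class, return the top class."""
    sums, counts = Counter(), Counter()
    for x, label in zip(train_x, train_y):
        d2 = sum((a - b) ** 2 for a, b in zip(x, sample))
        sums[label] += math.exp(-d2 / (2.0 * sigma ** 2))
        counts[label] += 1
    return max(counts, key=lambda c: sums[c] / counts[c])


# Hypothetical toy data: 4 "genes" (columns) over 8 samples, discretized to 0/1.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
genes = [
    [0, 0, 0, 1, 1, 1, 1, 1],  # informative
    [0, 0, 0, 1, 1, 1, 1, 1],  # exact copy of gene 0, so fully redundant
    [0, 0, 0, 0, 0, 1, 1, 1],  # informative, wrong on a different sample
    [0, 1, 0, 1, 0, 1, 0, 1],  # unrelated to the labels
]
chosen = mrmr_select(genes, labels, k=2)
# Rebuild per-sample feature vectors that contain only the chosen genes.
train = [[genes[j][i] for j in chosen] for i in range(len(labels))]
print("selected genes:", chosen)
print("predicted class:", pnn_predict(train, labels, [1, 1], sigma=0.5))
```

In this toy setting the redundancy term is what keeps the duplicated gene out of the selected pair, which is the behaviour the abstract relies on when it speaks of a small subset of informative genes.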
format Thesis
author Ghazali, Nurulhuda
title Comparative study of feature selection method of microarray data for gene classification