Random forest for gene selection and microarray data classification

A random forest method has been selected to perform both gene selection and classification of the microarray data. In this embedded method, the selection of smallest possible sets of genes with lowest error rates is the key factor in achieving highest classification accuracy. Hence, improved gene se...

Full description

Saved in:
Bibliographic Details
Main Authors: Moorthy, Kohbalan, Mohamad, Mohd. Saberi
Format: Article
Published: Springer 2011
Subjects:
Online Access:http://eprints.utm.my/id/eprint/39857/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218317/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.39857
record_format eprints
spelling my.utm.398572019-03-05T01:59:06Z http://eprints.utm.my/id/eprint/39857/ Random forest for gene selection and microarray data classification Moorthy, Kohbalan Mohamad, Mohd. Saberi TK Electrical engineering. Electronics Nuclear engineering A random forest method has been selected to perform both gene selection and classification of the microarray data. In this embedded method, the selection of smallest possible sets of genes with lowest error rates is the key factor in achieving highest classification accuracy. Hence, improved gene selection method using random forest has been proposed to obtain the smallest subset of genes as well as biggest subset of genes prior to classification. The option for biggest subset selection is done to assist researchers who intend to use the informative genes for further research. Enhanced random forest gene selection has performed better in terms of selecting the smallest subset as well as biggest subset of informative genes with lowest out of bag error rates through gene selection. Furthermore, the classification performed on the selected subset of genes using random forest has lead to lower prediction error rates compared to existing method and other similar available methods. Springer 2011 Article PeerReviewed Moorthy, Kohbalan and Mohamad, Mohd. Saberi (2011) Random forest for gene selection and microarray data classification. Biomedical Informatics, 7 (3). pp. 142-146. ISSN 0973-8894 (Print); 0973-2063 (Online) https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218317/
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TK Electrical engineering. Electronics Nuclear engineering
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Moorthy, Kohbalan
Mohamad, Mohd. Saberi
Random forest for gene selection and microarray data classification
description A random forest method has been selected to perform both gene selection and classification of the microarray data. In this embedded method, the selection of smallest possible sets of genes with lowest error rates is the key factor in achieving highest classification accuracy. Hence, improved gene selection method using random forest has been proposed to obtain the smallest subset of genes as well as biggest subset of genes prior to classification. The option for biggest subset selection is done to assist researchers who intend to use the informative genes for further research. Enhanced random forest gene selection has performed better in terms of selecting the smallest subset as well as biggest subset of informative genes with lowest out of bag error rates through gene selection. Furthermore, the classification performed on the selected subset of genes using random forest has lead to lower prediction error rates compared to existing method and other similar available methods.
format Article
author Moorthy, Kohbalan
Mohamad, Mohd. Saberi
author_facet Moorthy, Kohbalan
Mohamad, Mohd. Saberi
author_sort Moorthy, Kohbalan
title Random forest for gene selection and microarray data classification
title_short Random forest for gene selection and microarray data classification
title_full Random forest for gene selection and microarray data classification
title_fullStr Random forest for gene selection and microarray data classification
title_full_unstemmed Random forest for gene selection and microarray data classification
title_sort random forest for gene selection and microarray data classification
publisher Springer
publishDate 2011
url http://eprints.utm.my/id/eprint/39857/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3218317/
_version_ 1643650381714030592