A pathway-based approach for analyzing microarray data using random forests

Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, less study has assessed the abilities of these methods in incorporating pathway information for analyzing microarray data. In general, genes that are identified without...

Full description

Saved in:
Bibliographic Details
Main Authors: Chin, Hui Shi, Mohamad, Mohd. Saberi, Deris, Safaai, Ibrahim, Dzuwairie
Format: Article
Published: 2012
Subjects:
Online Access:http://eprints.utm.my/id/eprint/46515/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.46515
record_format eprints
spelling my.utm.465152017-09-12T06:26:25Z http://eprints.utm.my/id/eprint/46515/ A pathway-based approach for analyzing microarray data using random forests Chin, Hui Shi Mohamad, Mohd. Saberi Deris, Safaai Ibrahim, Dzuwairie QP Physiology Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, less study has assessed the abilities of these methods in incorporating pathway information for analyzing microarray data. In general, genes that are identified without incorporating biological knowledge are more difficult to interpret Thus, the pathway-based survival analysts using machine learning methods represents a promising approach for generating new biological hypothesis from micro array studies. The two popular variants of random forests used in this research for surmised data are. random survival forests and bivariate node-splitting random survival forests. There are three types of datasets used for this research and each dataset with a three-level outcome. This research which compared the four splitting rules available in random survival forests to identify log rank test is the most accurate in terms of prediction error. To evaluate the accuracy of pathway based survival approach, this research considered employing area under the receiver operating characteristic curve for censored data. The use of random survival forests for survival outcomes in analyzing micro array data allows researchers to obtain results that are more closely tied with the biological mechanism of diseases 2012 Article PeerReviewed Chin, Hui Shi and Mohamad, Mohd. Saberi and Deris, Safaai and Ibrahim, Dzuwairie (2012) A pathway-based approach for analyzing microarray data using random forests. ICIC Express Letters, 6 (5). pp. 1253-1257. ISSN 1881-803X
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QP Physiology
spellingShingle QP Physiology
Chin, Hui Shi
Mohamad, Mohd. Saberi
Deris, Safaai
Ibrahim, Dzuwairie
A pathway-based approach for analyzing microarray data using random forests
description Although machine learning methods, such as random forests, have been developed to correlate survival outcomes with a set of genes, less study has assessed the abilities of these methods in incorporating pathway information for analyzing microarray data. In general, genes that are identified without incorporating biological knowledge are more difficult to interpret Thus, the pathway-based survival analysts using machine learning methods represents a promising approach for generating new biological hypothesis from micro array studies. The two popular variants of random forests used in this research for surmised data are. random survival forests and bivariate node-splitting random survival forests. There are three types of datasets used for this research and each dataset with a three-level outcome. This research which compared the four splitting rules available in random survival forests to identify log rank test is the most accurate in terms of prediction error. To evaluate the accuracy of pathway based survival approach, this research considered employing area under the receiver operating characteristic curve for censored data. The use of random survival forests for survival outcomes in analyzing micro array data allows researchers to obtain results that are more closely tied with the biological mechanism of diseases
format Article
author Chin, Hui Shi
Mohamad, Mohd. Saberi
Deris, Safaai
Ibrahim, Dzuwairie
author_facet Chin, Hui Shi
Mohamad, Mohd. Saberi
Deris, Safaai
Ibrahim, Dzuwairie
author_sort Chin, Hui Shi
title A pathway-based approach for analyzing microarray data using random forests
title_short A pathway-based approach for analyzing microarray data using random forests
title_full A pathway-based approach for analyzing microarray data using random forests
title_fullStr A pathway-based approach for analyzing microarray data using random forests
title_full_unstemmed A pathway-based approach for analyzing microarray data using random forests
title_sort pathway-based approach for analyzing microarray data using random forests
publishDate 2012
url http://eprints.utm.my/id/eprint/46515/
_version_ 1643652057833406464