Gene and sample selection for cancer identification

Gene-expression data gathered with microarrays play an important role in detection, classification, and understanding of many diseases including cancer. However, the numbers of samples gathered in experiments still remain in hundreds compared to the thousands of genes whose expressions are measured...

Full description

Saved in:
Bibliographic Details
Main Author: Mundra Piyushkumar Arjunlal
Other Authors: Jagath C. Rajapakse
Format: Theses and Dissertations
Language:English
Published: 2011
Subjects:
Online Access:https://hdl.handle.net/10356/45770
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-45770
record_format dspace
spelling sg-ntu-dr.10356-457702023-03-04T00:36:01Z Gene and sample selection for cancer identification Mundra Piyushkumar Arjunlal Jagath C. Rajapakse School of Computer Engineering Bioinformatics Research Centre DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences Gene-expression data gathered with microarrays play an important role in detection, classification, and understanding of many diseases including cancer. However, the numbers of samples gathered in experiments still remain in hundreds compared to the thousands of genes whose expressions are measured. One way to handle this problem is to identify relevant genes that contribute to the disease and thereafter inferring the underlying mechanisms of their functions. This thesis focuses on identification of relevant genes, which is hindered due to several reasons. For example, relevant genes could be correlated with other genes that are biologically relevant but redundant for the classification of disease. While ranking the genes according to their relevance, it is important to consider the quality of samples as microarray samples are highly heterogeneous and multimodal in nature. This further raises an issue of stability of a gene selection method because a gene selection method should be repeatable and reproducible, giving high confidence for selected genes. For multiclass classification, sample distribution of various classes may play important role in gene selection. By considering these aspects into gene selection criteria, this research has evolved in multiple ways by introducing several novel gene ranking algorithms. DOCTOR OF PHILOSOPHY (SCE) 2011-06-20T04:37:52Z 2011-06-20T04:37:52Z 2011 2011 Thesis Mundra, P. A. (2011). Gene and sample selection for cancer identification. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/45770 10.32657/10356/45770 en 196 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences
spellingShingle DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences
Mundra Piyushkumar Arjunlal
Gene and sample selection for cancer identification
description Gene-expression data gathered with microarrays play an important role in detection, classification, and understanding of many diseases including cancer. However, the numbers of samples gathered in experiments still remain in hundreds compared to the thousands of genes whose expressions are measured. One way to handle this problem is to identify relevant genes that contribute to the disease and thereafter inferring the underlying mechanisms of their functions. This thesis focuses on identification of relevant genes, which is hindered due to several reasons. For example, relevant genes could be correlated with other genes that are biologically relevant but redundant for the classification of disease. While ranking the genes according to their relevance, it is important to consider the quality of samples as microarray samples are highly heterogeneous and multimodal in nature. This further raises an issue of stability of a gene selection method because a gene selection method should be repeatable and reproducible, giving high confidence for selected genes. For multiclass classification, sample distribution of various classes may play important role in gene selection. By considering these aspects into gene selection criteria, this research has evolved in multiple ways by introducing several novel gene ranking algorithms.
author2 Jagath C. Rajapakse
author_facet Jagath C. Rajapakse
Mundra Piyushkumar Arjunlal
format Theses and Dissertations
author Mundra Piyushkumar Arjunlal
author_sort Mundra Piyushkumar Arjunlal
title Gene and sample selection for cancer identification
title_short Gene and sample selection for cancer identification
title_full Gene and sample selection for cancer identification
title_fullStr Gene and sample selection for cancer identification
title_full_unstemmed Gene and sample selection for cancer identification
title_sort gene and sample selection for cancer identification
publishDate 2011
url https://hdl.handle.net/10356/45770
_version_ 1759857984113999872