Gene and sample selection for cancer identification
Gene-expression data gathered with microarrays play an important role in detection, classification, and understanding of many diseases including cancer. However, the numbers of samples gathered in experiments still remain in hundreds compared to the thousands of genes whose expressions are measured...
Main Author: | Mundra Piyushkumar Arjunlal |
---|---|
Other Authors: | Jagath C. Rajapakse |
Format: | Theses and Dissertations |
Language: | English |
Published: | 2011 |
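Subjects: | DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences |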
Online Access: | https://hdl.handle.net/10356/45770 |
Institution: | Nanyang Technological University |
id | sg-ntu-dr.10356-45770 |
---|---|
record_format | dspace |
spelling | sg-ntu-dr.10356-457702023-03-04T00:36:01Z Gene and sample selection for cancer identification Mundra Piyushkumar Arjunlal Jagath C. Rajapakse School of Computer Engineering Bioinformatics Research Centre DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences Gene-expression data gathered with microarrays play an important role in detection, classification, and understanding of many diseases including cancer. However, the numbers of samples gathered in experiments still remain in hundreds compared to the thousands of genes whose expressions are measured. One way to handle this problem is to identify relevant genes that contribute to the disease and thereafter inferring the underlying mechanisms of their functions. This thesis focuses on identification of relevant genes, which is hindered due to several reasons. For example, relevant genes could be correlated with other genes that are biologically relevant but redundant for the classification of disease. While ranking the genes according to their relevance, it is important to consider the quality of samples as microarray samples are highly heterogeneous and multimodal in nature. This further raises an issue of stability of a gene selection method because a gene selection method should be repeatable and reproducible, giving high confidence for selected genes. For multiclass classification, sample distribution of various classes may play important role in gene selection. By considering these aspects into gene selection criteria, this research has evolved in multiple ways by introducing several novel gene ranking algorithms. DOCTOR OF PHILOSOPHY (SCE) 2011-06-20T04:37:52Z 2011-06-20T04:37:52Z 2011 2011 Thesis Mundra, P. A. (2011). Gene and sample selection for cancer identification. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/45770 10.32657/10356/45770 en 196 p. application/pdf |
institution | Nanyang Technological University |
building | NTU Library |
continent | Asia |
country | Singapore Singapore |
content_provider | NTU Library |
collection | DR-NTU |
language | English |
topic | DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences |
spellingShingle | DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences Mundra Piyushkumar Arjunlal Gene and sample selection for cancer identification |
description |
Gene-expression data gathered with microarrays play an important role in the detection, classification, and understanding of many diseases, including cancer. However, the number of samples gathered in experiments still remains in the hundreds, compared with the thousands of genes whose expression is measured. One way to handle this problem is to identify relevant genes that contribute to the disease and thereafter infer the underlying mechanisms of their functions.
This thesis focuses on the identification of relevant genes, which is hindered for several reasons. For example, relevant genes could be correlated with other genes that are biologically relevant but redundant for the classification of disease. When ranking genes by relevance, it is important to consider sample quality, because microarray samples are highly heterogeneous and multimodal. This in turn raises the issue of stability: a gene selection method should be repeatable and reproducible, giving high confidence in the selected genes. For multiclass classification, the distribution of samples across classes may play an important role in gene selection. By incorporating these aspects into the gene selection criteria, this research introduces several novel gene ranking algorithms. |
author2 | Jagath C. Rajapakse |
author_facet | Jagath C. Rajapakse Mundra Piyushkumar Arjunlal |
format | Theses and Dissertations |
author | Mundra Piyushkumar Arjunlal |
author_sort | Mundra Piyushkumar Arjunlal |
title | Gene and sample selection for cancer identification |
title_short | Gene and sample selection for cancer identification |
title_full | Gene and sample selection for cancer identification |
title_fullStr | Gene and sample selection for cancer identification |
title_full_unstemmed | Gene and sample selection for cancer identification |
title_sort | gene and sample selection for cancer identification |
publishDate | 2011 |
url | https://hdl.handle.net/10356/45770 |
_version_ | 1759857984113999872 |