Informative top-k class associative rule for cancer biomarker discovery on microarray data
The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a gen...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020
|
Subjects: | |
Online Access: | http://eprints.uthm.edu.my/5262/1/AJ%202020%20%28129%29.pdf http://eprints.uthm.edu.my/5262/ https://doi.org/10.1016/j.eswa.2019.113169 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Tun Hussein Onn Malaysia |
Language: | English |
id |
my.uthm.eprints.5262 |
---|---|
record_format |
eprints |
spelling |
my.uthm.eprints.52622022-01-06T08:16:40Z http://eprints.uthm.edu.my/5262/ Informative top-k class associative rule for cancer biomarker discovery on microarray data Ong, Huey Fang Mustapha, Norwati Hamdan, Hazlina Rosli, Rozita Mustapha, Aida T Technology (General) T173.2-174.5 Technological change The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a genome-wide analysis of human genes with various biological information. Nevertheless, more studies are needed on improving the predictability of the discovered gene biomarkers, as well as their reproducibility and interpretability, to qualify them for clinical use. This paper proposes an informative top-k class associative rule ( i TCAR) method in an integrative framework for identifying candidate genes of specific cancers. i TCAR introduces an enhanced associative classification algorithm that integrates microarray data with biological informa- tion from gene ontology, KEGG pathways, and protein-protein interactions to generate informative class associative rules. A new interestingness measurement is used to rank and select class associative rules for building accurate classifiers. The experimental results show that i TCAR has excellent predictability by achieving the average classification accuracy above 90% and the average area under the curve above 0.8. Besides, i TCAR has significant reproducibility and interpretability through functional enrichment analy- sis and retrieval of meaningful cancer terms. These promising results suggest the proposed method has great potential in identifying candidate genes, which can be further investigated as biomarkers for cancer diseases. Elsevier 2020 Article PeerReviewed text en http://eprints.uthm.edu.my/5262/1/AJ%202020%20%28129%29.pdf Ong, Huey Fang and Mustapha, Norwati and Hamdan, Hazlina and Rosli, Rozita and Mustapha, Aida (2020) Informative top-k class associative rule for cancer biomarker discovery on microarray data. Expert Systems With Applications, 146. pp. 1-18. ISSN 0957-4174 https://doi.org/10.1016/j.eswa.2019.113169 |
institution |
Universiti Tun Hussein Onn Malaysia |
building |
UTHM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tun Hussein Onn Malaysia |
content_source |
UTHM Institutional Repository |
url_provider |
http://eprints.uthm.edu.my/ |
language |
English |
topic |
T Technology (General) T173.2-174.5 Technological change |
spellingShingle |
T Technology (General) T173.2-174.5 Technological change Ong, Huey Fang Mustapha, Norwati Hamdan, Hazlina Rosli, Rozita Mustapha, Aida Informative top-k class associative rule for cancer biomarker discovery on microarray data |
description |
The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a genome-wide analysis of human genes with various biological information. Nevertheless, more studies are needed on improving the predictability of the discovered gene biomarkers, as well as their reproducibility and interpretability, to qualify them for clinical use. This paper proposes an informative top-k class associative rule ( i TCAR) method in an integrative framework for identifying candidate genes of specific cancers. i TCAR introduces an enhanced associative classification algorithm that integrates microarray data with biological informa- tion from gene ontology, KEGG pathways, and protein-protein interactions to generate informative class associative rules. A new interestingness measurement is used to rank and select class associative rules for building accurate classifiers. The experimental results show that i TCAR has excellent predictability by achieving the average classification accuracy above 90% and the average area under the curve above 0.8. Besides, i TCAR has significant reproducibility and interpretability through functional enrichment analy- sis and retrieval of meaningful cancer terms. These promising results suggest the proposed method has great potential in identifying candidate genes, which can be further investigated as biomarkers for cancer diseases. |
format |
Article |
author |
Ong, Huey Fang Mustapha, Norwati Hamdan, Hazlina Rosli, Rozita Mustapha, Aida |
author_facet |
Ong, Huey Fang Mustapha, Norwati Hamdan, Hazlina Rosli, Rozita Mustapha, Aida |
author_sort |
Ong, Huey Fang |
title |
Informative top-k class associative rule for cancer biomarker discovery on microarray data |
title_short |
Informative top-k class associative rule for cancer biomarker discovery on microarray data |
title_full |
Informative top-k class associative rule for cancer biomarker discovery on microarray data |
title_fullStr |
Informative top-k class associative rule for cancer biomarker discovery on microarray data |
title_full_unstemmed |
Informative top-k class associative rule for cancer biomarker discovery on microarray data |
title_sort |
informative top-k class associative rule for cancer biomarker discovery on microarray data |
publisher |
Elsevier |
publishDate |
2020 |
url |
http://eprints.uthm.edu.my/5262/1/AJ%202020%20%28129%29.pdf http://eprints.uthm.edu.my/5262/ https://doi.org/10.1016/j.eswa.2019.113169 |
_version_ |
1738581357492174848 |