Informative top-k class associative rule for cancer biomarker discovery on microarray data

The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a gen...

Full description

Saved in:
Bibliographic Details
Main Authors: Ong, Huey Fang, Mustapha, Norwati, Hamdan, Hazlina, Rosli, Rozita, Mustapha, Aida
Format: Article
Language:English
Published: Elsevier 2020
Online Access:http://psasir.upm.edu.my/id/eprint/89394/1/TOP.pdf
http://psasir.upm.edu.my/id/eprint/89394/
https://www.sciencedirect.com/science/article/pii/S0957417419308863
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Putra Malaysia
Language: English
id my.upm.eprints.89394
record_format eprints
spelling my.upm.eprints.893942021-08-19T22:57:05Z http://psasir.upm.edu.my/id/eprint/89394/ Informative top-k class associative rule for cancer biomarker discovery on microarray data Ong, Huey Fang Mustapha, Norwati Hamdan, Hazlina Rosli, Rozita Mustapha, Aida The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a genome-wide analysis of human genes with various biological information. Nevertheless, more studies are needed on improving the predictability of the discovered gene biomarkers, as well as their reproducibility and interpretability, to qualify them for clinical use. This paper proposes an informative top-k class associative rule (iTCAR) method in an integrative framework for identifying candidate genes of specific cancers. iTCAR introduces an enhanced associative classification algorithm that integrates microarray data with biological information from gene ontology, KEGG pathways, and protein-protein interactions to generate informative class associative rules. A new interestingness measurement is used to rank and select class associative rules for building accurate classifiers. The experimental results show that iTCAR has excellent predictability by achieving the average classification accuracy above 90% and the average area under the curve above 0.8. Besides, iTCAR has significant reproducibility and interpretability through functional enrichment analysis and retrieval of meaningful cancer terms. These promising results suggest the proposed method has great potential in identifying candidate genes, which can be further investigated as biomarkers for cancer diseases. Elsevier 2020 Article PeerReviewed text en http://psasir.upm.edu.my/id/eprint/89394/1/TOP.pdf Ong, Huey Fang and Mustapha, Norwati and Hamdan, Hazlina and Rosli, Rozita and Mustapha, Aida (2020) Informative top-k class associative rule for cancer biomarker discovery on microarray data. Expert Systems with Applications, 146. art. no. 113169. pp. 1-18. ISSN 0957-4174 https://www.sciencedirect.com/science/article/pii/S0957417419308863 10.1016/j.eswa.2019.113169
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description The discovery of reliable cancer biomarkers is crucial for accurate early detection and clinical diagnosis. One of the strategies is by identifying expression-based cancer biomarkers through integrative microarray data analysis. Microarray is a powerful high-throughput technology, which allows a genome-wide analysis of human genes with various biological information. Nevertheless, more studies are needed on improving the predictability of the discovered gene biomarkers, as well as their reproducibility and interpretability, to qualify them for clinical use. This paper proposes an informative top-k class associative rule (iTCAR) method in an integrative framework for identifying candidate genes of specific cancers. iTCAR introduces an enhanced associative classification algorithm that integrates microarray data with biological information from gene ontology, KEGG pathways, and protein-protein interactions to generate informative class associative rules. A new interestingness measurement is used to rank and select class associative rules for building accurate classifiers. The experimental results show that iTCAR has excellent predictability by achieving the average classification accuracy above 90% and the average area under the curve above 0.8. Besides, iTCAR has significant reproducibility and interpretability through functional enrichment analysis and retrieval of meaningful cancer terms. These promising results suggest the proposed method has great potential in identifying candidate genes, which can be further investigated as biomarkers for cancer diseases.
format Article
author Ong, Huey Fang
Mustapha, Norwati
Hamdan, Hazlina
Rosli, Rozita
Mustapha, Aida
spellingShingle Ong, Huey Fang
Mustapha, Norwati
Hamdan, Hazlina
Rosli, Rozita
Mustapha, Aida
Informative top-k class associative rule for cancer biomarker discovery on microarray data
author_facet Ong, Huey Fang
Mustapha, Norwati
Hamdan, Hazlina
Rosli, Rozita
Mustapha, Aida
author_sort Ong, Huey Fang
title Informative top-k class associative rule for cancer biomarker discovery on microarray data
title_short Informative top-k class associative rule for cancer biomarker discovery on microarray data
title_full Informative top-k class associative rule for cancer biomarker discovery on microarray data
title_fullStr Informative top-k class associative rule for cancer biomarker discovery on microarray data
title_full_unstemmed Informative top-k class associative rule for cancer biomarker discovery on microarray data
title_sort informative top-k class associative rule for cancer biomarker discovery on microarray data
publisher Elsevier
publishDate 2020
url http://psasir.upm.edu.my/id/eprint/89394/1/TOP.pdf
http://psasir.upm.edu.my/id/eprint/89394/
https://www.sciencedirect.com/science/article/pii/S0957417419308863
_version_ 1709669009351245824