Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection

Undetected errors in the expression measurements from highthroughput DNA microarrays and protein spectroscopy could seriously affect the diagnostic reliability in disease detection. In addition to a high resilience against such errors, diagnostic models need to be more comprehensible so that a deepe...

Full description

Saved in:
Bibliographic Details
Main Authors: YAP, Ghim-Eng, TAN, Ah-Hwee, PANG, Hwee Hwa
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2007
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/394
https://ink.library.smu.edu.sg/context/sis_research/article/1393/viewcontent/Learning_Causal_Models_for_Noisy_Biological_Data_Mining__An_Application_to_Ovarian_Cancer_Detection_edited_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1393
record_format dspace
spelling sg-smu-ink.sis_research-13932017-12-07T03:40:07Z Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection YAP, Ghim-Eng TAN, Ah-Hwee PANG, Hwee Hwa Undetected errors in the expression measurements from highthroughput DNA microarrays and protein spectroscopy could seriously affect the diagnostic reliability in disease detection. In addition to a high resilience against such errors, diagnostic models need to be more comprehensible so that a deeper understanding of the causal interactions among biological entities like genes and proteins may be possible. In this paper, we introduce a robust knowledge discovery approach that addresses these challenges. First, the causal interactions among the genes and proteins in the noisy expression data are discovered automatically through Bayesian network learning. Then, the diagnosis of a disease based on the network is performed using a novel error-handling procedure, which automatically identifies the noisy measurements and accounts for their uncertainties during diagnosis. An application to the problem of ovarian cancer detection shows that the approach effectively discovers causal interactions among cancer-specific proteins. With the proposed error-handling procedure, the network perfectly distinguishes between the cancer and normal patients. 2007-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/394 https://ink.library.smu.edu.sg/context/sis_research/article/1393/viewcontent/Learning_Causal_Models_for_Noisy_Biological_Data_Mining__An_Application_to_Ovarian_Cancer_Detection_edited_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Disease detection Error-handling procedure Learning causal models Noisy biological data Databases and Information Systems Medicine and Health Sciences Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Disease detection
Error-handling procedure
Learning causal models
Noisy biological data
Databases and Information Systems
Medicine and Health Sciences
Numerical Analysis and Scientific Computing
spellingShingle Disease detection
Error-handling procedure
Learning causal models
Noisy biological data
Databases and Information Systems
Medicine and Health Sciences
Numerical Analysis and Scientific Computing
YAP, Ghim-Eng
TAN, Ah-Hwee
PANG, Hwee Hwa
Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection
description Undetected errors in the expression measurements from highthroughput DNA microarrays and protein spectroscopy could seriously affect the diagnostic reliability in disease detection. In addition to a high resilience against such errors, diagnostic models need to be more comprehensible so that a deeper understanding of the causal interactions among biological entities like genes and proteins may be possible. In this paper, we introduce a robust knowledge discovery approach that addresses these challenges. First, the causal interactions among the genes and proteins in the noisy expression data are discovered automatically through Bayesian network learning. Then, the diagnosis of a disease based on the network is performed using a novel error-handling procedure, which automatically identifies the noisy measurements and accounts for their uncertainties during diagnosis. An application to the problem of ovarian cancer detection shows that the approach effectively discovers causal interactions among cancer-specific proteins. With the proposed error-handling procedure, the network perfectly distinguishes between the cancer and normal patients.
format text
author YAP, Ghim-Eng
TAN, Ah-Hwee
PANG, Hwee Hwa
author_facet YAP, Ghim-Eng
TAN, Ah-Hwee
PANG, Hwee Hwa
author_sort YAP, Ghim-Eng
title Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection
title_short Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection
title_full Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection
title_fullStr Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection
title_full_unstemmed Learning Causal Models for Noisy Biological Data Mining: An Application to Ovarian Cancer Detection
title_sort learning causal models for noisy biological data mining: an application to ovarian cancer detection
publisher Institutional Knowledge at Singapore Management University
publishDate 2007
url https://ink.library.smu.edu.sg/sis_research/394
https://ink.library.smu.edu.sg/context/sis_research/article/1393/viewcontent/Learning_Causal_Models_for_Noisy_Biological_Data_Mining__An_Application_to_Ovarian_Cancer_Detection_edited_.pdf
_version_ 1770570408846688256