Big data and computational biology strategy for personalized prognosis

The era of big data and precision medicine has led to accumulation of massive datasets of gene expression data and clinical information of patients. For a new patient, we propose that identification of a highly similar reference patient from an existing patient database via similarity matching of bo...

Full description

Saved in:
Bibliographic Details
Main Authors: Ow, Ghim Siong, Tang, Zhiqun, Kuznetsov, Vladimir A.
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2018
Subjects:
Online Access:https://hdl.handle.net/10356/88605
http://hdl.handle.net/10220/46926
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-88605
record_format dspace
spelling sg-ntu-dr.10356-886052020-03-07T11:48:59Z Big data and computational biology strategy for personalized prognosis Ow, Ghim Siong Tang, Zhiqun Kuznetsov, Vladimir A. School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering Big Data Aging The era of big data and precision medicine has led to accumulation of massive datasets of gene expression data and clinical information of patients. For a new patient, we propose that identification of a highly similar reference patient from an existing patient database via similarity matching of both clinical and expression data could be useful for predicting the prognostic risk or therapeutic efficacy.Here, we propose a novel methodology to predict disease/treatment outcome via analysis of the similarity between any pair of patients who are each characterized by a certain set of pre-defined biological variables (biomarkers or clinical features) represented initially as a prognostic binary variable vector (PBVV) and subsequently transformed to a prognostic signature vector (PSV). Our analyses revealed that Euclidean distance rather correlation distance measure was effective in defining an unbiased similarity measure calculated between two PSVs.We implemented our methods to high-grade serous ovarian cancer (HGSC) based on a 36-mRNA predictor that was previously shown to stratify patients into 3 distinct prognostic subgroups. We studied and revealed that patient's age, when converted into binary variable, was positively correlated with the overall risk of succumbing to the disease. When applied to an independent testing dataset, the inclusion of age into the molecular predictor provided more robust personalized prognosis of overall survival correlated with the therapeutic response of HGSC and provided benefit for treatment targeting of the tumors in HGSC patients.Finally, our method can be generalized and implemented in many other diseases to accurately predict personalized patients' outcomes. ASTAR (Agency for Sci., Tech. and Research, S’pore) Published version 2018-12-12T08:12:21Z 2019-12-06T17:07:05Z 2018-12-12T08:12:21Z 2019-12-06T17:07:05Z 2016 Journal Article Ow, G. S., Tang, Z., & Kuznetsov, V. A. (2016). Big data and computational biology strategy for personalized prognosis. Oncotarget, 7(26), 40200-40220. doi:10.18632/oncotarget.9571 https://hdl.handle.net/10356/88605 http://hdl.handle.net/10220/46926 10.18632/oncotarget.9571 en Oncotarget © 2016 The authors (published by Impact Journals). This is an open-access article distributed under the terms of the Creative Commons Attribution License 3.0 (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
Big Data
Aging
spellingShingle DRNTU::Engineering::Computer science and engineering
Big Data
Aging
Ow, Ghim Siong
Tang, Zhiqun
Kuznetsov, Vladimir A.
Big data and computational biology strategy for personalized prognosis
description The era of big data and precision medicine has led to accumulation of massive datasets of gene expression data and clinical information of patients. For a new patient, we propose that identification of a highly similar reference patient from an existing patient database via similarity matching of both clinical and expression data could be useful for predicting the prognostic risk or therapeutic efficacy.Here, we propose a novel methodology to predict disease/treatment outcome via analysis of the similarity between any pair of patients who are each characterized by a certain set of pre-defined biological variables (biomarkers or clinical features) represented initially as a prognostic binary variable vector (PBVV) and subsequently transformed to a prognostic signature vector (PSV). Our analyses revealed that Euclidean distance rather correlation distance measure was effective in defining an unbiased similarity measure calculated between two PSVs.We implemented our methods to high-grade serous ovarian cancer (HGSC) based on a 36-mRNA predictor that was previously shown to stratify patients into 3 distinct prognostic subgroups. We studied and revealed that patient's age, when converted into binary variable, was positively correlated with the overall risk of succumbing to the disease. When applied to an independent testing dataset, the inclusion of age into the molecular predictor provided more robust personalized prognosis of overall survival correlated with the therapeutic response of HGSC and provided benefit for treatment targeting of the tumors in HGSC patients.Finally, our method can be generalized and implemented in many other diseases to accurately predict personalized patients' outcomes.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Ow, Ghim Siong
Tang, Zhiqun
Kuznetsov, Vladimir A.
format Article
author Ow, Ghim Siong
Tang, Zhiqun
Kuznetsov, Vladimir A.
author_sort Ow, Ghim Siong
title Big data and computational biology strategy for personalized prognosis
title_short Big data and computational biology strategy for personalized prognosis
title_full Big data and computational biology strategy for personalized prognosis
title_fullStr Big data and computational biology strategy for personalized prognosis
title_full_unstemmed Big data and computational biology strategy for personalized prognosis
title_sort big data and computational biology strategy for personalized prognosis
publishDate 2018
url https://hdl.handle.net/10356/88605
http://hdl.handle.net/10220/46926
_version_ 1681048985237716992