A deep neural network approach to predicting clinical outcomes of neuroblastoma patients

Background: The availability of high-throughput omics datasets from large patient cohorts has allowed the development of methods that aim at predicting patient clinical outcomes, such as survival and disease recurrence. Such methods are also important to better understand the biological mechanisms u...

Full description

Saved in:
Bibliographic Details
Main Authors: Tranchevent, Léon-Charles, Azuaje, Francisco, Rajapakse, Jagath Chandana
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2021
Subjects:
Online Access:https://hdl.handle.net/10356/146977
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-146977
record_format dspace
spelling sg-ntu-dr.10356-1469772021-03-18T08:11:16Z A deep neural network approach to predicting clinical outcomes of neuroblastoma patients Tranchevent, Léon-Charles Azuaje, Francisco Rajapakse, Jagath Chandana School of Computer Science and Engineering Bioinformatics Research Centre Science::Biological sciences Machine Learning Deep Learning Background: The availability of high-throughput omics datasets from large patient cohorts has allowed the development of methods that aim at predicting patient clinical outcomes, such as survival and disease recurrence. Such methods are also important to better understand the biological mechanisms underlying disease etiology and development, as well as treatment responses. Recently, different predictive models, relying on distinct algorithms (including Support Vector Machines and Random Forests) have been investigated. In this context, deep learning strategies are of special interest due to their demonstrated superior performance over a wide range of problems and datasets. One of the main challenges of such strategies is the "small n large p" problem. Indeed, omics datasets typically consist of small numbers of samples and large numbers of features relative to typical deep learning datasets. Neural networks usually tackle this problem through feature selection or by including additional constraints during the learning process. Methods: We propose to tackle this problem with a novel strategy that relies on a graph-based method for featureextraction, coupled with a deep neural network for clinical outcome prediction. The omics data are first representedas graphs whose nodes represent patients, and edges represent correlations between the patients’ omics profiles.Topological features, such as centralities, are then extracted from these graphs for every node. Lastly, these featuresare used as input to train and test various classifiers. Results: We apply this strategy to four neuroblastoma datasets and observe that models based on neural networksare more accurate than state of the art models (DNN: 85%-87%, SVM/RF: 75%-82%). We explore how differentparameters and configurations are selected in order to overcome the effects of the small data problem as well as thecurse of dimensionality. Conclusions: Our results indicate that the deep neural networks capture complex features in the data that helppredicting patient clinical outcomes. Published version 2021-03-18T08:11:16Z 2021-03-18T08:11:16Z 2019 Journal Article Tranchevent, L., Azuaje, F. & Rajapakse, J. C. (2019). A deep neural network approach to predicting clinical outcomes of neuroblastoma patients. BMC Medical Genomics, 12(Suppl 8). https://dx.doi.org/10.1186/s12920-019-0628-y 1755-8794 0000-0001-7944-1658 https://hdl.handle.net/10356/146977 10.1186/s12920-019-0628-y 31856829 2-s2.0-85077073432 Suppl 8 12 en BMC Medical Genomics © 2019 The Author(s). This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
Machine Learning
Deep Learning
spellingShingle Science::Biological sciences
Machine Learning
Deep Learning
Tranchevent, Léon-Charles
Azuaje, Francisco
Rajapakse, Jagath Chandana
A deep neural network approach to predicting clinical outcomes of neuroblastoma patients
description Background: The availability of high-throughput omics datasets from large patient cohorts has allowed the development of methods that aim at predicting patient clinical outcomes, such as survival and disease recurrence. Such methods are also important to better understand the biological mechanisms underlying disease etiology and development, as well as treatment responses. Recently, different predictive models, relying on distinct algorithms (including Support Vector Machines and Random Forests) have been investigated. In this context, deep learning strategies are of special interest due to their demonstrated superior performance over a wide range of problems and datasets. One of the main challenges of such strategies is the "small n large p" problem. Indeed, omics datasets typically consist of small numbers of samples and large numbers of features relative to typical deep learning datasets. Neural networks usually tackle this problem through feature selection or by including additional constraints during the learning process. Methods: We propose to tackle this problem with a novel strategy that relies on a graph-based method for featureextraction, coupled with a deep neural network for clinical outcome prediction. The omics data are first representedas graphs whose nodes represent patients, and edges represent correlations between the patients’ omics profiles.Topological features, such as centralities, are then extracted from these graphs for every node. Lastly, these featuresare used as input to train and test various classifiers. Results: We apply this strategy to four neuroblastoma datasets and observe that models based on neural networksare more accurate than state of the art models (DNN: 85%-87%, SVM/RF: 75%-82%). We explore how differentparameters and configurations are selected in order to overcome the effects of the small data problem as well as thecurse of dimensionality. Conclusions: Our results indicate that the deep neural networks capture complex features in the data that helppredicting patient clinical outcomes.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Tranchevent, Léon-Charles
Azuaje, Francisco
Rajapakse, Jagath Chandana
format Article
author Tranchevent, Léon-Charles
Azuaje, Francisco
Rajapakse, Jagath Chandana
author_sort Tranchevent, Léon-Charles
title A deep neural network approach to predicting clinical outcomes of neuroblastoma patients
title_short A deep neural network approach to predicting clinical outcomes of neuroblastoma patients
title_full A deep neural network approach to predicting clinical outcomes of neuroblastoma patients
title_fullStr A deep neural network approach to predicting clinical outcomes of neuroblastoma patients
title_full_unstemmed A deep neural network approach to predicting clinical outcomes of neuroblastoma patients
title_sort deep neural network approach to predicting clinical outcomes of neuroblastoma patients
publishDate 2021
url https://hdl.handle.net/10356/146977
_version_ 1695636082083758080