Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers

The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS...

Full description

Saved in:
Bibliographic Details
Main Authors: Peng, Mengjiao, Xiang, Liming
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/168891
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-168891
record_format dspace
spelling sg-ntu-dr.10356-1688912023-06-21T06:31:51Z Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers Peng, Mengjiao Xiang, Liming School of Physical and Mathematical Sciences Science::Mathematics Conditional Survival Function Correlation Rank The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS) and time-to-progression (TTP), a high concordance is typically found in both endpoints in cancer studies, namely, patients' OS would most likely be extended when tumour progression is delayed. Existing screening procedures are often performed on a single survival endpoint only and may result in biased selection of features for OS in ignorance of disease progression. We propose a novel feature screening method by incorporating information of TTP into the selection of important biomarker predictors for more accurate inference of OS subsequent to disease progression. The proposal is based on the rank of correlation between individual features and the conditional distribution of OS given observations of TTP. It is advantageous for its flexible model nature, which requires no marginal model assumption for each endpoint, and its minimal computational cost for implementation. Theoretical results show its ranking consistency, sure screening and false rate control properties. Simulation results demonstrate that the proposed screener leads to more accurate feature selection than the method without considering the prior observations of disease progression. An application to breast cancer genome data illustrates its practical utility and facilitates disease classification using selected biomarker predictors. Ministry of Education (MOE) Xiang’s research was partially supported by the Singapore Ministry of Education Academic Research Fund Tier 1 Grant (RG98/20) and Tier 2 Grant (MOE-T2EP20121-0004), and Peng’s research was partially supported by the National Key R&D Program of China (Nos. 2021YFA1000100 and 2021YFA1000101) and the National Natural Science Foundation of China (NSFC grant no. 92046005). 2023-06-21T06:31:50Z 2023-06-21T06:31:50Z 2023 Journal Article Peng, M. & Xiang, L. (2023). Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers. Statistics in Medicine, 42(13), 2082-2100. https://dx.doi.org/10.1002/sim.9712 0277-6715 https://hdl.handle.net/10356/168891 10.1002/sim.9712 36951373 2-s2.0-85150942871 13 42 2082 2100 en RG98/20 MOE-T2EP20121-0004 Statistics in Medicine © 2023 John Wily & Sons Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics
Conditional Survival Function
Correlation Rank
spellingShingle Science::Mathematics
Conditional Survival Function
Correlation Rank
Peng, Mengjiao
Xiang, Liming
Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
description The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS) and time-to-progression (TTP), a high concordance is typically found in both endpoints in cancer studies, namely, patients' OS would most likely be extended when tumour progression is delayed. Existing screening procedures are often performed on a single survival endpoint only and may result in biased selection of features for OS in ignorance of disease progression. We propose a novel feature screening method by incorporating information of TTP into the selection of important biomarker predictors for more accurate inference of OS subsequent to disease progression. The proposal is based on the rank of correlation between individual features and the conditional distribution of OS given observations of TTP. It is advantageous for its flexible model nature, which requires no marginal model assumption for each endpoint, and its minimal computational cost for implementation. Theoretical results show its ranking consistency, sure screening and false rate control properties. Simulation results demonstrate that the proposed screener leads to more accurate feature selection than the method without considering the prior observations of disease progression. An application to breast cancer genome data illustrates its practical utility and facilitates disease classification using selected biomarker predictors.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Peng, Mengjiao
Xiang, Liming
format Article
author Peng, Mengjiao
Xiang, Liming
author_sort Peng, Mengjiao
title Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
title_short Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
title_full Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
title_fullStr Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
title_full_unstemmed Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
title_sort disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
publishDate 2023
url https://hdl.handle.net/10356/168891
_version_ 1772826725176049664