Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers
The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/168891 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-168891 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1688912023-06-21T06:31:51Z Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers Peng, Mengjiao Xiang, Liming School of Physical and Mathematical Sciences Science::Mathematics Conditional Survival Function Correlation Rank The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS) and time-to-progression (TTP), a high concordance is typically found in both endpoints in cancer studies, namely, patients' OS would most likely be extended when tumour progression is delayed. Existing screening procedures are often performed on a single survival endpoint only and may result in biased selection of features for OS in ignorance of disease progression. We propose a novel feature screening method by incorporating information of TTP into the selection of important biomarker predictors for more accurate inference of OS subsequent to disease progression. The proposal is based on the rank of correlation between individual features and the conditional distribution of OS given observations of TTP. It is advantageous for its flexible model nature, which requires no marginal model assumption for each endpoint, and its minimal computational cost for implementation. Theoretical results show its ranking consistency, sure screening and false rate control properties. Simulation results demonstrate that the proposed screener leads to more accurate feature selection than the method without considering the prior observations of disease progression. An application to breast cancer genome data illustrates its practical utility and facilitates disease classification using selected biomarker predictors. Ministry of Education (MOE) Xiang’s research was partially supported by the Singapore Ministry of Education Academic Research Fund Tier 1 Grant (RG98/20) and Tier 2 Grant (MOE-T2EP20121-0004), and Peng’s research was partially supported by the National Key R&D Program of China (Nos. 2021YFA1000100 and 2021YFA1000101) and the National Natural Science Foundation of China (NSFC grant no. 92046005). 2023-06-21T06:31:50Z 2023-06-21T06:31:50Z 2023 Journal Article Peng, M. & Xiang, L. (2023). Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers. Statistics in Medicine, 42(13), 2082-2100. https://dx.doi.org/10.1002/sim.9712 0277-6715 https://hdl.handle.net/10356/168891 10.1002/sim.9712 36951373 2-s2.0-85150942871 13 42 2082 2100 en RG98/20 MOE-T2EP20121-0004 Statistics in Medicine © 2023 John Wily & Sons Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Mathematics Conditional Survival Function Correlation Rank |
spellingShingle |
Science::Mathematics Conditional Survival Function Correlation Rank Peng, Mengjiao Xiang, Liming Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
description |
The increased availability of ultrahigh-dimensional biomarker data and the high demand of identifying biomarkers importantly related to survival outcomes made feature screening methods commonplace in the analysis of cancer genome data. When survival outcomes include endpoints of overall survival (OS) and time-to-progression (TTP), a high concordance is typically found in both endpoints in cancer studies, namely, patients' OS would most likely be extended when tumour progression is delayed. Existing screening procedures are often performed on a single survival endpoint only and may result in biased selection of features for OS in ignorance of disease progression. We propose a novel feature screening method by incorporating information of TTP into the selection of important biomarker predictors for more accurate inference of OS subsequent to disease progression. The proposal is based on the rank of correlation between individual features and the conditional distribution of OS given observations of TTP. It is advantageous for its flexible model nature, which requires no marginal model assumption for each endpoint, and its minimal computational cost for implementation. Theoretical results show its ranking consistency, sure screening and false rate control properties. Simulation results demonstrate that the proposed screener leads to more accurate feature selection than the method without considering the prior observations of disease progression. An application to breast cancer genome data illustrates its practical utility and facilitates disease classification using selected biomarker predictors. |
author2 |
School of Physical and Mathematical Sciences |
author_facet |
School of Physical and Mathematical Sciences Peng, Mengjiao Xiang, Liming |
format |
Article |
author |
Peng, Mengjiao Xiang, Liming |
author_sort |
Peng, Mengjiao |
title |
Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
title_short |
Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
title_full |
Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
title_fullStr |
Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
title_full_unstemmed |
Disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
title_sort |
disease progression based feature screening for ultrahigh-dimensional survival-associated biomarkers |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/168891 |
_version_ |
1772826725176049664 |