Sparse supervised principal component analysis for survival models
Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failur...
محفوظ في:
المؤلف الرئيسي: | |
---|---|
مؤلفون آخرون: | |
التنسيق: | Final Year Project |
اللغة: | English |
منشور في: |
Nanyang Technological University
2022
|
الموضوعات: | |
الوصول للمادة أونلاين: | https://hdl.handle.net/10356/156924 |
الوسوم: |
إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
|
المؤسسة: | Nanyang Technological University |
اللغة: | English |
الملخص: | Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failure Time models (SSPCA-AFT), which formulates sparse principal components with maximum dependency on the survival outcome of interest, through the optimisation of a penalised negative log-likelihood and PCA loss function. An adaptive sparse regularisation on the principal component loadings is also proposed through the adaptive SSPCA-AFT (aSSPCA-AFT) method, which was shown to incorporate sparsity more effectively than SSPCA-AFT. The proposed methods were illustrated with simulation studies and two real datasets - the Primary Biliary Cirrhosis (PBC) and Systolic Heart Failure (Peak VO2) datasets. The empirical results show that our proposed method is competitive with existing methods in terms of variable selection and prediction accuracy, with the added advantage of interpretability. Overall, we have proposed a suitable and effective method to overcome the problem of overfitting in survival prediction. |
---|