Sparse supervised principal component analysis for survival models
Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failur...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156924 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156924 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1569242023-02-28T23:12:06Z Sparse supervised principal component analysis for survival models Poh, Charissa Li Ann Xiang Liming School of Physical and Mathematical Sciences LMXiang@ntu.edu.sg Science::Mathematics::Statistics Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failure Time models (SSPCA-AFT), which formulates sparse principal components with maximum dependency on the survival outcome of interest, through the optimisation of a penalised negative log-likelihood and PCA loss function. An adaptive sparse regularisation on the principal component loadings is also proposed through the adaptive SSPCA-AFT (aSSPCA-AFT) method, which was shown to incorporate sparsity more effectively than SSPCA-AFT. The proposed methods were illustrated with simulation studies and two real datasets - the Primary Biliary Cirrhosis (PBC) and Systolic Heart Failure (Peak VO2) datasets. The empirical results show that our proposed method is competitive with existing methods in terms of variable selection and prediction accuracy, with the added advantage of interpretability. Overall, we have proposed a suitable and effective method to overcome the problem of overfitting in survival prediction. Bachelor of Science in Mathematical Sciences 2022-04-28T11:26:23Z 2022-04-28T11:26:23Z 2022 Final Year Project (FYP) Poh, C. L. A. (2022). Sparse supervised principal component analysis for survival models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156924 https://hdl.handle.net/10356/156924 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Mathematics::Statistics |
spellingShingle |
Science::Mathematics::Statistics Poh, Charissa Li Ann Sparse supervised principal component analysis for survival models |
description |
Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failure Time models (SSPCA-AFT), which formulates sparse principal components with maximum dependency on the survival outcome of interest, through the optimisation of a penalised negative log-likelihood and PCA loss function. An adaptive sparse regularisation on the principal component loadings is also proposed through the adaptive SSPCA-AFT (aSSPCA-AFT) method, which was shown to incorporate sparsity more effectively than SSPCA-AFT. The proposed methods were illustrated with simulation studies and two real datasets - the Primary Biliary Cirrhosis (PBC) and Systolic Heart Failure (Peak VO2) datasets. The empirical results show that our proposed method is competitive with existing methods in terms of variable selection and prediction accuracy, with the added advantage of interpretability. Overall, we have proposed a suitable and effective method to overcome the problem of overfitting in survival prediction. |
author2 |
Xiang Liming |
author_facet |
Xiang Liming Poh, Charissa Li Ann |
format |
Final Year Project |
author |
Poh, Charissa Li Ann |
author_sort |
Poh, Charissa Li Ann |
title |
Sparse supervised principal component analysis for survival models |
title_short |
Sparse supervised principal component analysis for survival models |
title_full |
Sparse supervised principal component analysis for survival models |
title_fullStr |
Sparse supervised principal component analysis for survival models |
title_full_unstemmed |
Sparse supervised principal component analysis for survival models |
title_sort |
sparse supervised principal component analysis for survival models |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156924 |
_version_ |
1759853645820592128 |