Sparse supervised principal component analysis for survival models

Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failur...

Full description

Saved in:
Bibliographic Details
Main Author: Poh, Charissa Li Ann
Other Authors: Xiang Liming
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156924
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156924
record_format dspace
spelling sg-ntu-dr.10356-1569242023-02-28T23:12:06Z Sparse supervised principal component analysis for survival models Poh, Charissa Li Ann Xiang Liming School of Physical and Mathematical Sciences LMXiang@ntu.edu.sg Science::Mathematics::Statistics Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failure Time models (SSPCA-AFT), which formulates sparse principal components with maximum dependency on the survival outcome of interest, through the optimisation of a penalised negative log-likelihood and PCA loss function. An adaptive sparse regularisation on the principal component loadings is also proposed through the adaptive SSPCA-AFT (aSSPCA-AFT) method, which was shown to incorporate sparsity more effectively than SSPCA-AFT. The proposed methods were illustrated with simulation studies and two real datasets - the Primary Biliary Cirrhosis (PBC) and Systolic Heart Failure (Peak VO2) datasets. The empirical results show that our proposed method is competitive with existing methods in terms of variable selection and prediction accuracy, with the added advantage of interpretability. Overall, we have proposed a suitable and effective method to overcome the problem of overfitting in survival prediction. Bachelor of Science in Mathematical Sciences 2022-04-28T11:26:23Z 2022-04-28T11:26:23Z 2022 Final Year Project (FYP) Poh, C. L. A. (2022). Sparse supervised principal component analysis for survival models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156924 https://hdl.handle.net/10356/156924 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics::Statistics
spellingShingle Science::Mathematics::Statistics
Poh, Charissa Li Ann
Sparse supervised principal component analysis for survival models
description Survival prediction plays a vital role in biomedical research, but the large number of patient characteristics considered as covariates raises the concern about overfitting leading to poor prediction accuracy. To address this, we propose a sparse supervised PCA method for censored Accelerated Failure Time models (SSPCA-AFT), which formulates sparse principal components with maximum dependency on the survival outcome of interest, through the optimisation of a penalised negative log-likelihood and PCA loss function. An adaptive sparse regularisation on the principal component loadings is also proposed through the adaptive SSPCA-AFT (aSSPCA-AFT) method, which was shown to incorporate sparsity more effectively than SSPCA-AFT. The proposed methods were illustrated with simulation studies and two real datasets - the Primary Biliary Cirrhosis (PBC) and Systolic Heart Failure (Peak VO2) datasets. The empirical results show that our proposed method is competitive with existing methods in terms of variable selection and prediction accuracy, with the added advantage of interpretability. Overall, we have proposed a suitable and effective method to overcome the problem of overfitting in survival prediction.
author2 Xiang Liming
author_facet Xiang Liming
Poh, Charissa Li Ann
format Final Year Project
author Poh, Charissa Li Ann
author_sort Poh, Charissa Li Ann
title Sparse supervised principal component analysis for survival models
title_short Sparse supervised principal component analysis for survival models
title_full Sparse supervised principal component analysis for survival models
title_fullStr Sparse supervised principal component analysis for survival models
title_full_unstemmed Sparse supervised principal component analysis for survival models
title_sort sparse supervised principal component analysis for survival models
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156924
_version_ 1759853645820592128