Bayesian variable selection using Knockoffs with applications to genomics

Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistan...

Full description

Saved in:
Bibliographic Details
Main Authors: Yap, Jurel K, Gauran, Iris Ivy M
Format: text
Published: Archīum Ateneo 2022
Subjects:
Online Access:https://archium.ateneo.edu/asog-pubs/239
https://doi.org/10.1007/s00180-022-01283-8
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Ateneo De Manila University
id ph-ateneo-arc.asog-pubs-1240
record_format eprints
spelling ph-ateneo-arc.asog-pubs-12402022-12-01T07:00:43Z Bayesian variable selection using Knockoffs with applications to genomics Yap, Jurel K Gauran, Iris Ivy M Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistance. In this study, we propose a multiple testing procedure that unifies key concepts in computational statistics, namely Model-free Knockoffs, Bayesian variable selection, and the local false discovery rate. We develop an algorithm that utilizes the augmented data-Knockoff matrix and implement Bayesian Lasso. We then identify signals using test statistics based on Markov Chain Monte Carlo outputs and local false discovery rate. We test our proposed methods against non-bayesian methods such as Benjamini–Hochberg (BHq) and Lasso regression in terms TPR and FDR. Using numerical studies, we show the proposed method yields lower FDR compared to BHq and Lasso for certain cases, such as for low and equi-dimensional cases. We also discuss an application to an HIV-1 data set, which aims to be applied analyzing genetic markers linked to drug resistant HIV in the Philippines in future work. 2022-01-01T08:00:00Z text https://archium.ateneo.edu/asog-pubs/239 https://doi.org/10.1007/s00180-022-01283-8 Ateneo School of Government Publications Archīum Ateneo Bayesian variable selection Model-free Knockoffs False discovery control Drug resistant HIV-1 Mathematics Medicine and Health Sciences Physical Sciences and Mathematics
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic Bayesian variable selection
Model-free Knockoffs
False discovery control
Drug resistant HIV-1
Mathematics
Medicine and Health Sciences
Physical Sciences and Mathematics
spellingShingle Bayesian variable selection
Model-free Knockoffs
False discovery control
Drug resistant HIV-1
Mathematics
Medicine and Health Sciences
Physical Sciences and Mathematics
Yap, Jurel K
Gauran, Iris Ivy M
Bayesian variable selection using Knockoffs with applications to genomics
description Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistance. In this study, we propose a multiple testing procedure that unifies key concepts in computational statistics, namely Model-free Knockoffs, Bayesian variable selection, and the local false discovery rate. We develop an algorithm that utilizes the augmented data-Knockoff matrix and implement Bayesian Lasso. We then identify signals using test statistics based on Markov Chain Monte Carlo outputs and local false discovery rate. We test our proposed methods against non-bayesian methods such as Benjamini–Hochberg (BHq) and Lasso regression in terms TPR and FDR. Using numerical studies, we show the proposed method yields lower FDR compared to BHq and Lasso for certain cases, such as for low and equi-dimensional cases. We also discuss an application to an HIV-1 data set, which aims to be applied analyzing genetic markers linked to drug resistant HIV in the Philippines in future work.
format text
author Yap, Jurel K
Gauran, Iris Ivy M
author_facet Yap, Jurel K
Gauran, Iris Ivy M
author_sort Yap, Jurel K
title Bayesian variable selection using Knockoffs with applications to genomics
title_short Bayesian variable selection using Knockoffs with applications to genomics
title_full Bayesian variable selection using Knockoffs with applications to genomics
title_fullStr Bayesian variable selection using Knockoffs with applications to genomics
title_full_unstemmed Bayesian variable selection using Knockoffs with applications to genomics
title_sort bayesian variable selection using knockoffs with applications to genomics
publisher Archīum Ateneo
publishDate 2022
url https://archium.ateneo.edu/asog-pubs/239
https://doi.org/10.1007/s00180-022-01283-8
_version_ 1751550473624616960