Information Recovery in a Study with Surrogate Endpoints
Recently, there has been a lot of interest in statistical methods for analyzing data with surrogate endpoints. In this article, we consider parameter estimation from a model that relates a variable Y to a set of covariates, X, in the presence of a surrogate, S. We assume that the data are made up of...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2003
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/soe_research/159 https://ink.library.smu.edu.sg/context/soe_research/article/1158/viewcontent/Information_Recovery_in_a_Study_With_Surrogate_Endpoints.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.soe_research-1158 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.soe_research-11582018-05-07T02:16:54Z Information Recovery in a Study with Surrogate Endpoints CHEN, Song Xi LEUNG, Denis H. Y. QIN, Jing Recently, there has been a lot of interest in statistical methods for analyzing data with surrogate endpoints. In this article, we consider parameter estimation from a model that relates a variable Y to a set of covariates, X, in the presence of a surrogate, S. We assume that the data are made up of two random samples from the population, a validation set where (Y, X, S) are observed on every subject and a nonvalidation set where only (X, S) are measured. We show how information from the nonvalidation set can be incorporated to improve upon estimation of a parameter using the validation data only. The method we suggest does not require knowledge on the joint distribution between (Y, S), given X. It is based on a two-sample empirical likelihood that simultaneously combines the estimating equations from the validation set and the nonvalidation set. The proposed nonparametric likelihood formulation brings a few attractive features to the inference in . First, the maximum empirical likelihood estimate is more efficient than that using only the validation sample. Second, confidence regions can be readily constructed without the need to estimate the variance-covariance matrix. Finally, the coverage of the confidence regions can be further improved by an empirical Bartlett correction based on the bootstrap. We show that the method gives favorable results in simulation studies. 2003-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/soe_research/159 info:doi/10.1198/016214503000000972 https://ink.library.smu.edu.sg/context/soe_research/article/1158/viewcontent/Information_Recovery_in_a_Study_With_Surrogate_Endpoints.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Economics eng Institutional Knowledge at Singapore Management University Auxiliary outcome Bartlett correction Bootstrap Confidence regions Empirical likelihood Estimating equations Surrogate end point Econometrics |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Auxiliary outcome Bartlett correction Bootstrap Confidence regions Empirical likelihood Estimating equations Surrogate end point Econometrics |
spellingShingle |
Auxiliary outcome Bartlett correction Bootstrap Confidence regions Empirical likelihood Estimating equations Surrogate end point Econometrics CHEN, Song Xi LEUNG, Denis H. Y. QIN, Jing Information Recovery in a Study with Surrogate Endpoints |
description |
Recently, there has been a lot of interest in statistical methods for analyzing data with surrogate endpoints. In this article, we consider parameter estimation from a model that relates a variable Y to a set of covariates, X, in the presence of a surrogate, S. We assume that the data are made up of two random samples from the population, a validation set where (Y, X, S) are observed on every subject and a nonvalidation set where only (X, S) are measured. We show how information from the nonvalidation set can be incorporated to improve upon estimation of a parameter using the validation data only. The method we suggest does not require knowledge on the joint distribution between (Y, S), given X. It is based on a two-sample empirical likelihood that simultaneously combines the estimating equations from the validation set and the nonvalidation set. The proposed nonparametric likelihood formulation brings a few attractive features to the inference in . First, the maximum empirical likelihood estimate is more efficient than that using only the validation sample. Second, confidence regions can be readily constructed without the need to estimate the variance-covariance matrix. Finally, the coverage of the confidence regions can be further improved by an empirical Bartlett correction based on the bootstrap. We show that the method gives favorable results in simulation studies. |
format |
text |
author |
CHEN, Song Xi LEUNG, Denis H. Y. QIN, Jing |
author_facet |
CHEN, Song Xi LEUNG, Denis H. Y. QIN, Jing |
author_sort |
CHEN, Song Xi |
title |
Information Recovery in a Study with Surrogate Endpoints |
title_short |
Information Recovery in a Study with Surrogate Endpoints |
title_full |
Information Recovery in a Study with Surrogate Endpoints |
title_fullStr |
Information Recovery in a Study with Surrogate Endpoints |
title_full_unstemmed |
Information Recovery in a Study with Surrogate Endpoints |
title_sort |
information recovery in a study with surrogate endpoints |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2003 |
url |
https://ink.library.smu.edu.sg/soe_research/159 https://ink.library.smu.edu.sg/context/soe_research/article/1158/viewcontent/Information_Recovery_in_a_Study_With_Surrogate_Endpoints.pdf |
_version_ |
1770569043447644160 |