Information Recovery in a Study with Surrogate Endpoints

Recently, there has been a lot of interest in statistical methods for analyzing data with surrogate endpoints. In this article, we consider parameter estimation from a model that relates a variable Y to a set of covariates, X, in the presence of a surrogate, S. We assume that the data are made up of...

Full description

Saved in:
Bibliographic Details
Main Authors: CHEN, Song Xi, LEUNG, Denis H. Y., QIN, Jing
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2003
Subjects:
Online Access:https://ink.library.smu.edu.sg/soe_research/159
https://ink.library.smu.edu.sg/context/soe_research/article/1158/viewcontent/Information_Recovery_in_a_Study_With_Surrogate_Endpoints.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.soe_research-1158
record_format dspace
spelling sg-smu-ink.soe_research-11582018-05-07T02:16:54Z Information Recovery in a Study with Surrogate Endpoints CHEN, Song Xi LEUNG, Denis H. Y. QIN, Jing Recently, there has been a lot of interest in statistical methods for analyzing data with surrogate endpoints. In this article, we consider parameter estimation from a model that relates a variable Y to a set of covariates, X, in the presence of a surrogate, S. We assume that the data are made up of two random samples from the population, a validation set where (Y, X, S) are observed on every subject and a nonvalidation set where only (X, S) are measured. We show how information from the nonvalidation set can be incorporated to improve upon estimation of a parameter using the validation data only. The method we suggest does not require knowledge on the joint distribution between (Y, S), given X. It is based on a two-sample empirical likelihood that simultaneously combines the estimating equations from the validation set and the nonvalidation set. The proposed nonparametric likelihood formulation brings a few attractive features to the inference in . First, the maximum empirical likelihood estimate is more efficient than that using only the validation sample. Second, confidence regions can be readily constructed without the need to estimate the variance-covariance matrix. Finally, the coverage of the confidence regions can be further improved by an empirical Bartlett correction based on the bootstrap. We show that the method gives favorable results in simulation studies. 2003-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/soe_research/159 info:doi/10.1198/016214503000000972 https://ink.library.smu.edu.sg/context/soe_research/article/1158/viewcontent/Information_Recovery_in_a_Study_With_Surrogate_Endpoints.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Economics eng Institutional Knowledge at Singapore Management University Auxiliary outcome Bartlett correction Bootstrap Confidence regions Empirical likelihood Estimating equations Surrogate end point Econometrics
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Auxiliary outcome
Bartlett correction
Bootstrap
Confidence regions
Empirical likelihood
Estimating equations
Surrogate end point
Econometrics
spellingShingle Auxiliary outcome
Bartlett correction
Bootstrap
Confidence regions
Empirical likelihood
Estimating equations
Surrogate end point
Econometrics
CHEN, Song Xi
LEUNG, Denis H. Y.
QIN, Jing
Information Recovery in a Study with Surrogate Endpoints
description Recently, there has been a lot of interest in statistical methods for analyzing data with surrogate endpoints. In this article, we consider parameter estimation from a model that relates a variable Y to a set of covariates, X, in the presence of a surrogate, S. We assume that the data are made up of two random samples from the population, a validation set where (Y, X, S) are observed on every subject and a nonvalidation set where only (X, S) are measured. We show how information from the nonvalidation set can be incorporated to improve upon estimation of a parameter using the validation data only. The method we suggest does not require knowledge on the joint distribution between (Y, S), given X. It is based on a two-sample empirical likelihood that simultaneously combines the estimating equations from the validation set and the nonvalidation set. The proposed nonparametric likelihood formulation brings a few attractive features to the inference in . First, the maximum empirical likelihood estimate is more efficient than that using only the validation sample. Second, confidence regions can be readily constructed without the need to estimate the variance-covariance matrix. Finally, the coverage of the confidence regions can be further improved by an empirical Bartlett correction based on the bootstrap. We show that the method gives favorable results in simulation studies.
format text
author CHEN, Song Xi
LEUNG, Denis H. Y.
QIN, Jing
author_facet CHEN, Song Xi
LEUNG, Denis H. Y.
QIN, Jing
author_sort CHEN, Song Xi
title Information Recovery in a Study with Surrogate Endpoints
title_short Information Recovery in a Study with Surrogate Endpoints
title_full Information Recovery in a Study with Surrogate Endpoints
title_fullStr Information Recovery in a Study with Surrogate Endpoints
title_full_unstemmed Information Recovery in a Study with Surrogate Endpoints
title_sort information recovery in a study with surrogate endpoints
publisher Institutional Knowledge at Singapore Management University
publishDate 2003
url https://ink.library.smu.edu.sg/soe_research/159
https://ink.library.smu.edu.sg/context/soe_research/article/1158/viewcontent/Information_Recovery_in_a_Study_With_Surrogate_Endpoints.pdf
_version_ 1770569043447644160