Semiparametric prevalence estimation from a two-phase survey

This paper studies a semi-parametric method for estimating the prevalence of a binary outcome using a two-phase survey. The motivation for a two-phase survey is, due to time, money and ethical considerations, it is impossible to carry out comprehensive evaluation on all subjects in a large random sa...

Full description

Saved in:
Bibliographic Details
Main Authors: LEUNG, Denis H. Y., QIN, J
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2009
Subjects:
Online Access:https://ink.library.smu.edu.sg/soe_research/2088
https://ink.library.smu.edu.sg/context/soe_research/article/3088/viewcontent/semiparametric.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:This paper studies a semi-parametric method for estimating the prevalence of a binary outcome using a two-phase survey. The motivation for a two-phase survey is, due to time, money and ethical considerations, it is impossible to carry out comprehensive evaluation on all subjects in a large random sample of the population. Rather, a relatively inexpensive "screening test" is given to all subjects in the random sample and only individuals more likely to have a positive outcome (cases) will be selected for a further "gold standard" test to verify the outcome. Therefore, individuals with verified outcome form a non-random sample from the population and care must be taken when the data are used for estimating the prevalence of the outcome. This paper proposes a semi-parametric method for estimating the outcome prevalence. It requires only an estimate of the probability of selection into the second phase, given the first phase data. This feature is desirable as in most cases, the probability of selection into the second phase is under the control of the researchers, and even when it is not, can be easily estimated given the data. The proposed method uses the empirical likelihood approach (Owen, 1988), which yields consistent prevalence estimates as long as the probability of selection into the second phase is correctly modeled.