Is Predicted Data a Viable Alternative to Real Data?
It is costly to collect the household- and individual-level data that underlies official estimates of poverty and health. For this reason, developing countries often do not have the budget to update their estimates of poverty and health regularly, even though these estimates are most needed there. O...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2016
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/soe_research/2295 https://ink.library.smu.edu.sg/context/soe_research/article/3294/viewcontent/WPS7841.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.soe_research-3294 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.soe_research-32942019-10-01T08:26:26Z Is Predicted Data a Viable Alternative to Real Data? FUJII, Tomoki VAN DER WEIDE, Roy It is costly to collect the household- and individual-level data that underlies official estimates of poverty and health. For this reason, developing countries often do not have the budget to update their estimates of poverty and health regularly, even though these estimates are most needed there. One way to reduce the financial burden is to substitute some of the real data with predicted data. An approach referred to as double sampling collects the expensive outcome variable for a sub-sample only while collecting the covariates used for prediction for the full sample. The objective of this study is to determine if this would indeed allow for realizing meaningful reductions in financial costs while preserving statistical precision. The study does this using analytical calculations that allow for considering a wide range of parameter values that are plausible to real applications. The benefits of using double sampling are found to be modest. There are circumstances for which the gains can be more substantial, but the study conjectures that these denote the exceptions rather than the rule. The recommendation is to rely on real data whenever there is a need for new data, and use the prediction estimator to leverage existing data. 2016-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/soe_research/2295 https://ink.library.smu.edu.sg/context/soe_research/article/3294/viewcontent/WPS7841.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Economics eng Institutional Knowledge at Singapore Management University Prediction Double sampling Survey costs Poverty Income Distribution Public Economics |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Prediction Double sampling Survey costs Poverty Income Distribution Public Economics |
spellingShingle |
Prediction Double sampling Survey costs Poverty Income Distribution Public Economics FUJII, Tomoki VAN DER WEIDE, Roy Is Predicted Data a Viable Alternative to Real Data? |
description |
It is costly to collect the household- and individual-level data that underlies official estimates of poverty and health. For this reason, developing countries often do not have the budget to update their estimates of poverty and health regularly, even though these estimates are most needed there. One way to reduce the financial burden is to substitute some of the real data with predicted data. An approach referred to as double sampling collects the expensive outcome variable for a sub-sample only while collecting the covariates used for prediction for the full sample. The objective of this study is to determine if this would indeed allow for realizing meaningful reductions in financial costs while preserving statistical precision. The study does this using analytical calculations that allow for considering a wide range of parameter values that are plausible to real applications. The benefits of using double sampling are found to be modest. There are circumstances for which the gains can be more substantial, but the study conjectures that these denote the exceptions rather than the rule. The recommendation is to rely on real data whenever there is a need for new data, and use the prediction estimator to leverage existing data. |
format |
text |
author |
FUJII, Tomoki VAN DER WEIDE, Roy |
author_facet |
FUJII, Tomoki VAN DER WEIDE, Roy |
author_sort |
FUJII, Tomoki |
title |
Is Predicted Data a Viable Alternative to Real Data? |
title_short |
Is Predicted Data a Viable Alternative to Real Data? |
title_full |
Is Predicted Data a Viable Alternative to Real Data? |
title_fullStr |
Is Predicted Data a Viable Alternative to Real Data? |
title_full_unstemmed |
Is Predicted Data a Viable Alternative to Real Data? |
title_sort |
is predicted data a viable alternative to real data? |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2016 |
url |
https://ink.library.smu.edu.sg/soe_research/2295 https://ink.library.smu.edu.sg/context/soe_research/article/3294/viewcontent/WPS7841.pdf |
_version_ |
1770574797702430720 |