Is predicted data a viable alternative to real data?

It is costly to collect the household- andindividual-level data that underlies official estimates of poverty and health. Forthis reason, developing countries often do not have the budget to update their estimatesof poverty and health regularly, even though these estimates are most neededthere. One w...

Full description

Saved in:
Bibliographic Details
Main Authors: FUJII, Tomoki, VAN DER WEIDE, Roy
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/soe_research/2296
https://ink.library.smu.edu.sg/context/soe_research/article/3295/viewcontent/Is_Predicted_Data_a_Viable_Alternative.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.soe_research-3295
record_format dspace
spelling sg-smu-ink.soe_research-32952019-10-03T08:12:36Z Is predicted data a viable alternative to real data? FUJII, Tomoki VAN DER WEIDE, Roy It is costly to collect the household- andindividual-level data that underlies official estimates of poverty and health. Forthis reason, developing countries often do not have the budget to update their estimatesof poverty and health regularly, even though these estimates are most neededthere. One way to reduce the financial burden is to substitute some of the realdata with predicted data. An approach referred to as double sampling collectsthe expensive outcome variable for a sub-sample only while collecting thecovariates used for prediction for the full sample. The objective of this studyis to determine if this would indeed allow for realizing meaningful reductionsin financial costs while preserving statistical precision. The study does thisusing analytical calculations that allow for considering a wide range of parametervalues that are plausible to real applications. The benefits of using double samplingare found to be modest. There are circumstances for which the gains can be moresubstantial, but the study conjectures that these denote the exceptions ratherthan the rule. The recommendation is to rely on real data whenever there is aneed for new data, and use the prediction estimator to leverage existing data. 2016-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/soe_research/2296 https://ink.library.smu.edu.sg/context/soe_research/article/3295/viewcontent/Is_Predicted_Data_a_Viable_Alternative.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Economics eng Institutional Knowledge at Singapore Management University Prediction Double sampling Survey costs Poverty Income Distribution Public Economics
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Prediction
Double sampling
Survey costs
Poverty
Income Distribution
Public Economics
spellingShingle Prediction
Double sampling
Survey costs
Poverty
Income Distribution
Public Economics
FUJII, Tomoki
VAN DER WEIDE, Roy
Is predicted data a viable alternative to real data?
description It is costly to collect the household- andindividual-level data that underlies official estimates of poverty and health. Forthis reason, developing countries often do not have the budget to update their estimatesof poverty and health regularly, even though these estimates are most neededthere. One way to reduce the financial burden is to substitute some of the realdata with predicted data. An approach referred to as double sampling collectsthe expensive outcome variable for a sub-sample only while collecting thecovariates used for prediction for the full sample. The objective of this studyis to determine if this would indeed allow for realizing meaningful reductionsin financial costs while preserving statistical precision. The study does thisusing analytical calculations that allow for considering a wide range of parametervalues that are plausible to real applications. The benefits of using double samplingare found to be modest. There are circumstances for which the gains can be moresubstantial, but the study conjectures that these denote the exceptions ratherthan the rule. The recommendation is to rely on real data whenever there is aneed for new data, and use the prediction estimator to leverage existing data.
format text
author FUJII, Tomoki
VAN DER WEIDE, Roy
author_facet FUJII, Tomoki
VAN DER WEIDE, Roy
author_sort FUJII, Tomoki
title Is predicted data a viable alternative to real data?
title_short Is predicted data a viable alternative to real data?
title_full Is predicted data a viable alternative to real data?
title_fullStr Is predicted data a viable alternative to real data?
title_full_unstemmed Is predicted data a viable alternative to real data?
title_sort is predicted data a viable alternative to real data?
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/soe_research/2296
https://ink.library.smu.edu.sg/context/soe_research/article/3295/viewcontent/Is_Predicted_Data_a_Viable_Alternative.pdf
_version_ 1770574798099841024