A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime

In this paper we propose a smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes. In contrast to the Q-learning algorithm in which non-regular inference is involved, we show that under assumptions adopted in this paper, the proposed smoothed Q-learning estimator is asymptoti...

Full description

Saved in:

Bibliographic Details
Main Authors:	FAN, Yanqin, HE, Ming, SU, Liangjun, ZHOU, Xiao-Hua
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2019
Subjects:	Asymptotic normality Exceptional law Optimal smoothing parameter Sequential randomization Wald-type inference Econometrics
Online Access:	https://ink.library.smu.edu.sg/soe_research/2044 https://ink.library.smu.edu.sg/context/soe_research/article/3043/viewcontent/Smoothed_Q_learning_algorithm_for_estimating_optimal_2016_pp.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.soe_research-3043
record_format	dspace
spelling	sg-smu-ink.soe_research-30432019-05-03T03:26:47Z A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime FAN, Yanqin HE, Ming SU, Liangjun ZHOU, Xiao-Hua In this paper we propose a smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes. In contrast to the Q-learning algorithm in which non-regular inference is involved, we show that under assumptions adopted in this paper, the proposed smoothed Q-learning estimator is asymptotically normally distributed even when the Q-learning estimator is not and its asymptotic variance can be consistently estimated. As a result, inference based on the smoothed Q-learning estimator is standard. We derive the optimal smoothing parameter and propose a data-driven method for estimating it. The ﬁnite sample properties of the smoothed Q-learning estimator are studied and compared with several existing estimators including the Q-learning estimator via an extensive simulation study. We illustrate the new method by analyzing data from the Clinical Antipsychotic Trials of Intervention EﬀectivenessAlzheimer’s Disease (CATIE-AD) study. 2019-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/soe_research/2044 info:doi/10.1111/sjos.12359 https://ink.library.smu.edu.sg/context/soe_research/article/3043/viewcontent/Smoothed_Q_learning_algorithm_for_estimating_optimal_2016_pp.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Economics eng Institutional Knowledge at Singapore Management University Asymptotic normality Exceptional law Optimal smoothing parameter Sequential randomization Wald-type inference Econometrics
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Asymptotic normality Exceptional law Optimal smoothing parameter Sequential randomization Wald-type inference Econometrics
spellingShingle	Asymptotic normality Exceptional law Optimal smoothing parameter Sequential randomization Wald-type inference Econometrics FAN, Yanqin HE, Ming SU, Liangjun ZHOU, Xiao-Hua A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime
description	In this paper we propose a smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes. In contrast to the Q-learning algorithm in which non-regular inference is involved, we show that under assumptions adopted in this paper, the proposed smoothed Q-learning estimator is asymptotically normally distributed even when the Q-learning estimator is not and its asymptotic variance can be consistently estimated. As a result, inference based on the smoothed Q-learning estimator is standard. We derive the optimal smoothing parameter and propose a data-driven method for estimating it. The ﬁnite sample properties of the smoothed Q-learning estimator are studied and compared with several existing estimators including the Q-learning estimator via an extensive simulation study. We illustrate the new method by analyzing data from the Clinical Antipsychotic Trials of Intervention EﬀectivenessAlzheimer’s Disease (CATIE-AD) study.
format	text
author	FAN, Yanqin HE, Ming SU, Liangjun ZHOU, Xiao-Hua
author_facet	FAN, Yanqin HE, Ming SU, Liangjun ZHOU, Xiao-Hua
author_sort	FAN, Yanqin
title	A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime
title_short	A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime
title_full	A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime
title_fullStr	A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime
title_full_unstemmed	A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime
title_sort	smoothed q-learning algorithm for estimating optimal dynamic treatment regime
publisher	Institutional Knowledge at Singapore Management University
publishDate	2019
url	https://ink.library.smu.edu.sg/soe_research/2044 https://ink.library.smu.edu.sg/context/soe_research/article/3043/viewcontent/Smoothed_Q_learning_algorithm_for_estimating_optimal_2016_pp.pdf
_version_	1770573615961473024

A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime

Similar Items