A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime

In this paper we propose a smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes. In contrast to the Q-learning algorithm in which non-regular inference is involved, we show that under assumptions adopted in this paper, the proposed smoothed Q-learning estimator is asymptoti...

Bibliographic Details
Main Authors:	FAN, Yanqin; HE, Ming; SU, Liangjun; ZHOU, Xiao-Hua
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2019
Subjects:	Asymptotic normality; Exceptional law; Optimal smoothing parameter; Sequential randomization; Wald-type inference; Econometrics
Online Access:	https://ink.library.smu.edu.sg/soe_research/2044 https://ink.library.smu.edu.sg/context/soe_research/article/3043/viewcontent/Smoothed_Q_learning_algorithm_for_estimating_optimal_2016_pp.pdf
Institution:	Singapore Management University

Similar Items

Optimal Estimation under Nonstandard Conditions
By: PLOBERGER, Werner, et al.
Published: (2012)

On the Asymptotic Effect of Substituting Estimators for Nuisance Parameters in Inferential Statistics
By: YANG, Zhenlin, et al.
Published: (2003)

A new statistic for regression transformation
By: YANG, Zhenlin
Published: (2000)

Power Maximization and Size Control in Heteroskedasticity and Autocorrelation Robust Tests with Exponentiated Kernels
By: SUN, Yixiao, et al.
Published: (2009)

SEQUENTIAL MONTE CARLO ALGORITHMS FOR HIGH-DIMENSIONAL FILTERING AND SMOOTHING
By: XU YAXIAN
Published: (2018)

A sequential constant-stress accelerated life testing scheme and its Bayesian inference
By: Liu, X., et al.
Published: (2014)

Exception analysis for non-strict languages
By: Glynn, K., et al.
Published: (2013)

Power Maximization and Size Control of Heteroscedasticity and Autocorrelation Robust Tests with Exponentiated Kernels
By: SUN, Yixiao, et al.
Published: (2011)

Approximate Bayesian Computation for Smoothing
By: Martin, J.S., et al.
Published: (2016)

A practical test for strict exogeneity in linear panel data models with fixed effects
By: SU, Liangjun, et al.
Published: (2016)

Planning and inference of sequential accelerated life tests
By: LIU XIAO
Published: (2010)

Likelihood computation for hidden Markov models via generalized two-filter smoothing
By: Persing, A., et al.
Published: (2016)

Computing maximum smoothness forward rate curves
By: Lim, K.G., et al.
Published: (2013)

COMPUTATIONAL METHODS FOR MULTIPLE TARGET TRACKING
By: ZENG JIAJIE
Published: (2020)

Fixed Domain Asymptotics and Consistent Estimation for Gaussian Random Field Models in Spatial Statistics and Computer Experiments
By: WANG DAQING
Published: (2011)

The experience of some OECD economies on tax smoothing
By: Jayawickrama, A., et al.
Published: (2014)

Asymptotic theory for explosive fractional Ornstein–Uhlenbeck processes
By: JIANG, Hui, et al.
Published: (2023)

On the ranked-set sampling M-estimates for symmetric location families
By: Zhao, X., et al.
Published: (2014)

A new statistic for regression transformation
By: Yang, Z.
Published: (2014)

Calculating sized types
By: Chin, W.-N., et al.
Published: (2013)

Mean and Autocovariance Function Estimation Near the Boundary of Stationarity
By: GIRAITIS, Liudas, et al.
Published: (2012)

Smooth convex approximation and its applications
By: SHI SHENGYUAN
Published: (2010)

The optimal ranked-set sampling scheme for inference on population quantiles
By: Chen, Z.
Published: (2014)

Threshold regression asymptotics: From the compound Poisson process to two-sided Brownian motion
By: YU, Ping, et al.
Published: (2018)

EMERGENCY AND MODERNITY: CONTEXTUALIZING THE CONTEMPORARY DEBATE
By: PETER DAVID FINN
Published: (2020)

The grid bootstrap for continuous time models
By: LUI, Yiu Lim, et al.
Published: (2022)

Set Inference for Semiparametric Discrete Games
By: KIM, Kyoo-il
Published: (2006)

DETECTION OF SPARSE CHANGE-POINTS IN HIGH-DIMENSIONAL DATA
By: HUANG JINGYAN
Published: (2023)

Asymptotics and bootstrap for random-effects panel data transformation models
By: SU, Liangjun, et al.
Published: (2018)

Statistical inferences for functional data
By: Zhang, J.-T., et al.
Published: (2014)

Asymptotic normality of scaling functions
By: Chen, L.H.Y., et al.
Published: (2014)

Permutation-based tests for discontinuities in event studies
By: BUGNI, Federico, et al.
Published: (2022)

Relevance weighted likelihood for dependent data
By: Hu, F., et al.
Published: (2014)

Smoothing combined estimating equations in quantile regression for longitudinal data
By: Leng, C., et al.
Published: (2014)

On fixed-domain asymptotics and covariance tapering in Gaussian random field models
By: Wang, D., et al.
Published: (2014)

A selective review of Aman Ullah’s contributions to econometrics
By: BAO, Yong, et al.
Published: (2016)

Improved Inferences for Spatial Regression Models
By: LIU, Shew Fan, et al.
Published: (2015)

Unit Root and Cointegrating Limit Theory When Initialization is in the Infinite Past
By: Peter C. B. PHILLIPS, et al.
Published: (2009)

Nonparametric adaptive design for clinical trials with continuous response
By: LI JUANJUAN
Published: (2019)

Causal change detection in possibly integrated systems: Revisiting the money-income relationship
By: SHI, Shuping, et al.
Published: (2020)