A smoothed Q-learning algorithm for estimating optimal dynamic treatment regime

In this paper we propose a smoothed Q-learning algorithm for estimating optimal dynamic treatment regimes. In contrast to the Q-learning algorithm in which non-regular inference is involved, we show that under assumptions adopted in this paper, the proposed smoothed Q-learning estimator is asymptoti...

Bibliographic Details
Main Authors:	FAN, Yanqin; HE, Ming; SU, Liangjun; ZHOU, Xiao-Hua
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2019
Subjects:	Asymptotic normality; Exceptional law; Optimal smoothing parameter; Sequential randomization; Wald-type inference; Econometrics
Online Access:	https://ink.library.smu.edu.sg/soe_research/2044 https://ink.library.smu.edu.sg/context/soe_research/article/3043/viewcontent/Smoothed_Q_learning_algorithm_for_estimating_optimal_2016_pp.pdf

Similar Items

Optimal Estimation under Nonstandard Conditions
by: PLOBERGER, Werner, et al.
Published: (2012)

On the Asymptotic Effect of Substituting Estimators for Nuisance Parameters in Inferential Statistics
by: YANG, Zhenlin, et al.
Published: (2003)

A new statistic for regression transformation
by: YANG, Zhenlin
Published: (2000)

Power Maximization and Size Control in Heteroskedasticity and Autocorrelation Robust Tests with Exponentiated Kernels
by: SUN, Yixiao, et al.
Published: (2009)

SEQUENTIAL MONTE CARLO ALGORITHMS FOR HIGH-DIMENSIONAL FILTERING AND SMOOTHING
by: XU YAXIAN
Published: (2018)

A sequential constant-stress accelerated life testing scheme and its Bayesian inference
by: Liu, X., et al.
Published: (2014)

Exception analysis for non-strict languages
by: Glynn, K., et al.
Published: (2013)

Power Maximization and Size Control of Heteroscedasticity and Autocorrelation Robust Tests with Exponentiated Kernels
by: SUN, Yixiao, et al.
Published: (2011)

Approximate Bayesian Computation for Smoothing
by: Martin, J.S., et al.
Published: (2016)

A practical test for strict exogeneity in linear panel data models with fixed effects
by: SU, Liangjun, et al.
Published: (2016)

Planning and inference of sequential accelerated life tests
by: LIU XIAO
Published: (2010)

Likelihood computation for hidden Markov models via generalized two-filter smoothing
by: Persing, A., et al.
Published: (2016)

COMPUTATIONAL METHODS FOR MULTIPLE TARGET TRACKING
by: ZENG JIAJIE
Published: (2020)

Computing maximum smoothness forward rate curves
by: Lim, K.G., et al.
Published: (2013)

Fixed Domain Asymptotics and Consistent Estimation for Gaussian Random Field Models in Spatial Statistics and Computer Experiments
by: WANG DAQING
Published: (2011)

The experience of some OECD economies on tax smoothing
by: Jayawickrama, A., et al.
Published: (2014)

Asymptotic theory for explosive fractional Ornstein–Uhlenbeck processes
by: JIANG, Hui, et al.
Published: (2023)

On the ranked-set sampling M-estimates for symmetric location families
by: Zhao, X., et al.
Published: (2014)

A new statistic for regression transformation
by: Yang, Z.
Published: (2014)

Calculating sized types
by: Chin, W.-N., et al.
Published: (2013)

Mean and Autocovariance Function Estimation Near the Boundary of Stationarity
by: GIRAITIS, Liudas, et al.
Published: (2012)

Smooth convex approximation and its applications
by: SHI SHENGYUAN
Published: (2010)

The optimal ranked-set sampling scheme for inference on population quantiles
by: Chen, Z.
Published: (2014)

Threshold regression asymptotics: From the compound Poisson process to two-sided Brownian motion
by: YU, Ping, et al.
Published: (2018)

EMERGENCY AND MODERNITY: CONTEXTUALIZING THE CONTEMPORARY DEBATE
by: PETER DAVID FINN
Published: (2020)

The grid bootstrap for continuous time models
by: LUI, Yiu Lim, et al.
Published: (2022)

Set Inference for Semiparametric Discrete Games
by: KIM, Kyoo-il
Published: (2006)

DETECTION OF SPARSE CHANGE-POINTS IN HIGH-DIMENSIONAL DATA
by: HUANG JINGYAN
Published: (2023)

Asymptotics and bootstrap for random-effects panel data transformation models
by: SU, Liangjun, et al.
Published: (2018)

Statistical inferences for functional data
by: Zhang, J.-T., et al.
Published: (2014)

Asymptotic normality of scaling functions
by: Chen, L.H.Y., et al.
Published: (2014)

Permutation-based tests for discontinuities in event studies
by: BUGNI, Federico, et al.
Published: (2022)

Relevance weighted likelihood for dependent data
by: Hu, F., et al.
Published: (2014)

Smoothing combined estimating equations in quantile regression for longitudinal data
by: Leng, C., et al.
Published: (2014)

On fixed-domain asymptotics and covariance tapering in Gaussian random field models
by: Wang, D., et al.
Published: (2014)

Improved Inferences for Spatial Regression Models
by: LIU, Shew Fan, et al.
Published: (2015)

A selective review of Aman Ullah’s contributions to econometrics
by: BAO, Yong, et al.
Published: (2016)

Unit Root and Cointegrating Limit Theory When Initialization is in the Infinite Past
by: Peter C. B. PHILLIPS, et al.
Published: (2009)

Nonparametric adaptive design for clinical trials with continuous response
by: LI JUANJUAN
Published: (2019)

Causal change detection in possibly integrated systems: Revisiting the money-income relationship
by: SHI, Shuping, et al.
Published: (2020)