Deterministic Policy Gradient: Convergence Analysis
The Conference on Uncertainty in Artificial Intelligence (UAI)
Saved in:
Main Authors: | Huaqing, Xiong, Tengyu, Xu, Zhao, Lin, Yingbin, Liang, Wei, Zhang |
---|---|
Other Authors: | ELECTRICAL AND COMPUTER ENGINEERING |
Format: | Conference or Workshop Item |
Published: |
2022
|
Online Access: | https://scholarbank.nus.edu.sg/handle/10635/229019 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | National University of Singapore |
Similar Items
-
Finite-Time Analysis for Double Q-learning
by: Xiong, Huaqing, et al.
Published: (2021) -
Multi-agent deep deterministic policy gradient algorithm for swarm systems
by: Bedi, Jannat
Published: (2021) -
Reducing estimation bias via triplet-average deep deterministic policy gradient
by: WU, Dongming, et al.
Published: (2020) -
Finite-time theory of momentum Q-learning
by: Zhao, Lin, et al.
Published: (2021) -
Convergence analysis of Xu's LMSER learning algorithm via deterministic discrete time system method
by: Cheng Lv, Jian, et al.
Published: (2014)