Deterministic Policy Gradient: Convergence Analysis

The Conference on Uncertainty in Artificial Intelligence (UAI)

Bibliographic Details
Main Authors:	Xiong, Huaqing; Xu, Tengyu; Zhao, Lin; Liang, Yingbin; Zhang, Wei
Other Authors:	ELECTRICAL AND COMPUTER ENGINEERING
Format:	Conference or Workshop Item
Published:	2022
Online Access:	https://scholarbank.nus.edu.sg/handle/10635/229019
Institution:	National University of Singapore

Similar Items

Multi-agent deep deterministic policy gradient algorithm for swarm systems
by: Bedi, Jannat
Published: (2021)

Reducing estimation bias via triplet-average deep deterministic policy gradient
by: WU, Dongming, et al.
Published: (2020)

Finite-time theory of momentum Q-learning
by: Zhao, Lin, et al.
Published: (2021)

Convergence analysis of Xu's LMSER learning algorithm via deterministic discrete time system method
by: Cheng Lv, Jian, et al.
Published: (2014)

Convergence analysis of a deterministic discrete time system of Oja's PCA learning algorithm
by: Yi, Z., et al.
Published: (2014)

A hybrid stochastic-deterministic minibatch proximal gradient method for efficient optimization and generalization
by: ZHOU, Pan, et al.
Published: (2021)

Order of convergence of splitting schemes for both deterministic and stochastic nonlinear Schrödinger equations
by: Liu, J.
Published: (2014)

TWO STAGE OPTIMIZATION OF ENERGY REGENERATION AND BRAKING STABILITY OF ELECTRIC TRIKE USING DEEP DETERMINISTIC POLICY GRADIENT AND PARTICLE SWARM OPTIMIZATION
by: Cahya Kirana, Rizky

Gradient-free distributed optimization with exact convergence
by: Pang, Yipeng, et al.
Published: (2022)

Convergence of asynchronous distributed gradient methods over stochastic networks
by: Xu, Jinming, et al.
Published: (2020)

Reliability Analysis of Non-deterministic Systems
by: GUI LIN
Published: (2014)

Hybrid stochastic-deterministic minibatch proximal gradient: Less-than-single-pass optimization with nearly optimal generalization
by: ZHOU, Pan, et al.
Published: (2020)

Stochastic and deterministic effects of a moisture gradient on soil microbial communities in the McMurdo dry valleys of Antarctica
by: Lee, K.C., et al.
Published: (2021)

Exact convergence of gradient-free distributed optimization method in a multi-agent system
by: Pang, Yipeng, et al.
Published: (2020)

CONVERGENCE OF A CONDITIONAL GRADIENT METHOD FOR RELAXED CONTROLS IN TIME-LAG CONTROL PROBLEMS.
by: Wilson, S.J.
Published: (2014)

Convergence of communications technologies : policy options
by: Kosol Petchsuwan
Published: (2008)

Incremental deterministic planning
by: Andrei, Ş., et al.
Published: (2013)

Deterministic construction of sparse binary matrices via incremental integer optimization
by: Zhang, Jun, et al.
Published: (2020)

Complete deterministic linear optics bell state analysis
by: Schuck, C., et al.
Published: (2014)

Global convergence of a two-parameter family of conjugate gradient methods without line search
by: Chen, X., et al.
Published: (2013)

Verification of deterministic solar forecasts
by: Yang, D, et al.
Published: (2020)

Bisimilarity enforcing supervisory control for deterministic specifications
by: Sun, Y., et al.
Published: (2014)

Towards Convergence in Social Protection Policies and Programmes
by: Ang, Alvin P, et al.
Published: (2012)

Equivalence of stochastic and deterministic mechanisms
by: CHEN, Yi-Chun, et al.
Published: (2019)

DISTRIBUTED DETERMINISTIC ASYNCHRONOUS ALGORITHMS
by: DORON LOH
Published: (2021)

On Deterministic Perturbations of Summability Maps
by: GAO BING
Published: (2013)

Robust PCA in high-dimension: A deterministic approach
by: Feng, J., et al.
Published: (2014)

Concentration gradient generator
by: Chua, Ivan Wei Liang.
Published: (2012)

RaPiD: A toolkit for reliability analysis of non-deterministic systems
by: GUI, Lin, et al.
Published: (2014)

Site-specific deterministic seismic hazard analysis of Surabaya, Indonesia
by: Deng, Xiaofang
Published: (2015)

A security analysis of a deterministic key generation scheme
by: SONG, Yuhao, et al.
Published: (2024)

Mining deterministic biclusters in gene expression data
by: Zhang, Z., et al.
Published: (2013)

Training algorithm design and weight convergence analysis for discrete-time recurrent neural networks
by: Xu, Zhao
Published: (2013)

Electrokinetically driven continuous-flow enrichment of colloidal particles by Joule heating induced temperature gradient focusing in a convergent-divergent microfluidic structure
by: Zhao, Cunlu, et al.
Published: (2018)

Power clones and non-deterministic hypersubstitutions
by: K. Denecke, et al.
Published: (2018)

Parallel global optimization with deterministic approaches
by: Wu, Yong
Published: (2008)

Bubble testing under deterministic trends
by: WANG, Xiaohu, et al.
Published: (2017)

Deterministic seasonal models and spurious regressions
by: Abeysinghe, T.
Published: (2011)

DYNAMICAL ANALYSIS FOR DETERMINISTIC MODELS OF TWO-PATHOGEN INFECTION DISEASE TRANSMISSIONS
by: Fahlena, Hilda

A Review on Deterministic Lateral Displacement for Particle Separation and Detection
by: Salafi, T, et al.
Published: (2020)