Difference of convex functions programming for policy optimization in reinforcement learning

Difference of convex functions programming for policy optimization in reinforcement learning

We formulate the problem of optimizing an agent's policy within the Markov decision process (MDP) model as a difference-of-convex functions (DC) program. The DC perspective enables optimizing the policy iteratively where each iteration constructs an easier-to-optimize lower bound on the value f...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	KUMAR, Akshat
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2024
الموضوعات:	Agent policy Reinforcement learning optimization Difference-of-convex functions Reinforcement learning algorithm Artificial Intelligence and Robotics
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/9926 https://ink.library.smu.edu.sg/context/sis_research/article/10926/viewcontent/ConvexFunctionsProg_pvoa_cc_by.pdf
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

مواد مشابهة

Constrained reinforcement learning in hard exploration problems
بواسطة: PATHMANATHAN, Pankayaraj, وآخرون
منشور في: (2023)

Integrating motivated learning and k-winner-take-all to coordinate multi-agent reinforcement learning
بواسطة: TENG, Teck-Hou, وآخرون
منشور في: (2014)

Motivated learning as an extension of reinforcement learning
بواسطة: STARZYK, Janusz, وآخرون
منشور في: (2010)

Reinforcement learning for zone based multiagent pathfinding under uncertainty
بواسطة: LING, Jiajing, وآخرون
منشور في: (2020)

Constrained multiagent reinforcement learning for large agent population
بواسطة: LING, Jiajing, وآخرون
منشور في: (2022)

Imitate the good and avoid the bad: An incremental approach to safe reinforcement learning
بواسطة: HOANG, Minh Huy, وآخرون
منشور في: (2024)

Benchmarking MARL on Long Horizon Sequential Multi-Objective Tasks
بواسطة: GENG, Minghong, وآخرون
منشور في: (2024)

Reinforcement Nash Equilibrium Solver
بواسطة: WANG, Xinrun, وآخرون
منشور في: (2024)

Target driven visual navigation for a mobile robot using deep reinforcement learning
بواسطة: Liu, Chengxiao
منشور في: (2025)

Constrained multiagent reinforcement learning for large agent population
بواسطة: LING, Jiajing, وآخرون
منشور في: (2023)

Context-aware techniques for real-time decision making in autonomous mobile robots (Situation awareness Part A)
بواسطة: Parittotog, Apichaya
منشور في: (2024)

Approximate difference rewards for scalable multigent reinforcement learning
بواسطة: SINGH, Arambam James, وآخرون
منشور في: (2021)

SPRINQL : Sub-optimal demonstrations driven offline imitation learning
بواسطة: HOANG, Minh Huy, وآخرون
منشور في: (2024)

End-to-end deep reinforcement learning for multi-agent collaborative exploration
بواسطة: CHEN, Zichen, وآخرون
منشور في: (2019)

Augmenting decision with hypothesis in reinforcement learning
بواسطة: NGUYEN, Minh Quang, وآخرون
منشور في: (2024)

Towards Explaining Sequences of Actions in Multi-Agent Deep Reinforcement Learning Models
بواسطة: KHAING, Phyo Wai, وآخرون
منشور في: (2023)

MODEL-BASED REINFORCEMENT LEARNING FOR COMPLEX ENVIRONMENTS
بواسطة: MA XIAO
منشور في: (2022)

Reinforcement learning for robot assembly
بواسطة: Vuong Quoc Nghia
منشور في: (2024)

Action selection for composable modular deep reinforcement learning
بواسطة: GUPTA, Vaibhav, وآخرون
منشور في: (2021)

Transition-informed reinforcement learning for large-scale Stackelberg mean-field games.
بواسطة: LI, Pengdeng, وآخرون
منشور في: (2024)

Financial portfolio optimization: an autoregressive deep reinforcement learning algorithm with learned intrinsic rewards
بواسطة: Lim, Magdalene Hui Qi
منشور في: (2024)

HIERARCHICAL REINFORCEMENT LEARNING WITH PARAMETERIZED OPTIONS FOR LONG-HORIZON ROBOTIC MANIPULATION
بواسطة: GUO CHAOQUN
منشور في: (2023)

End-to-end hierarchical reinforcement learning with integrated subgoal discovery
بواسطة: PATERIA, Shubham, وآخرون
منشور في: (2022)

SAMPLE-EFFICIENT AUTOMATED MACHINE LEARNING WITH BAYESIAN OPTIMIZATION
بواسطة: DAI ZHONGXIANG
منشور في: (2021)

Reward penalties on augmented states for solving richly constrained RL effectively
بواسطة: HAO, Jiang, وآخرون
منشور في: (2024)

TOWARDS HUMAN-CENTRIC AI: INVERSE REINFORCEMENT LEARNING MEETS ALGORITHMIC FAIRNESS
بواسطة: SREEJITH BALAKRISHNAN
منشور في: (2023)

Safety through feedback in constrained RL
بواسطة: CHIRRA, Shashank Reddy, وآخرون
منشور في: (2024)

A biologically-inspired cognitive agent model integrating declarative knowledge and reinforcement learning
بواسطة: TAN, Ah-hwee, وآخرون
منشور في: (2010)

Multi-agent reinforcement learning in spatial domain tasks using inter subtask empowerment rewards
بواسطة: PATERIA, Shubham, وآخرون
منشور في: (2019)

Deep Reinforcement Learning With Explicit Context Representation
بواسطة: Munguia-Galeano, Francisco, وآخرون
منشور في: (2023)

COGNITIVE ENGINE AND DEEP REINFORCEMENT LEARNING FOR ROBOT-ASSISTED SURGERY
بواسطة: TAN XIAOYU
منشور في: (2019)

Mimicking to dominate: Imitation learning strategies for success in multiagent competitive games
بواسطة: BUI, The Viet, وآخرون
منشور في: (2024)

Exploration of network centrality in goal conditioned reinforcement learning
بواسطة: Sharma Divyansh
منشور في: (2024)

A Survey on Visual Navigation for Artificial Agents with Deep Reinforcement Learning
بواسطة: Zeng, F., وآخرون
منشور في: (2021)

Efficient neural collaborative search for pickup and delivery problems
بواسطة: KONG, Detian, وآخرون
منشور في: (2024)

DIFFERENTIABLE SOCIAL PROJECTION WITH DEEP SELF-MODEL IMPLANTS FOR ASSISTIVE HUMAN-ROBOT COMMUNICATION
بواسطة: FONG CHEE YONG JEFFREY
منشور في: (2022)

A hybrid stochastic-deterministic minibatch proximal gradient method for efficient optimization and generalization
بواسطة: ZHOU, Pan, وآخرون
منشور في: (2021)

Reinforcement learning for sequential decision making with constraints
بواسطة: LING, Jiajing
منشور في: (2023)

Approximate difference rewards for scalable multigent reinforcement learning
بواسطة: SINGH, Arambam James, وآخرون
منشور في: (2021)

An efficient approach to model-based hierarchical reinforcement learning
بواسطة: LI, Zhuoru, وآخرون
منشور في: (2017)