Neural-progressive hedging: Enforcing constraints in reinforcement learning with stochastic programming

We propose a framework, called neural-progressive hedging (NP), that leverages stochastic programming during the online phase of executing a reinforcement learning (RL) policy. The goal is to ensure feasibility with respect to constraints and risk-based objectives such as conditional value-at-risk (...

Bibliographic Details
Main Authors: GHOSH, Supriyo, WYNTER, Laura, LIM, Shiau Hong, NGUYEN, Duc Thien
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2022
Online Access: https://ink.library.smu.edu.sg/sis_research/7760
https://ink.library.smu.edu.sg/context/sis_research/article/8763/viewcontent/ghosh22a.pdf
Institution: Singapore Management University