Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach

This paper proposes a game-theoretic inverse reinforcement learning (GT-IRL) framework, which aims to learn the parameters in both the dynamic system and individual cost function of multistage games from demonstrated trajectories. Different from the probabilistic approaches in computer science commu...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	Cao, Kun, Xie, Lihua
مؤلفون آخرون:	School of Electrical and Electronic Engineering
التنسيق:	مقال
اللغة:	English
منشور في:	2022
الموضوعات:	Engineering::Electrical and electronic engineering Maximum Principle Multistage Game
الوصول للمادة أونلاين:	https://hdl.handle.net/10356/162585
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Nanyang Technological University
اللغة:	English

id	sg-ntu-dr.10356-162585
record_format	dspace
spelling	sg-ntu-dr.10356-1625852022-10-31T08:09:52Z Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach Cao, Kun Xie, Lihua School of Electrical and Electronic Engineering Engineering::Electrical and electronic engineering Maximum Principle Multistage Game This paper proposes a game-theoretic inverse reinforcement learning (GT-IRL) framework, which aims to learn the parameters in both the dynamic system and individual cost function of multistage games from demonstrated trajectories. Different from the probabilistic approaches in computer science community and residual minimization solutions in control community, our framework addresses the problem in a deterministic setting by differentiating Pontryagin’s Maximum Principle (PMP) equations of open-loop Nash equilibrium (OLNE), which is inspired by [1]. The differentiated equations for a multi-player nonzero-sum multistage game are shown to be equivalent to the PMP equations for another affine-quadratic nonzero-sum multistage game and can be solved by some explicit recursions. A similar result is established for 2-player zero-sum games. Simulation examples are presented to demonstrate the effectiveness of our proposed algorithms. Ministry of Education (MOE) This work was supported by Ministry of Education of Republic of Singapore under Grant AcRF TIER 1-2019-T1-001-088 (RG72/19). 2022-10-31T08:09:52Z 2022-10-31T08:09:52Z 2022 Journal Article Cao, K. & Xie, L. (2022). Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach. IEEE Transactions On Neural Networks and Learning Systems. https://dx.doi.org/10.1109/TNNLS.2022.3148376 2162-237X https://hdl.handle.net/10356/162585 10.1109/TNNLS.2022.3148376 en 2019-T1- 001-088 (RG72/19) IEEE Transactions on Neural Networks and Learning Systems © 2022 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering Maximum Principle Multistage Game
spellingShingle	Engineering::Electrical and electronic engineering Maximum Principle Multistage Game Cao, Kun Xie, Lihua Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
description	This paper proposes a game-theoretic inverse reinforcement learning (GT-IRL) framework, which aims to learn the parameters in both the dynamic system and individual cost function of multistage games from demonstrated trajectories. Different from the probabilistic approaches in computer science community and residual minimization solutions in control community, our framework addresses the problem in a deterministic setting by differentiating Pontryagin’s Maximum Principle (PMP) equations of open-loop Nash equilibrium (OLNE), which is inspired by [1]. The differentiated equations for a multi-player nonzero-sum multistage game are shown to be equivalent to the PMP equations for another affine-quadratic nonzero-sum multistage game and can be solved by some explicit recursions. A similar result is established for 2-player zero-sum games. Simulation examples are presented to demonstrate the effectiveness of our proposed algorithms.
author2	School of Electrical and Electronic Engineering
author_facet	School of Electrical and Electronic Engineering Cao, Kun Xie, Lihua
format	Article
author	Cao, Kun Xie, Lihua
author_sort	Cao, Kun
title	Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
title_short	Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
title_full	Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
title_fullStr	Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
title_full_unstemmed	Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
title_sort	game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach
publishDate	2022
url	https://hdl.handle.net/10356/162585
_version_	1749179194369114112

Game-theoretic inverse reinforcement learning: a differential pontryagin's maximum principle approach

مواد مشابهة