Towards Explaining Sequences of Actions in Multi-Agent Deep Reinforcement Learning Models

Although Multi-agent Deep Reinforcement Learning (MADRL) has shown promising results in solving complex real-world problems, the applicability and reliability of MADRL models are often limited by a lack of understanding of their inner workings for explaining the decisions made. To address this issue...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	KHAING, Phyo Wai, GENG, Minghong, SUBAGDJA, Budhitama, PATERIA, Shubham, TAN, Ah-hwee
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2023
الموضوعات:	Multi Agent Deep Reinforcement Learning Explainable Artificial Intelligence Explainable Deep Reinforcement Learning Databases and Information Systems
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/8076 https://ink.library.smu.edu.sg/context/sis_research/article/9079/viewcontent/p2325.pdf
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

الوصف
الملخص:	Although Multi-agent Deep Reinforcement Learning (MADRL) has shown promising results in solving complex real-world problems, the applicability and reliability of MADRL models are often limited by a lack of understanding of their inner workings for explaining the decisions made. To address this issue, this paper proposes a novel method for explaining MADRL by generalizing the sequences of action events performed by agents into high-level abstract strategies using a spatio-temporal neural network model. Specifically, an interval-based memory retrieval procedure is developed to generalize the encoded sequences of action events over time into short sequential patterns. In addition, two abstraction algorithms are introduced, one for abstracting action events across multiple agents and the other for further abstracting the episodes over time into short sequential patterns, which can then be translated into symbolic form for interpretation. We evaluate the proposed method using the StarCraft Multi Agent Challenge (SMAC) benchmark task, which shows that the method is able to derive high-level explanations of MADRL models at various levels of granularity.

Towards Explaining Sequences of Actions in Multi-Agent Deep Reinforcement Learning Models

مواد مشابهة