Prioritized Shaping of Models for Solving DEC-POMDPs

An interesting class of multi-agent POMDP planning problems can be solved by having agents iteratively solve individual POMDPs, find interactions with other individual plans, shape their transition and reward functions to encourage good interactions and discourage bad ones and then recompute a new p...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلفون الرئيسيون: VARAKANTHAM, Pradeep Reddy, YEOH, William, Velagapudi, Prasanna, Scerri, Paul
التنسيق: text
اللغة:English
منشور في: Institutional Knowledge at Singapore Management University 2012
الموضوعات:
الوصول للمادة أونلاين:https://ink.library.smu.edu.sg/sis_research/1607
https://ink.library.smu.edu.sg/context/sis_research/article/2606/viewcontent/C14___Prioritized_Shaping_of_Models_for_Solving_DEC_POMDPs__AAMAS2012_.pdf
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة: Singapore Management University
اللغة: English
الوصف
الملخص:An interesting class of multi-agent POMDP planning problems can be solved by having agents iteratively solve individual POMDPs, find interactions with other individual plans, shape their transition and reward functions to encourage good interactions and discourage bad ones and then recompute a new plan. D-TREMOR showed that this approach can allow distributed planning for hundreds of agents. However, the quality and speed of the planning process depends on the prioritization scheme used. Lower priority agents shape their models with respect to the models of higher priority agents. In this paper, we introduce a new prioritization scheme that is guaranteed to converge and is empirically better, in terms of solution quality and planning time, than the existing prioritization scheme for some problems.