ICAPS 2021

Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms

In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear which algorithm can be applied under which conditions. In this survey, we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentary (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple resources. Further, we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems, both those directly related to the conceptualized domains and those in adjacent research areas.

Session 23: Reinforcement Learning

Guiding Robot Exploration in Reinforcement Learning via Automated Planning
Authors: Yohei Hayamizu, Saeid Amiri, Kishan Chandan, Keiki Takadama and Shiqi Zhang
Keywords: Reinforcement Learning, Automated Planning, Robotics

A Simulator-based Planning Framework for Optimizing Autonomous Greenhouse Control Strategy
Authors: Zhicheng An, Xiaoyan Cao, Yao Yao, Wanpeng Zhang, Lanqing Li, Yue Wang, Shihui Guo and Dijun Luo
Keywords: Simulator-based planning, Greenhouse control strategy, Artificial intelligence, Reinforcement learning, Heuristic algorithm

A Deep Ensemble Method for Multi-Agent Reinforcement Learning: A Case Study on Air Traffic Control
Authors: Supriyo Ghosh, Sean Laguna, Shiau Hong Lim, Laura Wynter and Hasan Poonawala
Keywords: Reinforcement Learning, Deep Ensemble Learning, Air Traffic Control

Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms
Authors: Frits de Nijs, Erwin Walraven, Mathijs De Weerdt and Matthijs T. J. Spaan