Multi-Agent Reinforcement Learning
Multi-agent reinforcement learning (MARL) uses reinforcement learning techniques to train a set of agents to solve a specified task. This includes agents working in a team to collaboratively accomplish tasks, as well as agents in competitive scenarios with conflicting interests. Recent advances in MARL have leveraged deep learning to scale to bigger problems and address some of the inherent challenges in MARL .
A core problem in multi-agent learning is that the environment is non-stationary from the perspective of individual agents: each agent learns about and adapts to the environment which includes other agents who, likewise, are continually adapting their behaviours. Several approaches have been proposed to tackle this non-stationary, including modelling the behaviours of other agents , learning to communicate , and centralised training architectures . However, non-stationary remains a significant open challenge, which is further complicated by the need to scale to complex domains and to deal with partial observability of the environment.
The goal of this project is to develop novel algorithms for highly efficient multi-agent reinforcement learning in complex environments. Examples of potential evaluation domains include competitive games (e.g. Robocup 2D/3D soccer, Starcraft 2), autonomous vehicles in dense city traffic, and autonomous wireless networks.
 G. Papoudakis, F. Christianos, A. Rahman, S.V. Albrecht (2019). Dealing with non-stationarity in multi-agent deep reinforcement learning. ArXiv, 1906.04737 https://arxiv.org/abs/1906.04737
 Stefano Albrecht and Peter Stone (2018). Autonomous agents modelling other agents: A comprehensive survey and open problems. Artificial Intelligence 258:66-95.
 Sainbayar Sukhbaatar and Rob Fergus (2016). Learning multiagent communication with backpropagation. In: Advances in Neural Information Processing Systems, pp. 2244-2252.
 Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, and Igor Mordatch (2017). Multi-agent actor-critic for mixed cooperative-competitive environments. In: Advances in Neural Information Processing Systems, pp. 6379-6390.