Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning

doi:10.1109/IJCNN48605.2020.9207663

Open AccessProceedings Article10.1109/IJCNN48605.2020.9207663

Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning

Palanisamy Praveen

- 19 Jul 2020

- pp 1-7

168

TL;DR: In this paper, the authors proposed the use of Partially Observable Markov Games (POSG) for formulating the connected autonomous driving problems with realistic assumptions, and provided a taxonomy of multi-agent learning environments based on the nature of tasks, nature of agents and the environment.

Abstract: The capability to learn and adapt to changes in the driving environment is crucial for developing autonomous driving systems that are scalable beyond geo-fenced operational design domains. Deep Reinforcement Learning (RL) provides a promising and scalable framework for developing adaptive learning based solutions. Deep RL methods usually model the problem as a (Partially Observable) Markov Decision Process in which an agent acts in a stationary environment to learn an optimal behavior policy. However, driving involves complex interaction between multiple, intelligent (artificial or human) agents in a highly non-stationary environment. In this paper, we propose the use of Partially Observable Markov Games(POSG) for formulating the connected autonomous driving problems with realistic assumptions. We provide a taxonomy of multi-agent learning environments based on the nature of tasks, nature of agents and the nature of the environment to help in categorizing various autonomous driving problems that can be addressed under the proposed formulation. As our main contributions, we provide MACAD-Gym, a Multi-Agent Connected, Autonomous Driving agent learning platform for furthering research in this direction. Our MACAD-Gym platform provides an extensible set of Connected Autonomous Driving (CAD) simulation environments that enable the research and development of Deep RL- based integrated sensing, perception, planning and control algorithms for CAD systems with unlimited operational design domain under realistic, multi-agent settings. We also share the MACAD-Agents that were trained successfully using the MACAD-Gym platform to learn control policies for multiple vehicle agents in a partially observable, stop-sign controlled, 3-way urban intersection environment with raw (camera) sensor observations.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1109/tits.2021.3054625

Deep Reinforcement Learning for Autonomous Driving: A Survey

01 Jun 2022

- IEEE Transactions on Intelligent Transpo...

TL;DR: The authors provides a taxonomy of automated driving tasks where deep reinforcement learning (DRL) methods have been employed, while addressing key computational challenges in real world deployment of autonomous driving agents and delineates adjacent domains such as behavior cloning, imitation learning, inverse reinforcement learning that are related but are not classical RL algorithms.

...read moreread less

1.1K

•Posted Content

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving.

Ming Zhou, +36 more

- 19 Oct 2020

- arXiv: Multiagent Systems

TL;DR: The design goals of SMARTS (Scalable Multi-Agent RL Training School) are described, its basic architecture and its key features are explained, and its use is illustrated through concrete multi-agent experiments on interactive scenarios.

...read moreread less

183

Journal Article•10.1111/MICE.12702

Graph neural network and reinforcement learning for multi-agent cooperative control of connected autonomous vehicles

Sikai Chen, +5 more

- 01 Jul 2021

- Computer-aided Civil and Infrastructure ...

TL;DR: A connected autonomous vehicle (CAV) network can be defined as a set of connected vehicles including CAVs that operate on a specific spatial scope that may be a road network, corridor, or highway as discussed by the authors.

...read moreread less

159

Journal Article•10.1016/j.rser.2021.111833

A review of reinforcement learning based energy management systems for electrified powertrains: Progress, challenge, and potential solution

Rujing Yin

- 01 Feb 2022

TL;DR: A comprehensive review of RL-based energy management strategies (EMS) is lacking in literature as discussed by the authors , however, a comprehensive survey of the literature can be found in our previous work, which reviewed the recent penetration of RL based EMS like Q-learning, Deep Q Learning, deep deterministic policy gradient in the electrified powertrains domain.

...read moreread less

147

•Proceedings Article

Probabilistic recursive reasoning for multi-agent reinforcement learning

Ying Wen, +4 more

- 09 May 2019

TL;DR: In this paper, a probabilistic recursive reasoning (PR2) framework for multi-agent reinforcement learning is introduced, where each agent can reason about how the opponents would react to its future behaviors.

...read moreread less

133

...

Expand

References

•Posted Content

Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih, +6 more

- 19 Dec 2013

- arXiv: Learning

TL;DR: This work presents the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning, which outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

...read moreread less

10.7K

•Book Chapter•10.1016/B978-1-55860-335-6.50027-1

Markov games as a framework for multi-agent reinforcement learning

Michael L. Littman

- 10 Jul 1994

TL;DR: A Q-learning-like algorithm for finding optimal policies and its application to a simple two-player game in which the optimal policy is probabilistic is demonstrated.

...read moreread less

3.2K

•Journal Article•10.1073/PNAS.39.10.1095

Stochastic Games

Lloyd S. Shapley

- 01 Oct 1953

- Proceedings of the National Academy of S...

TL;DR: In a stochastic game the play proceeds by steps from position to position, according to transition probabilities controlled jointly by the two players, and the expected total gain or loss is bounded by M, which depends on N 2 + N matrices.

...read moreread less

3.1K

Journal Article•10.1109/JPROC.2011.2132790

Dedicated Short-Range Communications (DSRC) Standards in the United States

John Kenney

- 16 Jun 2011

TL;DR: The content and status of the DSRC standards being developed for deployment in the United States are explained, including insights into why specific technical solutions are being adopted, and key challenges remaining for successful DSRC deployment.

...read moreread less

2.1K

•Book Chapter•10.1007/978-3-319-67361-5_40

AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

Shital Shah, +3 more

- 15 May 2017

TL;DR: In this paper, the authors present a new simulator built on Unreal Engine that offers physically and visually realistic simulations for autonomous vehicles in real-world environments, including a physics engine that can operate at a high frequency for real-time hardware-in-the-loop (HITL) simulations with support for popular protocols (e.g., MavLink).

...read moreread less

1.6K