Multi-Agent Imitation Learning for Driving Simulation

doi:10.1109/IROS.2018.8593758

•Open Access•Proceedings Article•10.1109/IROS.2018.8593758

Raunak P. Bhattacharyya, +5 more

- 01 Oct 2018

- pp 1534-1539

116

TL;DR: This paper extends Generative Adversarial Imitation Learning (GAIL), which has difficulty generalizing from single-agent to multi-agent driving scenarios, through a parameter-sharing approach grounded in curriculum learning. It shows that policies generated by the resulting PS-GAIL method are superior to single-agent GAIL policies at interacting stably in a multi-agent setting and at capturing the emergent behavior of human drivers.

Citations

•Journal Article•10.1109/TITS.2020.3024655

Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles

Szilárd Aradi

- 30 Jan 2020

- IEEE Transactions on Intelligent Transpo...

TL;DR: In this paper, the authors provide insight into the hierarchical motion planning problem and describe the basics of Deep Reinforcement Learning (DRL), and present state-of-the-art solutions systematized by different tasks and levels of autonomous driving, such as car-following, lane-keeping, trajectory following, merging, or driving in dense traffic.

513

•Proceedings Article•10.1109/CVPR46437.2021.01026

TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors

Simon Suo, +3 more

- 01 Jun 2021

TL;DR: In this paper, the authors propose TrafficSim, a multi-agent behavior model for realistic traffic simulation, in which the policy is parameterized with an implicit latent variable model that generates socially consistent plans for all actors in the scene jointly.

230

•Book Chapter•10.1007/978-3-030-58592-1_37

Implicit Latent Variable Model for Scene-Consistent Motion Forecasting

Sergio Casas, +5 more

- 23 Aug 2020

TL;DR: In this article, the authors use graph neural networks to learn a distributed latent representation of the scene and obtain trajectory samples that are consistent across traffic participants, achieving state-of-the-art results in motion forecasting and interaction understanding.

138

•Posted Content

A Survey on Autonomous Vehicle Control in the Era of Mixed-Autonomy: From Physics-Based to AI-Guided Driving Policy Learning

Xuan Di, +1 more

- 10 Jul 2020

- arXiv: Artificial Intelligence

TL;DR: This paper will not only inspire the transportation community to rethink the conventional models that are developed in the data-shortage era, but also reach out to other disciplines, in particular robotics and machine learning, to join forces towards creating a safe and efficient mixed traffic ecosystem.

117

•Posted Content

A Survey of Deep RL and IL for Autonomous Driving Policy Learning.

Zeyu Zhu, +1 more

- 06 Jan 2021

- arXiv: Robotics

TL;DR: This paper presents a comprehensive survey of deep reinforcement learning (DRL) and deep imitation learning (DIL) techniques for autonomous driving policy learning, approaching the topic simultaneously from system, task-driven, and problem-driven perspectives.

107

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

138.5K

•Posted Content

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 22 Dec 2014

- arXiv: Learning

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.

82.5K

•Journal Article•10.1109/JPROC.2006.887293

Consensus and Cooperation in Networked Multi-Agent Systems

Reza Olfati-Saber, +2 more

- 05 Mar 2007

TL;DR: A theoretical framework for analysis of consensus algorithms for multi-agent networked systems with an emphasis on the role of directed information flow, robustness to changes in network topology due to link/node failures, time-delays, and performance guarantees is provided.

11K

•Proceedings Article

Wasserstein Generative Adversarial Networks

Martin Arjovsky, +2 more

- 17 Jul 2017

TL;DR: This work introduces a new algorithm named WGAN, an alternative to traditional GAN training that can improve the stability of learning, get rid of problems like mode collapse, and provide meaningful learning curves useful for debugging and hyperparameter searches.

8.2K

•Proceedings Article

Trust Region Policy Optimization

John Schulman, +4 more

- 06 Jul 2015

TL;DR: This work describes a method for optimizing control policies with guaranteed monotonic improvement; making several approximations to the theoretically justified scheme yields a practical algorithm called Trust Region Policy Optimization (TRPO).

6K

Related Papers (5)

Social LSTM: Human Trajectory Prediction in Crowded Spaces

Maximum entropy inverse reinforcement learning

Congested traffic states in empirical observations and microscopic simulations

Generative Adversarial Nets

Apprenticeship learning via inverse reinforcement learning