Multi-Agent Imitation Learning for Driving Simulation
Raunak P. Bhattacharyya,Derek J. Phillips,Blake Wulfe,Jeremy Morton,Alex Kuefler,Mykel J. Kochenderfer +5 more
- 01 Oct 2018
- pp 1534-1539
116
TL;DR: This paper extended Generative Adversarial Imitation Learning (GAIL) to address these shortcomings through a parameter-sharing approach grounded in curriculum learning and showed that policies generated by their PS-GAIL method proved superior at interacting stably in a multi-agent setting and capturing the emergent behavior of human drivers.
read more
Abstract: Simulation is an appealing option for validating the safety of autonomous vehicles. Generative Adversarial Imitation Learning (GAIL) has recently been shown to learn representative human driver models. These human driver models were learned through training in single-agent environments, but they have difficulty in generalizing to multi-agent driving scenarios. We argue these difficulties arise because observations at training and test time are sampled from different distributions. This difference makes such models unsuitable for the simulation of driving scenes, where multiple agents must interact realistically over long time horizons. We extend GAIL to address these shortcomings through a parameter-sharing approach grounded in curriculum learning. Compared with single-agent GAIL policies, policies generated by our PS-GAIL method prove superior at interacting stably in a multi-agent setting and capturing the emergent behavior of human drivers.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles
TL;DR: In this paper, the authors provide insight into the hierarchical motion planning problem and describe the basics of Deep Reinforcement Learning (DRL), and present state-of-the-art solutions systematized by different tasks and levels of autonomous driving, such as car-following, lane-keeping, trajectory following, merging, or driving in dense traffic.
513
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
Simon Suo,Sebastian Regalado,Sergio Casas,Raquel Urtasun +3 more
- 01 Jun 2021
TL;DR: In this paper, the authors propose TrafficSim, a multi-agent behavior model for realistic traffic simulation, in which the policy is parameterized with an implicit la-tent variable model that generates socially consistent plans for all actors in the scene jointly.
Implicit Latent Variable Model for Scene-Consistent Motion Forecasting
Sergio Casas,Cole Gulino,Simon Suo,Katie Luo,Renjie Liao,Raquel Urtasun +5 more
- 23 Aug 2020
TL;DR: In this article, the authors use graph neural networks to learn a distributed latent representation of the scene and obtain trajectory samples that are consistent across traffic participants, achieving state-of-the-art results in motion forecasting and interaction understanding.
138
•Posted Content
A Survey on Autonomous Vehicle Control in the Era of Mixed-Autonomy: From Physics-Based to AI-Guided Driving Policy Learning
Xuan Di,Rongye Shi +1 more
TL;DR: This paper will not only inspire the transportation community to rethink the conventional models that are developed in the data-shortage era, but also reach out to other disciplines, in particular robotics and machine learning, to join forces towards creating a safe and efficient mixed traffic ecosystem.
117
•Posted Content
A Survey of Deep RL and IL for Autonomous Driving Policy Learning.
Zeyu Zhu,Huijing Zhao +1 more
TL;DR: In this paper, a comprehensive survey of deep reinforcement learning (DRL) and deep imitation learning (DIL) techniques for autonomous driving policy learning is presented, which is addressed simultaneously from the system, task-driven and problem-driven perspectives.
107
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Consensus and Cooperation in Networked Multi-Agent Systems
Reza Olfati-Saber,J.A. Fax,Richard M. Murray +2 more
- 05 Mar 2007
TL;DR: A theoretical framework for analysis of consensus algorithms for multi-agent networked systems with an emphasis on the role of directed information flow, robustness to changes in network topology due to link/node failures, time-delays, and performance guarantees is provided.
•Proceedings Article
Wasserstein Generative Adversarial Networks
Martin Arjovsky,Soumith Chintala,Léon Bottou +2 more
- 17 Jul 2017
TL;DR: This work introduces a new algorithm named WGAN, an alternative to traditional GAN training that can improve the stability of learning, get rid of problems like mode collapse, and provide meaningful learning curves useful for debugging and hyperparameter searches.
•Proceedings Article
Trust Region Policy Optimization
John Schulman,Sergey Levine,Pieter Abbeel,Michael I. Jordan,Philipp Moritz +4 more
- 06 Jul 2015
TL;DR: A method for optimizing control policies, with guaranteed monotonic improvement, by making several approximations to the theoretically-justified scheme, called Trust Region Policy Optimization (TRPO).