Open AccessPosted Content
Modeling Human Driving Behavior through Generative Adversarial Imitation Learning.
Raunak P. Bhattacharyya,Blake Wulfe,Derek J. Phillips,Alex Kuefler,Jeremy Morton,Ransalu Senanayake,Mykel J. Kochenderfer +6 more
TL;DR: Experiments show that modifications to GAIL can successfully model highway driving behavior, accurately replicating human demonstrations and generating realistic, emergent behavior in the traffic flow arising from the interaction between driving agents.
read more
Abstract: Imitation learning is an approach for generating intelligent behavior when the cost function is unknown or difficult to specify. Building upon work in inverse reinforcement learning (IRL), Generative Adversarial Imitation Learning (GAIL) aims to provide effective imitation even for problems with large or continuous state and action spaces. Driver modeling is one example of a problem where the state and action spaces are continuous. Human driving behavior is characterized by non-linearity and stochasticity, and the underlying cost function is unknown. As a result, learning from human driving demonstrations is a promising approach for generating human-like driving behavior. This article describes the use of GAIL for learning-based driver modeling. Because driver modeling is inherently a multi-agent problem, where the interaction between agents needs to be modeled, this paper describes a parameter-sharing extension of GAIL called PS-GAIL to tackle multi-agent driver modeling. In addition, GAIL is domain agnostic, making it difficult to encode specific knowledge relevant to driving in the learning process. This paper describes Reward Augmented Imitation Learning (RAIL), which modifies the reward signal to provide domain-specific knowledge to the agent. Finally, human demonstrations are dependent upon latent factors that may not be captured by GAIL. This paper describes Burn-InfoGAIL, which allows for disentanglement of latent variability in demonstrations. Imitation learning experiments are performed using NGSIM, a real-world highway driving dataset. Experiments show that these modifications to GAIL can successfully model highway driving behavior, accurately replicating human demonstrations and generating realistic, emergent behavior in the traffic flow arising from the interaction between driving agents.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A Survey on Trajectory-Prediction Methods for Autonomous Driving
01 Sep 2022
TL;DR: A comprehensive and comparative review of trajectory prediction methods for autonomous driving can be found in this article , where the authors evaluate the performance of each kind of method and outline potential research directions to guide readers.
276
A Survey on Trajectory-Prediction Methods for Autonomous Driving
TL;DR: A comprehensive and comparative review of trajectory prediction methods for autonomous driving can be found in this paper , where the authors evaluate the performance of each kind of method and outline potential research directions to guide readers.
252
TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors
Simon Suo,Sebastian Regalado,Sergio Casas,Raquel Urtasun +3 more
- 01 Jun 2021
TL;DR: In this paper, the authors propose TrafficSim, a multi-agent behavior model for realistic traffic simulation, in which the policy is parameterized with an implicit la-tent variable model that generates socially consistent plans for all actors in the scene jointly.
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Maryam Zare,Parham M. Kebria,Abbas Khosravi,Saeid Nahavandi +3 more
TL;DR: The goal of the paper is to provide a comprehensive guide to the growing field of IL in robotics and AI, where desired behavior is learned by imitating an expert's behavior, which is provided through demonstrations.
A Review of Driving Style Recognition Methods From Short-Term and Long-Term Perspectives
TL;DR: In this article , the authors survey related advances in driving style recognition along short and long-term pipelines and discuss the potential applications of driving styles recognition in intelligent vehicles. But, most works fail to consider the influence of deploying the recognition results on the vehicle side.
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
Long short-term memory
TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.
99K
•Proceedings Article
ImageNet Classification with Deep Convolutional Neural Networks
Alex Krizhevsky,Ilya Sutskever,Geoffrey E. Hinton +2 more
- 03 Dec 2012
TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
- 08 Dec 2014
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.