Modeling Human Driving Behavior through Generative Adversarial Imitation Learning.

Open AccessPosted Content

Modeling Human Driving Behavior through Generative Adversarial Imitation Learning.

- 10 Jun 2020

101

TL;DR: Experiments show that modifications to GAIL can successfully model highway driving behavior, accurately replicating human demonstrations and generating realistic, emergent behavior in the traffic flow arising from the interaction between driving agents.

Abstract: Imitation learning is an approach for generating intelligent behavior when the cost function is unknown or difficult to specify. Building upon work in inverse reinforcement learning (IRL), Generative Adversarial Imitation Learning (GAIL) aims to provide effective imitation even for problems with large or continuous state and action spaces. Driver modeling is one example of a problem where the state and action spaces are continuous. Human driving behavior is characterized by non-linearity and stochasticity, and the underlying cost function is unknown. As a result, learning from human driving demonstrations is a promising approach for generating human-like driving behavior. This article describes the use of GAIL for learning-based driver modeling. Because driver modeling is inherently a multi-agent problem, where the interaction between agents needs to be modeled, this paper describes a parameter-sharing extension of GAIL called PS-GAIL to tackle multi-agent driver modeling. In addition, GAIL is domain agnostic, making it difficult to encode specific knowledge relevant to driving in the learning process. This paper describes Reward Augmented Imitation Learning (RAIL), which modifies the reward signal to provide domain-specific knowledge to the agent. Finally, human demonstrations are dependent upon latent factors that may not be captured by GAIL. This paper describes Burn-InfoGAIL, which allows for disentanglement of latent variability in demonstrations. Imitation learning experiments are performed using NGSIM, a real-world highway driving dataset. Experiments show that these modifications to GAIL can successfully model highway driving behavior, accurately replicating human demonstrations and generating realistic, emergent behavior in the traffic flow arising from the interaction between driving agents.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/tiv.2022.3167103

A Survey on Trajectory-Prediction Methods for Autonomous Driving

01 Sep 2022

TL;DR: A comprehensive and comparative review of trajectory prediction methods for autonomous driving can be found in this article , where the authors evaluate the performance of each kind of method and outline potential research directions to guide readers.

...read moreread less

276

Journal Article•10.1109/TIV.2022.3167103

A Survey on Trajectory-Prediction Methods for Autonomous Driving

Yanjun Huang, +5 more

- 01 Sep 2022

- IEEE transactions on intelligent vehicle...

TL;DR: A comprehensive and comparative review of trajectory prediction methods for autonomous driving can be found in this paper , where the authors evaluate the performance of each kind of method and outline potential research directions to guide readers.

...read moreread less

252

•Proceedings Article•10.1109/CVPR46437.2021.01026

TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors

Simon Suo, +3 more

- 01 Jun 2021

TL;DR: In this paper, the authors propose TrafficSim, a multi-agent behavior model for realistic traffic simulation, in which the policy is parameterized with an implicit la-tent variable model that generates socially consistent plans for all actors in the scene jointly.

...read moreread less

230

Journal Article•10.48550/arxiv.2309.02473

A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges

Maryam Zare, +3 more

- 05 Sep 2023

- arXiv.org

TL;DR: The goal of the paper is to provide a comprehensive guide to the growing field of IL in robotics and AI, where desired behavior is learned by imitating an expert's behavior, which is provided through demonstrations.

...read moreread less

35

•Journal Article•10.1109/tiv.2023.3279425

A Review of Driving Style Recognition Methods From Short-Term and Long-Term Perspectives

Hong Chu, +7 more

- 01 Jan 2023

- IEEE transactions on intelligent vehicle...

TL;DR: In this article , the authors survey related advances in driving style recognition along short and long-term pipelines and discuss the potential applications of driving styles recognition in intelligent vehicles. But, most works fail to consider the influence of deploying the recognition results on the vehicle side.

...read moreread less

23

...

Expand

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

138.5K

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

•Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

88.4K

•Posted Content

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 22 Dec 2014

- arXiv: Learning

TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.

...read moreread less

82.5K

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

...read moreread less

48.6K

...

Expand

Modeling Human Driving Behavior through Generative Adversarial Imitation Learning.

Chat with Paper

AI Agents for this Paper

Citations

A Survey on Trajectory-Prediction Methods for Autonomous Driving

A Survey on Trajectory-Prediction Methods for Autonomous Driving

TrafficSim: Learning to Simulate Realistic Multi-Agent Behaviors

A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges

A Review of Driving Style Recognition Methods From Short-Term and Long-Term Perspectives

References

Adam: A Method for Stochastic Optimization

Long short-term memory

ImageNet Classification with Deep Convolutional Neural Networks

Adam: A Method for Stochastic Optimization

Generative Adversarial Nets

Related Papers (5)

Multi-Agent Imitation Learning for Driving Simulation

Inferring The Latent Structure of Human Decision-Making from Raw Visual Inputs

Accelerating reinforcement learning through imitation

ADAIL: Adaptive Adversarial Imitation Learning

Expressing Diverse Human Driving Behavior with Probabilistic Rewards and Online Inference