Deep Reinforcement Learning for Autonomous Driving: A Survey

doi:10.1109/TITS.2021.3054625

Open Access•Journal Article•10.1109/TITS.2021.3054625

B Ravi Kiran, +6 more

- 09 Feb 2021

- IEEE Transactions on Intelligent Transpo...

- pp 1-18

1.2K

TL;DR: This review summarises deep reinforcement learning algorithms, provides a taxonomy of automated driving tasks where (D)RL methods have been employed, highlights the key algorithmic and deployment challenges for real-world autonomous driving agents, discusses the role of simulators in training agents, and finally covers methods to evaluate, test, and robustify existing solutions in RL and imitation learning.

Citations

Journal Article•10.48550/arXiv.2208.14307

Beyond Supervised Continual Learning: a Review

Benedikt Bagus, +2 more

- 30 Aug 2022

- arXiv.org

TL;DR: Literature that studies continual learning (CL) in other settings, such as learning with reduced supervision, fully unsupervised learning, and reinforcement learning, is reviewed, and a simple schema is proposed for classifying CL approaches by their level of autonomy and supervision.

Proceedings Article•10.48550/arXiv.2206.02620

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

Wanqi Xue, +5 more

- 01 Jun 2022

TL;DR: This paper proposes ResAct, a generative model that reconstructs behaviors of the online-serving policy by sampling multiple action estimators, and designs a learning paradigm to train a residual actor that outputs the residual for action improvement.

Journal Article•10.48550/arxiv.2310.20380

Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods

Zhengpeng Xie, +2 more

- 31 Oct 2023

- arXiv.org

TL;DR: A general reinforcement learning framework applicable to mainstream policy optimization methods is introduced, and the dropout technique is applied to the PPO algorithm to obtain the D-PPO variant.

•Posted Content•10.48550/arxiv.2306.05726

In-Sample Policy Iteration for Offline Reinforcement Learning

- 09 Jun 2023

TL;DR: The authors propose an algorithm employing in-sample policy iteration that substantially enhances behavior-regularized methods in offline RL; the policy gradually improves itself while implicitly avoiding queries to out-of-sample actions, averting catastrophic learning failures.

•Posted Content•10.1101/2022.09.25.509419

Top-down design of protein nanomaterials with reinforcement learning

Isaac D. Lutz, +11 more

- 25 Sep 2022

- bioRxiv

TL;DR: A top-down reinforcement learning-based approach to protein nanomaterial design is proposed, in which both the structures of the subunits and the interactions between them are built up coordinately in the context of the entire assembly.

References

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process is proposed, in which two models are trained simultaneously: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

48.6K

•Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

- 01 Jan 1998

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

39.7K

•Posted Content

Proximal Policy Optimization Algorithms

John Schulman, +4 more

- 20 Jul 2017

- arXiv: Learning

TL;DR: A new family of policy gradient methods for reinforcement learning is proposed; the methods alternate between sampling data through interaction with the environment and optimizing a "surrogate" objective function using stochastic gradient ascent.

18K