Spatial-Temporal-Aware Safe Multi-Agent Reinforcement Learning of Connected Autonomous Vehicles in Challenging Scenarios

doi:10.1109/icra48891.2023.10161216

Proceedings Article

29 May 2023

8

TL;DR: This paper proposes a constrained multi-agent reinforcement learning (MARL) approach with a parallel Safety Shield for CAVs in challenging driving scenarios that include unconnected hazard vehicles, with the aim of improving the safety and efficiency of the system in dynamic, complicated driving scenarios.

Citations

Journal Article•10.1016/j.geits.2024.100156

A Review on Reinforcement Learning-based Highway Autonomous Vehicle Control

Ali Irshayyid, +2 more

- 01 Jan 2024

- Green Energy and Intelligent Transportat...

TL;DR: This review examines recent advancements in deep reinforcement learning (DRL) for autonomous vehicle control, focusing on highway lane change, ramp merge, and platoon coordination, highlighting similarities, differences, and best practices in DRL formulations and training algorithms.

9

Journal Article•10.1109/mits.2023.3335126

A Survey of Integrated Simulation Environments for Connected Automated Vehicles: Requirements, Tools, and Architecture

Vitaly Stepanyants, +1 more

- IEEE Intelligent Transportation Systems ...

TL;DR: A survey of integrated simulation environments for connected automated vehicles identifies challenges and proposes an architecture for an integrated simulation environment with full domain coverage.

8

•Posted Content•10.36227/techrxiv.22817417.v1

Cooperative Decision-Making for CAVs at Unsignalized Intersections: A MARL Approach with Attention and Hierarchical Game Priors

17 May 2023

TL;DR: In this article, a multi-agent game-prior attention Deep Deterministic Policy Gradient (MA-GA-DDPG) is proposed to solve decision-making problems in complex human-machine mixed traffic scenarios, such as unsignalized intersections.

1

Journal Article•10.1109/comst.2024.3423319

CrowdTransfer: Enabling Crowd Knowledge Transfer in AIoT Community

Yan Liu, +5 more

- 01 Jan 2024

- IEEE Communications Surveys and Tutorial...

TL;DR: A new concept of knowledge transfer, referred to as Crowd Knowledge Transfer (CrowdTransfer), is introduced; it aims to transfer prior knowledge learned from a crowd of agents to reduce training cost and improve model performance in complicated real-world scenarios.

Journal Article•10.23919/ecc65951.2025.11187043

A Learning-Based Control Barrier Function for Car-Like Robots: Toward Less Conservative Collision Avoidance

Jianye Xu, +1 more

- 24 Jun 2025

TL;DR: A learning-based Control Barrier Function for car-like robots reduces conservatism in collision avoidance by incorporating robot headings and shapes, approximated with a neural network, to estimate safe regions and improve navigation in dense environments.

References

•Proceedings Article

Asynchronous methods for deep reinforcement learning

Volodymyr Mnih, +7 more

- 19 Jun 2016

TL;DR: A conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers and shows that asynchronous actor-critic succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes using a visual input.

9.2K

•Posted Content

End to End Learning for Self-Driving Cars

Mariusz Bojarski, +12 more

- 25 Apr 2016

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A convolutional neural network is trained to map raw pixels from a single front-facing camera directly to steering commands and it is argued that this will eventually lead to better performance and smaller systems.

4.6K

•Proceedings Article•10.1109/ICCV.2015.312

DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving

Chenyi Chen, +3 more

- 07 Dec 2015

TL;DR: This paper proposes to map an input image to a small number of key perception indicators that directly relate to the affordance of a road/traffic state for driving and argues that the direct perception representation provides the right level of abstraction.

2K

•Posted Content

Counterfactual Multi-Agent Policy Gradients

Jakob Foerster, +4 more

- 24 May 2017

- arXiv: Artificial Intelligence

TL;DR: A new multi-agent actor-critic method called counterfactual multi-agent (COMA) policy gradients, which uses a centralised critic to estimate the Q-function and decentralised actors to optimise the agents' policies.

1.1K

•Proceedings Article•10.23919/ECC.2019.8796030

Control Barrier Functions: Theory and Applications

Aaron D. Ames, +5 more

- 25 Jun 2019

TL;DR: In this paper, the authors provide an introduction and overview of control barrier functions and their use to verify and enforce safety properties in the context of (optimization based) safety-critical controllers.

1.1K