Imagination-Augmented Reinforcement Learning Framework for Variable Speed Limit Control

doi:10.1109/tits.2023.3316285

Journal Article•10.1109/tits.2023.3316285

4

TL;DR: The proposed Imagination-Augmented Agent (I2A) consists of an imagination path and a model-free path, which work together to generate control actions; it outperforms the other tested Reinforcement Learning (RL) agents in terms of Total Time Spent and bottleneck volume.


Citations

Journal Article•10.48550/arxiv.2311.10309

Imagination-augmented Hierarchical Reinforcement Learning for Safe and Interactive Autonomous Driving in Urban Environments

Sang-Hyun Lee, +2 more

- 17 Nov 2023

- arXiv.org

TL;DR: IAHRL efficiently integrates imagination into HRL to enable an agent to learn safe and interactive behaviors in real-world navigation tasks and introduces a new attention mechanism that allows the high-level policy to be permutation-invariant to the order of surrounding objects and to prioritize the authors' agent over them.

2

Journal Article•10.1016/j.eswa.2025.129958

Exploring mechanisms of integrating global perception prediction for connected vehicles with lane-specific reinforcement learning-based variable speed limits

Li Song, +5 more

- 07 Oct 2025

- Expert systems with applications

Journal Article•10.3390/su17219831

Dynamic Pricing for Wireless Charging Lane Management Based on Deep Reinforcement Learning

Fan Liu, +5 more

- 04 Nov 2025

- Sustainability

Abstract: We consider a dynamic pricing problem in a double-lane system consisting of one general purpose lane and one wireless charging lane (WCL). The electricity price is dynamically adjusted to affect the lane-choice behaviors of incoming electric vehicles (EVs), thereby regulating the traffic assignment between the two lanes with both traffic operation efficiency and charging service efficiency considered in the control objective. We first establish an agent-based dynamic double-lane traffic system model, whereby each EV acts as an agent with distinct behavioral and operational characteristics. Then, a deep Q-learning algorithm is proposed to derive the optimal pricing decisions. A regression tree (CART) algorithm is also designed for benchmarking. The simulation results reveal that the deep Q-learning algorithm demonstrates superior capability in optimizing dynamic pricing strategies compared to CART by more effectively leveraging system dynamics and future traffic demand information, and both outperform the static pricing strategy. This study serves as a pioneering work to explore dynamic pricing issues for WCLs.

Proceedings Article•10.1109/icdcs60910.2024.00109

Leveraging CAVs to Improve Traffic Efficiency: An MARL-Based Approach

Weizhen Han, +5 more

- 23 Jul 2024

TL;DR: This paper proposes a MARL-based approach, MACA, for collaborative path planning of CAVs and CVs to reduce traffic congestion and improve efficiency in urban scenarios, achieving up to 10.9% travel time reduction for CVs and 6.5% queue length reduction.

References

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

51.8K

•Posted Content

Proximal Policy Optimization Algorithms

John Schulman, +4 more

- 20 Jul 2017

- arXiv: Learning

TL;DR: Proposes a new family of policy gradient methods for reinforcement learning that alternate between sampling data through interaction with the environment and optimizing a "surrogate" objective function using stochastic gradient ascent.

18K

•Journal Article•10.1007/BF00992698

Technical Note: Q-Learning

Chris Watkins, +1 more

- 01 May 1992

- Machine Learning

TL;DR: This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins (1989), showing that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action-values are represented discretely.

12K

•Proceedings Article

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Richard S. Sutton, +3 more

- 29 Nov 1999

TL;DR: This paper proves for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.

7.1K

Human-level control through deep reinforcement learning
