Robust Multi-Agent Reinforcement Learning Method Based on Adversarial Domain Randomization for Real-World Dual-UAV Cooperation

doi:10.1109/tiv.2023.3307134

Journal Article10.1109/tiv.2023.3307134

Robust Multi-Agent Reinforcement Learning Method Based on Adversarial Domain Randomization for Real-World Dual-UAV Cooperation

- IEEE transactions on intelligent vehicle...

12

TL;DR: An adversarial domain randomization method that utilizes an adversarial generator as a “nature player” to generate a more reasonable training environment so that the trained decision policy can deal with complex situations is proposed.

Abstract: A control system of multiple unmanned aerial vehicles (multi-UAV) is generally very complex when they complete a task in a closely-cooperative manner, e.g. two UAVs cooperatively transport a package of goods. Multi-agent reinforcement learning (MARL) offers a promising solution for such a complex control. However, MARL heavily relies on trial-and-error explorations, facing a big challenge in gathering real-world training data. Simulation environments are commonly used to overcome this challenge, i.e., a control policy is trained in a simulation environment and then transferred into a real-world system. But there often exists a gap between simulation and reality and thus a successful transfer is not guaranteed easily. The domain randomization method provides a workable way to bridge this gap. Nevertheless, the traditional one used in a policy training often suffers from slow convergence and results in an unstable decision policy. To address these issues, this article proposes an adversarial domain randomization method. It utilizes an adversarial generator as a “nature player” to generate a more reasonable training environment so that the trained decision policy can deal with complex situations. Additionally, we improve the prioritized experience replay method by which we can sample those critical experiences, increasing the convergence speed of a training without decreasing the performance of the trained policy. We apply our method to a real-world task of dual-UAV cooperative transportation, and experiments illustrate its effectiveness compared to traditional ones.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1109/tiv.2024.3352613

Autonomous Navigation for eVTOL: Review and Future Perspectives

Henglai Wei, +5 more

- IEEE transactions on intelligent vehicle...

TL;DR: This survey paper reviews autonomous navigation capabilities for electric vertical takeoff and landing vehicles (eVTOLs), introducing a novel six-level autonomy concept and highlighting state-of-the-art developments in perception, planning, and control for enhanced eVTOL autonomy in urban environments.

...read moreread less

16

Journal Article•10.1145/3697830

Cybersecurity in Electric and Flying Vehicles: Threats, Challenges, AI Solutions & Future Directions

Hamed Alqahtani, +1 more

- 30 Sep 2024

- ACM Computing Surveys

TL;DR: This study explores cybersecurity challenges in Electric and Flying Vehicles, leveraging AI-driven solutions to mitigate cyber-physical threats, privacy vulnerabilities, and supply chain risks, and outlines future research directions for enhancing transportation system security and reliability.

...read moreread less

4

Journal Article•10.3390/app14041677

Multiagent Reinforcement Learning for Active Guidance Control of Railway Vehicles with Independently Rotating Wheels

Juyao Wei, +3 more

- 19 Feb 2024

- Applied Sciences

TL;DR: A novel data-driven multiagent reinforcement learning (MARL) controller for enhancing the running stability of independently rotating wheels (IRW) and reducing wheel–rail wear and the PER-MADDPG algorithm is compared against existing controllers, demonstrating the superior simulation performance of the proposed algorithm.

...read moreread less

1

Journal Article•10.1109/icra57147.2024.10611035

AdaptAUG: Adaptive Data Augmentation Framework for Multi-Agent Reinforcement Learning

Xin Yu, +5 more

- 13 May 2024

TL;DR: This study presents AdaptAUG, an adaptive data augmentation framework for multi-agent reinforcement learning, which selectively identifies beneficial augmentations to improve sample efficiency and performance in multi-robot tasks, validated through simulated and real-world experiments.

...read moreread less

1

Journal Article•10.1016/j.conengprac.2025.106491

Autonomous UAV last-mile delivery in urban environments: A survey on deep learning and reinforcement learning solutions

Jingrui Guo, +4 more

- Control Engineering Practice

References

•Proceedings Article•10.1109/IROS.2017.8202133

Domain randomization for transferring deep neural networks from simulation to the real world

Josh Tobin, +5 more

- 20 Mar 2017

TL;DR: This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator, and achieves the first successful transfer of a deep neural network trained only on simulated RGB images to the real world for the purpose of robotic control.

...read moreread less

3.5K

•Journal Article•10.1177/0278364919887447

Learning dexterous in-hand manipulation:

OpenAI: Marcin Andrychowicz, +15 more

- 01 Jan 2020

- The International Journal of Robotics Re...

TL;DR: This work uses reinforcement learning (RL) to learn dexterous in-hand manipulation policies that can perform vision-based object reorientation on a physical Shadow Dexterous Hand, and these policies transfer to the physical robot despite being trained entirely in simulation.

...read moreread less

2.1K

•Posted Content

Prioritized Experience Replay

Tom Schaul, +3 more

- 18 Nov 2015

- arXiv: Learning

TL;DR: A framework for prioritizing experience, so as to replay important transitions more frequently, and therefore learn more efficiently, in Deep Q-Networks, a reinforcement learning algorithm that achieved human-level performance across many Atari games.

...read moreread less

2K

•Journal Article•10.1007/BF00992699

Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

Long-Ji Lin

- 01 May 1992

- Machine Learning

TL;DR: This paper compares eight reinforcement learning frameworks: Adaptive heuristic critic (AHC) learning due to Sutton, Q-learning due to Watkins, and three extensions to both basic methods for speeding up learning and two extensions are experience replay, learning action models for planning, and teaching.

...read moreread less

1.9K

•Proceedings Article•10.1109/SSCI47803.2020.9308468

Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey.

Wenshuai Zhao, +2 more

- 24 Sep 2020

- arXiv: Learning

TL;DR: The fundamental background behind sim-to-real transfer in deep reinforcement learning is covered and the main methods being utilized at the moment: domain randomization, domain adaptation, imitation learning, meta-learning and knowledge distillation are overviewed.

...read moreread less

469

...

Expand

Robust Multi-Agent Reinforcement Learning Method Based on Adversarial Domain Randomization for Real-World Dual-UAV Cooperation

Chat with Paper

AI Agents for this Paper

Citations

Autonomous Navigation for eVTOL: Review and Future Perspectives

Cybersecurity in Electric and Flying Vehicles: Threats, Challenges, AI Solutions &amp; Future Directions

Multiagent Reinforcement Learning for Active Guidance Control of Railway Vehicles with Independently Rotating Wheels

AdaptAUG: Adaptive Data Augmentation Framework for Multi-Agent Reinforcement Learning

Autonomous UAV last-mile delivery in urban environments: A survey on deep learning and reinforcement learning solutions

References

Domain randomization for transferring deep neural networks from simulation to the real world

Learning dexterous in-hand manipulation:

Prioritized Experience Replay

Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching

Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey.

Cybersecurity in Electric and Flying Vehicles: Threats, Challenges, AI Solutions & Future Directions