Journal Article10.1109/tiv.2023.3307134
Robust Multi-Agent Reinforcement Learning Method Based on Adversarial Domain Randomization for Real-World Dual-UAV Cooperation
12
TL;DR: An adversarial domain randomization method that utilizes an adversarial generator as a “nature player” to generate a more reasonable training environment so that the trained decision policy can deal with complex situations is proposed.
read more
Abstract: A control system of multiple unmanned aerial vehicles (multi-UAV) is generally very complex when they complete a task in a closely-cooperative manner, e.g. two UAVs cooperatively transport a package of goods. Multi-agent reinforcement learning (MARL) offers a promising solution for such a complex control. However, MARL heavily relies on trial-and-error explorations, facing a big challenge in gathering real-world training data. Simulation environments are commonly used to overcome this challenge, i.e., a control policy is trained in a simulation environment and then transferred into a real-world system. But there often exists a gap between simulation and reality and thus a successful transfer is not guaranteed easily. The domain randomization method provides a workable way to bridge this gap. Nevertheless, the traditional one used in a policy training often suffers from slow convergence and results in an unstable decision policy. To address these issues, this article proposes an adversarial domain randomization method. It utilizes an adversarial generator as a “nature player” to generate a more reasonable training environment so that the trained decision policy can deal with complex situations. Additionally, we improve the prioritized experience replay method by which we can sample those critical experiences, increasing the convergence speed of a training without decreasing the performance of the trained policy. We apply our method to a real-world task of dual-UAV cooperative transportation, and experiments illustrate its effectiveness compared to traditional ones.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Autonomous Navigation for eVTOL: Review and Future Perspectives
Henglai Wei,Baichuan Lou,Zezhong Zhang,Bohang Liang,Fei Wang,Chengqi Lv +5 more
TL;DR: This survey paper reviews autonomous navigation capabilities for electric vertical takeoff and landing vehicles (eVTOLs), introducing a novel six-level autonomy concept and highlighting state-of-the-art developments in perception, planning, and control for enhanced eVTOL autonomy in urban environments.
16
Cybersecurity in Electric and Flying Vehicles: Threats, Challenges, AI Solutions & Future Directions
Hamed Alqahtani,Gulshan Kumar +1 more
TL;DR: This study explores cybersecurity challenges in Electric and Flying Vehicles, leveraging AI-driven solutions to mitigate cyber-physical threats, privacy vulnerabilities, and supply chain risks, and outlines future research directions for enhancing transportation system security and reliability.
4
Multiagent Reinforcement Learning for Active Guidance Control of Railway Vehicles with Independently Rotating Wheels
Juyao Wei,Zhenggang Lu,Zheng Yin,Zhipeng Jing +3 more
TL;DR: A novel data-driven multiagent reinforcement learning (MARL) controller for enhancing the running stability of independently rotating wheels (IRW) and reducing wheel–rail wear and the PER-MADDPG algorithm is compared against existing controllers, demonstrating the superior simulation performance of the proposed algorithm.
1
AdaptAUG: Adaptive Data Augmentation Framework for Multi-Agent Reinforcement Learning
Xin Yu,Yongkai Tian,Li Wang,Pu Feng,Wenjun Wu,Rongye Shi +5 more
- 13 May 2024
TL;DR: This study presents AdaptAUG, an adaptive data augmentation framework for multi-agent reinforcement learning, which selectively identifies beneficial augmentations to improve sample efficiency and performance in multi-robot tasks, validated through simulated and real-world experiments.
1
Autonomous UAV last-mile delivery in urban environments: A survey on deep learning and reinforcement learning solutions
Jingrui Guo,Yangyang Zhou,L. Burlion,Andrey V. Savkin,Chao Huang +4 more
References
Domain randomization for transferring deep neural networks from simulation to the real world
Josh Tobin,Rachel Fong,Alex Ray,Jonas Schneider,Wojciech Zaremba,Pieter Abbeel +5 more
- 20 Mar 2017
TL;DR: This paper explores domain randomization, a simple technique for training models on simulated images that transfer to real images by randomizing rendering in the simulator, and achieves the first successful transfer of a deep neural network trained only on simulated RGB images to the real world for the purpose of robotic control.
3.5K
Learning dexterous in-hand manipulation:
OpenAI: Marcin Andrychowicz,Bowen Baker,Maciek Chociej,Rafal Jozefowicz,Bob McGrew,Jakub Pachocki,Arthur Petron,Matthias Plappert,Glenn Powell,Alex Ray,Jonas Schneider,Szymon Sidor,Josh Tobin,Peter Welinder,Lilian Weng,Wojciech Zaremba +15 more
TL;DR: This work uses reinforcement learning (RL) to learn dexterous in-hand manipulation policies that can perform vision-based object reorientation on a physical Shadow Dexterous Hand, and these policies transfer to the physical robot despite being trained entirely in simulation.
•Posted Content
Prioritized Experience Replay
TL;DR: A framework for prioritizing experience, so as to replay important transitions more frequently, and therefore learn more efficiently, in Deep Q-Networks, a reinforcement learning algorithm that achieved human-level performance across many Atari games.
2K
Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching
TL;DR: This paper compares eight reinforcement learning frameworks: Adaptive heuristic critic (AHC) learning due to Sutton, Q-learning due to Watkins, and three extensions to both basic methods for speeding up learning and two extensions are experience replay, learning action models for planning, and teaching.
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey.
TL;DR: The fundamental background behind sim-to-real transfer in deep reinforcement learning is covered and the main methods being utilized at the moment: domain randomization, domain adaptation, imitation learning, meta-learning and knowledge distillation are overviewed.
469