MW-MADDPG: a meta-learning based decision-making method for collaborative UAV swarm.

Minrui Zhao Gang Wang Qiang Fu Xiangke Guo Yu Chen Tengda Li XiangYu Liu

Front Neurorobot

College of Air and Missile Defense, Air Force Engineering University, Xi'an, China.

Published: September 2023

Unmanned Aerial Vehicles (UAVs) have gained popularity due to their low lifecycle cost and minimal human risk, resulting in their widespread use in recent years. In the UAV swarm cooperative decision-making domain, multi-agent deep reinforcement learning has significant potential. However, current approaches are challenged by the multivariate mission environment and mission time constraints. To address this problem, the present study proposes a meta-learning based multi-agent deep reinforcement learning approach: an improved MAML-based multi-agent deep deterministic policy gradient (MADDPG) algorithm that achieves an unbiased initialization network by automatically assigning weights to meta-learning trajectories. In addition, a Reward-TD prioritized experience replay technique is introduced, which takes into account both immediate reward and TD-error to improve the resilience and sample utilization of the algorithm. Experimental results show that the proposed approach effectively accomplishes the task in a new scenario, with significantly improved task success rate, average reward, and robustness compared to existing methods.
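
The abstract does not state how immediate reward and TD-error are combined, so the following is only a minimal sketch of one possible reading of a Reward-TD priority, not the authors' formula. The function names, the min-max normalization, the `mix` weight, and the `eps` floor are all assumptions made for illustration.

```python
import random


def _min_max(values):
    """Scale values to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        return [0.0 for _ in values]
    return [(v - lo) / span for v in values]


def reward_td_priorities(rewards, td_errors, mix=0.5, eps=1e-6):
    """Blend immediate reward and |TD-error| into one priority per transition.

    ASSUMPTION: a convex combination of the two normalized signals.
    `mix` (weight on reward) and `eps` (keeps every priority positive)
    are hypothetical parameters, not taken from the paper.
    """
    if len(rewards) != len(td_errors):
        raise ValueError("rewards and td_errors must have the same length")
    if not rewards:
        return []
    r_norm = _min_max(rewards)
    td_norm = _min_max([abs(d) for d in td_errors])
    return [mix * r + (1.0 - mix) * t + eps for r, t in zip(r_norm, td_norm)]


def sample_indices(priorities, batch_size, rng=None):
    """Draw replay indices with probability proportional to priority."""
    rng = rng or random.Random()
    return rng.choices(range(len(priorities)), weights=priorities, k=batch_size)
```

Under these assumptions, a transition with both a high reward and a large TD-error is replayed most often, while the `eps` floor keeps low-priority transitions from never being sampled.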

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10551453
DOI: http://dx.doi.org/10.3389/fnbot.2023.1243174

Similar Publications

Breast radiation therapy fluence painting with multi-agent deep reinforcement learning.

Med Phys

January 2025

Department of Radiation Oncology, Duke University, North Carolina, USA.

Yang Dongrong Li Xinyi Yoo Sua Blitzblau Rachel McDuff Susan

Background: The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.

Purpose: To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.


Task Offloading with LLM-Enhanced Multi-Agent Reinforcement Learning in UAV-Assisted Edge Computing.

Sensors (Basel)

December 2024

School of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China.

Feifan Zhu Fei Huang Yantao Yu Guojin Liu Tiancong Huang

Unmanned aerial vehicles (UAVs) furnished with computational servers enable user equipment (UE) to offload complex computational tasks, thereby addressing the limitations of edge computing in remote or resource-constrained environments. The application of value decomposition algorithms for UAV trajectory planning has drawn considerable research attention. However, existing value decomposition algorithms commonly encounter obstacles in effectively associating local observations with the global state of UAV clusters, which hinders their task-solving capabilities and gives rise to reduced task completion rates and prolonged convergence times.


Multi-Agent Reinforcement Learning-Based Computation Offloading for Unmanned Aerial Vehicle Post-Disaster Rescue.

Sensors (Basel)

December 2024

School of Computer Science and Engineering, Northeastern University, Shenyang 110000, China.

Lixing Wang Huirong Jiao

Natural disasters cause significant losses. Unmanned aerial vehicles (UAVs) are valuable in rescue missions but need to offload tasks to edge servers due to their limited computing power and battery life. This study proposes a task offloading decision algorithm called the multi-agent deep deterministic policy gradient with cooperation and experience replay (CER-MADDPG), which is based on multi-agent reinforcement learning for UAV computation offloading.


Electromagnetic metamaterial agent.

Light Sci Appl

January 2025

State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.

Shengguo Hu Mingyi Li Jiawen Xu Hongrui Zhang Shanghang Zhang

Metamaterials have revolutionized wave control; in the last two decades, they evolved from passive devices via programmable devices to sensor-endowed self-adaptive devices realizing a user-specified functionality. Although deep-learning techniques play an increasingly important role in metamaterial inverse design, measurement post-processing and end-to-end optimization, their role is ultimately still limited to approximating specific mathematical relations; the metamaterial is still limited to serving as proxy of a human operator, realizing a predefined functionality. Here, we propose and experimentally prototype a paradigm shift toward a metamaterial agent (coined metaAgent) endowed with reasoning and cognitive capabilities enabling the autonomous planning and successful execution of diverse long-horizon tasks, including electromagnetic (EM) field manipulations and interactions with robots and humans.


A multi-agent reinforcement learning based approach for automatic filter pruning.

Sci Rep

December 2024

College of Sciences, National University of Defense Technology, 410073, Changsha, China.

Zhemin Li Xiaojing Zuo Yiping Song Dong Liang Zheng Xie

Deep Convolutional Neural Networks (DCNNs), due to their high computational and memory requirements, face significant challenges in deployment on resource-constrained devices. Network Pruning, an essential model compression technique, contributes to enabling the efficient deployment of DCNNs on such devices. Compared to traditional rule-based pruning methods, Reinforcement Learning(RL)-based automatic pruning often yields more effective pruning strategies through its ability to learn and adapt.
