Learning to school in dense configurations with multi-agent deep reinforcement learning.

Yi Zhu Jian-Hua Pang Tong Gao Fang-Bao Tian

Bioinspir Biomim

School of Engineering and Information Technology, University of New South Wales, Canberra, ACT 2600, Australia.

Published: November 2022

Fish are observed to school in different configurations. However, how and why fish maintain a stable schooling formation still remains unclear. This work presents a numerical study of the dense schooling of two free swimmers by a hybrid method of the multi-agent deep reinforcement learning and the immersed boundary-lattice Boltzmann method. Active control policies are developed by synchronously training the leader to swim at a given speed and orientation and the follower to hold close proximity to the leader. After training, the swimmers could resist the strong hydrodynamic force to remain in stable formations and meantime swim in desired path, only by their tail-beat flapping. The tail movement of the swimmers in the stable formations are irregular and asymmetrical, indicating the swimmers are carefully adjusting their body-kinematics to balance the hydrodynamic force. In addition, a significant decrease in the mean amplitude and the cost of transport is found for the followers, indicating these swimmers could maintain the swimming speed with less efforts. The results also show that the side-by-side formation is hydrodynamically more stable but energetically less efficient than other configurations, while the full-body staggered formation is energetically more efficient as a whole.

Download full-text PDF	Source
http://dx.doi.org/10.1088/1748-3190/ac9fb5	DOI Listing

Publication Analysis

Top Keywords

multi-agent deep

deep reinforcement

reinforcement learning

hydrodynamic force

stable formations

indicating swimmers

energetically efficient

swimmers

learning school

school dense

Similar Publications

Breast radiation therapy fluence painting with multi-agent deep reinforcement learning.

Med Phys

January 2025

Department of Radiation Oncology, Duke University, North Carolina, USA.

Yang Dongrong Li Xinyi Yoo Sua Blitzblau Rachel McDuff Susan

Background: The electronic compensation (ECOMP) technique for breast radiation therapy provides excellent dose conformity and homogeneity. However, the manual fluence painting process presents a challenge for efficient clinical operation.

Purpose: To facilitate the clinical treatment planning automation of breast radiation therapy, we utilized reinforcement learning (RL) to develop an auto-planning tool that iteratively edits the fluence maps under the guidance of clinically relevant objectives.

View Article and Find Full Text PDF

Similar Publications

Task Offloading with LLM-Enhanced Multi-Agent Reinforcement Learning in UAV-Assisted Edge Computing.

Sensors (Basel)

December 2024

School of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China.

Feifan Zhu Fei Huang Yantao Yu Guojin Liu Tiancong Huang

Unmanned aerial vehicles (UAVs) furnished with computational servers enable user equipment (UE) to offload complex computational tasks, thereby addressing the limitations of edge computing in remote or resource-constrained environments. The application of value decomposition algorithms for UAV trajectory planning has drawn considerable research attention. However, existing value decomposition algorithms commonly encounter obstacles in effectively associating local observations with the global state of UAV clusters, which hinders their task-solving capabilities and gives rise to reduced task completion rates and prolonged convergence times.

View Article and Find Full Text PDF

Similar Publications

Multi-Agent Reinforcement Learning-Based Computation Offloading for Unmanned Aerial Vehicle Post-Disaster Rescue.

Sensors (Basel)

December 2024

School of Computer Science and Engineering, Northeastern University, Shenyang 110000, China.

Lixing Wang Huirong Jiao

Natural disasters cause significant losses. Unmanned aerial vehicles (UAVs) are valuable in rescue missions but need to offload tasks to edge servers due to their limited computing power and battery life. This study proposes a task offloading decision algorithm called the multi-agent deep deterministic policy gradient with cooperation and experience replay (CER-MADDPG), which is based on multi-agent reinforcement learning for UAV computation offloading.

View Article and Find Full Text PDF

Similar Publications

Electromagnetic metamaterial agent.

Light Sci Appl

January 2025

State Key Laboratory of Advanced Optical Communication Systems and Networks, School of Electronics, Peking University, Beijing, 100871, China.

Shengguo Hu Mingyi Li Jiawen Xu Hongrui Zhang Shanghang Zhang

Metamaterials have revolutionized wave control; in the last two decades, they evolved from passive devices via programmable devices to sensor-endowed self-adaptive devices realizing a user-specified functionality. Although deep-learning techniques play an increasingly important role in metamaterial inverse design, measurement post-processing and end-to-end optimization, their role is ultimately still limited to approximating specific mathematical relations; the metamaterial is still limited to serving as proxy of a human operator, realizing a predefined functionality. Here, we propose and experimentally prototype a paradigm shift toward a metamaterial agent (coined metaAgent) endowed with reasoning and cognitive capabilities enabling the autonomous planning and successful execution of diverse long-horizon tasks, including electromagnetic (EM) field manipulations and interactions with robots and humans.

View Article and Find Full Text PDF

Similar Publications

A multi-agent reinforcement learning based approach for automatic filter pruning.

Sci Rep

December 2024

College of Sciences, National University of Defense Technology, 410073, Changsha, China.

Zhemin Li Xiaojing Zuo Yiping Song Dong Liang Zheng Xie

Deep Convolutional Neural Networks (DCNNs), due to their high computational and memory requirements, face significant challenges in deployment on resource-constrained devices. Network Pruning, an essential model compression technique, contributes to enabling the efficient deployment of DCNNs on such devices. Compared to traditional rule-based pruning methods, Reinforcement Learning(RL)-based automatic pruning often yields more effective pruning strategies through its ability to learn and adapt.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!