Consensus, cooperative learning, and flocking for multiagent predator avoidance.

Int J Adv Robot Syst

Department of Computer Science and Engineering, Advanced Robotics and Automation (ARA) Laboratory, University of Nevada, Reno, NV, USA.

Published: September 2020

Multiagent coordination is highly desirable with many uses in a variety of tasks. In nature, the phenomenon of coordinated flocking is highly common with applications related to defending or escaping from predators. In this article, a hybrid multiagent system that integrates consensus, cooperative learning, and flocking control to determine the direction of attacking predators and learns to flock away from them in a coordinated manner is proposed. This system is entirely distributed requiring only communication between neighboring agents. The fusion of consensus and collaborative reinforcement learning allows agents to cooperatively learn in a variety of multiagent coordination tasks, but this article focuses on flocking away from attacking predators. The results of the flocking show that the agents are able to effectively flock to a target without collision with each other or obstacles. Multiple reinforcement learning methods are evaluated for the task with cooperative learning utilizing function approximation for state-space reduction performing the best. The results of the proposed consensus algorithm show that it provides quick and accurate transmission of information between agents in the flock. Simulations are conducted to show and validate the proposed hybrid system in both one and two predator environments, resulting in an efficient cooperative learning behavior. In the future, the system of using consensus to determine the state and reinforcement learning to learn the states can be applied to additional multiagent tasks.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8609419PMC
http://dx.doi.org/10.1177/1729881420960342DOI Listing

Publication Analysis

Top Keywords

cooperative learning
16
reinforcement learning
12
consensus cooperative
8
learning flocking
8
multiagent coordination
8
attacking predators
8
learning
7
consensus
5
flocking
5
multiagent
5

Similar Publications

Unlabelled: In most cancers, including endometrial cancer, tumor suppressor genes harboring inactivating mutations have been systematically cataloged. However, locus-specific epigenetic alterations contributing to cancer initiation and progression remain only partly described, creating knowledge gaps about functionally significant tumor suppressors and underlying mechanisms associated with their inactivation. Here, we show that PAX2 is an endometrial tumor suppressor recurrently inactivated by a distinct epigenetic reprogramming event not associated with promoter hypermethylation.

View Article and Find Full Text PDF

Background And Aims: Artificial Intelligence (AI) beginning to integrate in healthcare, is ushering in a transformative era, impacting diagnostics, altering personalized treatment, and significantly improving operational efficiency. The study aims to describe AI in healthcare, including important technologies like robotics, machine learning (ML), deep learning (DL), and natural language processing (NLP), and to investigate how these technologies are used in patient interaction, predictive analytics, and remote monitoring. The goal of this review is to present a thorough analysis of AI's effects on healthcare while providing stakeholders with a road map for navigating this changing environment.

View Article and Find Full Text PDF

Background: Advances in artificial intelligence and machine learning have facilitated the creation of mortality prediction models which are increasingly used to assess quality of care and inform clinical practice. One open question is whether a hospital should utilize a mortality model trained from a diverse nationwide dataset or use a model developed primarily from their local hospital data.

Objective: To compare performance of a single-hospital, 30-day all-cause mortality model against an established national benchmark on the task of mortality prediction.

View Article and Find Full Text PDF

MACRPO: Multi-agent cooperative recurrent policy optimization.

Front Robot AI

December 2024

Intelligent Robotics Group, Electrical Engineering and Automation Department, Aalto University, Helsinki, Finland.

This work considers the problem of learning cooperative policies in multi-agent settings with partially observable and non-stationary environments without a communication channel. We focus on improving information sharing between agents and propose a new multi-agent actor-critic method called (MACRPO). We propose two novel ways of integrating information across agents and time in MACRPO: First, we use a recurrent layer in the critic's network architecture and propose a new framework to use the proposed meta-trajectory to train the recurrent layer.

View Article and Find Full Text PDF

Targeted therapy and immunotherapy drugs for oncology have greater efficacy and tolerability than cytotoxic chemotherapeutic drugs. However, the cutaneous adverse drug reactions associated with these newer therapies are more common and remain poorly predicted. An effective prediction model is urgently needed and essential.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!