Heterogeneous multiagent systems are characterized by diverse task distributions, which are prevalent in practical scenarios, such as distributed decision making and robotic collaboration. A significant challenge in these systems is the constraint of limited observations, where each agent has access only to partial information. Many studies facilitate information exchange by employing shared parameters among agents. However, this approach is generally more effective for homogeneous systems where agents have similar observation or action spaces. In heterogeneous systems, indiscriminate parameter sharing can significantly increase the exploration cost required for effective adaptation. To address this challenge, we propose a novel communication complementary graph model (CCGM) for enhancing collaboration in heterogeneous multiagent systems. Our approach builds upon the training framework of heterogeneous agent reinforcement learning (HARL) with trust region learning and nonparameter sharing. This model utilizes advantage function decomposition and sequential updates to promote policy convergence. Within this framework, we introduce a novel communication method inspired by signaling games, where agents acting as receivers, process messages from other agents alongside their own observations. CCGM aligns the messages with observations in a graph-based communication module, which establishes communication relationships and supplements observational information. Subsequently, agents generate self-interested information, which they then share with others as senders. We evaluate our algorithm across various environments, including multiagent particle environments (MPE) and multiagent MuJoCo (MAMuJoCo) robot experiments. The results demonstrate the effectiveness of CCGM in enhancing HARL-based algorithms.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TCYB.2024.3453892 | DOI Listing |
Mol Ther
December 2024
State Key Laboratory of Medicinal Chemical Biology, College of Life Sciences and Frontiers Science Center for Cell Responses, Nankai University, Tianjin 300071, China; Beijing Institute of Biological Products Company Limited and CNBG-Nankai University Joint Research and Development Center, Beijing 100176, China; Shanghai Institute for Advanced Immunochemical Studies, ShanghaiTech University, Shanghai, China. Electronic address:
Oncolytic viruses have been considered promising cancer immunotherapies. However, oncovirotherapy agents impart durable responses in only a subset of cancer patients. Thus, exploring the cellular and molecular mechanisms underlying the heterogeneous responses in patients can provide guidance to develop more effective oncolytic virus therapies.
View Article and Find Full Text PDFISA Trans
November 2024
The School of Electrical and Mechanical Engineering, University of Adelaide, Adelaide, SA, 5005, Australia. Electronic address:
This paper focuses on the design of event-triggered observer-based heterogeneous memory controllers for leader-following multi-agent systems with time-varying topology. In order to save limited on-board resources, a novel adaptive event-triggered strategy based on the nonlinear transformation law of the estimation error is proposed in this paper, which can effectively reduce some unnecessary data transmission due to small fluctuations after the estimation error converges. Then, a more general topology structure described by an interval type-2 fuzzy model is adopted, which contains both nonlinear time-varying law and uncertain parameters.
View Article and Find Full Text PDFSensors (Basel)
November 2024
Graduate School of Informatics, Nagoya University, Furo-cho, Chikusa-Ward, Nagoya 464-8601, Japan.
The accurate prediction of vehicle behavior is crucial for autonomous driving systems, impacting their safety and efficiency in complex urban environments. To address the challenge of multi-agent trajectory prediction, we propose a novel model integrating multiple input modalities, including historical trajectories, map data, vehicle features, and interaction information. Our approach employs a Conditional Variational Autoencoder (CVAE) framework with a decoder that predicts control actions using the Gaussian Mixture Model (GMM) and then converts these actions into dynamically feasible trajectories through a bicycle model.
View Article and Find Full Text PDFThis article investigates the tracking control problem of heterogeneous multiagent systems (MASs) with intrinsic nonlinear dynamics in noisy and time-delayed environments. First, a stability criterion for nonlinear stochastic delay systems with multiplicative noise and time-varying delay is proposed by applying the appropriate Lyapunov-Krasovskii functional. Then, based on the proposed stability criterion, sufficient conditions are derived for mean square (m.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
October 2024
In this paper, we study the application of Vehicle-to-Everything (V2X) communication to improve the perception performance of autonomous vehicles. We present V2X-ViTs, a robust cooperative perception framework with V2X communication using novel vision Transformer models. First, we present V2X-ViTv1 containing holistic attention modules that can effectively fuse information across on-road agents (i.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!