Cooperation in learning (CL) can be realized in a multiagent system, if agents are capable of learning from both their own experiments and other agents' knowledge and expertise. Extra resources are exploited into higher efficiency and faster learning in CL as compared to that of individual learning (IL). In the real world, however, implementation of CL is not a straightforward task, in part due to possible differences in area of expertise (AOE). In this paper, reinforcement-learning homogenous agents are considered in an environment with multiple goals or tasks. As a result, they become expert in different domains with different amounts of expertness. Each agent uses a one-step Q-learning algorithm and is capable of exchanging its Q-table with those of its teammates. Two crucial questions are addressed in this paper: "How the AOE of an agent can be extracted?" and "How agents can improve their performance in CL by knowing their AOEs?" An algorithm is developed to extract the AOE based on state transitions as a gold standard from a behavioral point of view. Moreover, it is discussed that the AOE can be implicitly obtained through agents' expertness in the state level. Three new methods for CL through the combination of Q-tables are developed and examined for overall performance after CL. The performances of developed methods are compared with that of IL, strategy sharing (SS), and weighted SS (WSS). Obtained results show the superior performance of AOE-based methods as compared to that of existing CL methods, which do not use the notion of AOE. These results are very encouraging in support of the idea that "cooperation based on the AOE" performs better than the general CL methods.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/tsmcb.2006.883264 | DOI Listing |
Scand J Prim Health Care
January 2025
Unit of Physiotherapy, Department of Health and Rehabilitation, Institute of Neuroscience and Physiology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden.
Research has shown that physical activity on prescription (PAP), used in Swedish healthcare, increases patients' physical activity, but data are lacking regarding the long-term effects of PAP on exercise capacity. Therefor exercise capacity was evaluated in patients with metabolic risk factors, after 4.5 years of PAP treatment provided by physiotherapists in primary healthcare.
View Article and Find Full Text PDFPharmacoecon Open
January 2025
Optimax Access Ltd, Kenneth Dibben House, Enterprise Rd, Chilworth, Southampton University Science Park, Southampton, UK.
Background: Patients with a left ventricular ejection fraction ≤ 35% are at increased risk of sudden cardiac death (SCD) within the first months after a myocardial infarction (MI). The wearable cardioverter defibrillator (WCD) is an established, safe and effective solution which can protect patients from SCD during the first months after an MI, when the risk of SCD is at its peak. This study aimed to evaluate the cost-effectiveness of WCD combined with guideline-directed medical therapy (GDMT) compared to GDMT alone, after MI in the English National Health Service (NHS).
View Article and Find Full Text PDFJpn J Ophthalmol
January 2025
Department of Ophthalmology, Eye center, China Medical University Hospital, Taichung City, Taiwan.
Purpose: To compare the efficac and safety of a dual-blade 20,000 cuts per minute (cpm) vitrectomy probe with a single-blade 10,000 cpm probe for primary rhegmatogenous retinal detachment (RRD).
Study Design: Prospective, randomized controlled clinical trial.
Methods: Evaluations were conducted preoperatively, intraoperatively, and at three months postoperatively.
World J Microbiol Biotechnol
January 2025
College of Chemistry and Chemical Engineering, Xi'an Shiyou University, Xi'an, 710065, China.
This paper developed an efficient microbial activator formula and conducted an in-depth study on its efficacy and mechanism in promoting the degradation of petroleum hydrocarbons in oil-contaminated soil. A 60-day microbial remediation experiment conducted on oily soil revealed that the microbial activators significantly boosted the activities of dehydrogenase and catalase, subsequently speeding up the degradation of petroleum hydrocarbons in the soil. The overall degradation rate reached as high as 71.
View Article and Find Full Text PDFLasers Med Sci
January 2025
Universidade Federal de Pelotas, Pelotas, Brazil.
This systematic review aimed to compare postoperative pain in endodontic treatments using PIPS Er: YAG laser-activated irrigation (LAI) versus conventional needle irrigation. An electronic search was conducted to identify randomized clinical trials (RCT) investigating postoperative pain in patients who underwent root canal treatments in permanent teeth using PIPS Er: YAG laser-activated irrigation or conventional needle irrigation. Two reviewers performed study selection, data extraction, risk of bias assessment (RoB 2.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!