AlphaZe∗∗: AlphaZero-like baselines for imperfect information games are surprisingly strong.

Front Artif Intell

Artificial Intelligence and Machine Learning Lab, Technical University of Darmstadt, Darmstadt, Germany.

Published: May 2023

In recent years, deep neural networks for strategy games have made significant progress. AlphaZero-like frameworks which combine Monte-Carlo tree search with reinforcement learning have been successfully applied to numerous games with perfect information. However, they have not been developed for domains where uncertainty and unknowns abound, and are therefore often considered unsuitable due to imperfect observations. Here, we challenge this view and argue that they are a viable alternative for games with imperfect information-a domain currently dominated by heuristic approaches or methods explicitly designed for hidden information, such as oracle-based techniques. To this end, we introduce a novel algorithm based solely on reinforcement learning, called AlphaZe∗∗, which is an AlphaZero-based framework for games with imperfect information. We examine its learning convergence on the games Stratego and DarkHex and show that it is a surprisingly strong baseline, while using a model-based approach: it achieves similar win rates against other Stratego bots like Pipeline Policy Space Response Oracle (P2SRO), while not winning in direct comparison against P2SRO or reaching the much stronger numbers of DeepNash. Compared to heuristics and oracle-based approaches, AlphaZe∗∗ can easily deal with rule changes, e.g., when more information than usual is given, and drastically outperforms other approaches in this respect.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10213697PMC
http://dx.doi.org/10.3389/frai.2023.1014561DOI Listing

Publication Analysis

Top Keywords

surprisingly strong
8
reinforcement learning
8
games imperfect
8
games
6
alphaze∗∗ alphazero-like
4
alphazero-like baselines
4
imperfect
4
baselines imperfect
4
imperfect games
4
games surprisingly
4

Similar Publications

Classical tissue recombination experiments demonstrate that cell-fate determination along the anterior-posterior axis of the Müllerian duct occurs prior to postnatal day 7 in mice. However, little is known about how these cell types are maintained in adults. In this study, we provide genetic evidence that a balance between antagonistic retinoic acid (RA) and estrogen signaling activity is required to maintain simple columnar cell fate in adult uterine epithelium.

View Article and Find Full Text PDF

Plasma proteomic technologies are rapidly evolving and of critical importance to the field of biomedical research. Here we report a technical evaluation of six notable plasma proteomic technologies - unenriched (Neat), Acid depletion, PreOmics ENRICHplus, Mag-Net, Seer Proteograph XT, Olink Explore HT. The methods were compared on proteomic depth, reproducibility, linearity, tolerance to lipid interference, and limit of detection/quantification.

View Article and Find Full Text PDF

Many important processes in cells depend on the transfer of protons through water wires embedded in transmembrane proteins. Herein, we have performed more than 55 μs all-atom simulations of the light-harvesting complex of a diatom, i.e.

View Article and Find Full Text PDF

The interplay between attractive London dispersion forces and steric effects due to repulsive forces resulting from the Pauli principle often determines the geometry and stability of nanostructures. Aromatic polyimides (PI) and carbon nanotubes (CNT) were chosen as building blocks as two components in the hetero delocalized electron nanostructures. Two PIs, having the same diamine part and different linkage substituents between two phenyl rings of dianhydride part, one linked with ether bond (C-O-C) (OPI), the other with C-(CF3)2 (FPI), were investigated.

View Article and Find Full Text PDF

Discovery of Quinazolone Pyridiniums as Potential Broad-Spectrum Antibacterial Agents.

Molecules

January 2025

Institute of Bioorganic & Medicinal Chemistry, Key Laboratory of Applied Chemistry of Chongqing Municipality, School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715, China.

The overprescription of antibiotics in medicine and agriculture has accelerated the development and spread of antibiotic resistance in bacteria, which severely limits the arsenal available to clinicians for treating bacterial infections. This work discovered a new class of heteroarylcyanovinyl quinazolones and quinazolone pyridiniums to surmount the increasingly severe bacterial resistance. Bioactive assays manifested that the highly active compound exhibited strong inhibition against MRSA and with extremely low MICs of 0.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!