In recent years, deep neural networks for strategy games have made significant progress. AlphaZero-like frameworks which combine Monte-Carlo tree search with reinforcement learning have been successfully applied to numerous games with perfect information. However, they have not been developed for domains where uncertainty and unknowns abound, and are therefore often considered unsuitable due to imperfect observations. Here, we challenge this view and argue that they are a viable alternative for games with imperfect information-a domain currently dominated by heuristic approaches or methods explicitly designed for hidden information, such as oracle-based techniques. To this end, we introduce a novel algorithm based solely on reinforcement learning, called AlphaZe∗∗, which is an AlphaZero-based framework for games with imperfect information. We examine its learning convergence on the games Stratego and DarkHex and show that it is a surprisingly strong baseline, while using a model-based approach: it achieves similar win rates against other Stratego bots like Pipeline Policy Space Response Oracle (P2SRO), while not winning in direct comparison against P2SRO or reaching the much stronger numbers of DeepNash. Compared to heuristics and oracle-based approaches, AlphaZe∗∗ can easily deal with rule changes, e.g., when more information than usual is given, and drastically outperforms other approaches in this respect.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10213697 | PMC |
http://dx.doi.org/10.3389/frai.2023.1014561 | DOI Listing |
Proc Natl Acad Sci U S A
February 2025
Division of Dermatology, Department of Medicine, Washington University School of Medicine, St. Louis, MO 63110.
Classical tissue recombination experiments demonstrate that cell-fate determination along the anterior-posterior axis of the Müllerian duct occurs prior to postnatal day 7 in mice. However, little is known about how these cell types are maintained in adults. In this study, we provide genetic evidence that a balance between antagonistic retinoic acid (RA) and estrogen signaling activity is required to maintain simple columnar cell fate in adult uterine epithelium.
View Article and Find Full Text PDFPlasma proteomic technologies are rapidly evolving and of critical importance to the field of biomedical research. Here we report a technical evaluation of six notable plasma proteomic technologies - unenriched (Neat), Acid depletion, PreOmics ENRICHplus, Mag-Net, Seer Proteograph XT, Olink Explore HT. The methods were compared on proteomic depth, reproducibility, linearity, tolerance to lipid interference, and limit of detection/quantification.
View Article and Find Full Text PDFACS Phys Chem Au
January 2025
School of Science, Constructor University, Campus Ring 1, 28759 Bremen, Germany.
Many important processes in cells depend on the transfer of protons through water wires embedded in transmembrane proteins. Herein, we have performed more than 55 μs all-atom simulations of the light-harvesting complex of a diatom, i.e.
View Article and Find Full Text PDFCommun Chem
January 2025
Institute of Physics, Albert-Ludwig-University of Freiburg, Freiburg, Germany.
The interplay between attractive London dispersion forces and steric effects due to repulsive forces resulting from the Pauli principle often determines the geometry and stability of nanostructures. Aromatic polyimides (PI) and carbon nanotubes (CNT) were chosen as building blocks as two components in the hetero delocalized electron nanostructures. Two PIs, having the same diamine part and different linkage substituents between two phenyl rings of dianhydride part, one linked with ether bond (C-O-C) (OPI), the other with C-(CF3)2 (FPI), were investigated.
View Article and Find Full Text PDFMolecules
January 2025
Institute of Bioorganic & Medicinal Chemistry, Key Laboratory of Applied Chemistry of Chongqing Municipality, School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715, China.
The overprescription of antibiotics in medicine and agriculture has accelerated the development and spread of antibiotic resistance in bacteria, which severely limits the arsenal available to clinicians for treating bacterial infections. This work discovered a new class of heteroarylcyanovinyl quinazolones and quinazolone pyridiniums to surmount the increasingly severe bacterial resistance. Bioactive assays manifested that the highly active compound exhibited strong inhibition against MRSA and with extremely low MICs of 0.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!