Publications by authors named "M J Lanctot"

Games have a long history as benchmarks for progress in artificial intelligence. Approaches using search and learning produced strong performance across many perfect information games, and approaches using game-theoretic reasoning and learning demonstrated strong performance for specific imperfect information poker variants. We introduce Student of Games, a general-purpose algorithm that unifies previous approaches, combining guided search, self-play learning, and game-theoretic reasoning.

View Article and Find Full Text PDF

We introduce DeepNash, an autonomous agent that plays the imperfect information game Stratego at a human expert level. Stratego is one of the few iconic board games that artificial intelligence (AI) has not yet mastered. It is a game characterized by a twin challenge: It requires long-term strategic thinking as in chess, but it also requires dealing with imperfect information as in poker.

View Article and Find Full Text PDF

We introduce α-Rank, a principled evolutionary dynamics methodology, for the evaluation and ranking of agents in large-scale multi-agent interactions, grounded in a novel dynamical game-theoretic solution concept called Markov-Conley chains (MCCs). The approach leverages continuous-time and discrete-time evolutionary dynamical systems applied to empirical games, and scales tractably in the number of agents, in the type of interactions (beyond dyadic), and the type of empirical games (symmetric and asymmetric). Current models are fundamentally limited in one or more of these dimensions, and are not guaranteed to converge to the desired game-theoretic solution concept (typically the Nash equilibrium).

View Article and Find Full Text PDF

The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play.

View Article and Find Full Text PDF

Adding toroidal arrays of magnetic probes at the top and bottom of NSTX-U would improve both the detection of the multimodal plasma response to applied magnetic perturbations and the identification of the poloidal structure of unstable plasma modes, as well as contribute to the validation of MHD models, improve the understanding of the plasma response to external fields, and improve the error field correction. In this paper, the linear MHD code MARS-F/K has been used to identify poloidal locations that would improve the capability to measure stationary or near-stationary 3D fields that may result from the plasma response to external sources of non-axisymmetric fields. The study highlighted 6 poloidal positions where new arrays of both poloidal and radial magnetic field sensors would improve the poloidal resolution.

View Article and Find Full Text PDF