Publications by Ioannis Antonoglou

Publications by authors named "Ioannis Antonoglou"

Page 1 of 1

Mastering Atari, Go, chess and shogi by planning with a learned model.

Julian Schrittwieser Ioannis Antonoglou Thomas Hubert Karen Simonyan Laurent Sifre

Nature

December 2020

Constructing agents with planning capabilities has long been one of the main challenges in the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge success in challenging domains, such as chess and Go, where a perfect simulator is available. However, in real-world problems, the dynamics governing the environment are often complex and unknown.

View Article and Find Full Text PDF

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.

David Silver Thomas Hubert Julian Schrittwieser Ioannis Antonoglou Matthew Lai

Science

December 2018

The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play.

View Article and Find Full Text PDF

Mastering the game of Go without human knowledge.

David Silver Julian Schrittwieser Karen Simonyan Ioannis Antonoglou Aja Huang

Nature

October 2017

A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks.

View Article and Find Full Text PDF

Mastering the game of Go with deep neural networks and tree search.

David Silver Aja Huang Chris J Maddison Arthur Guez Laurent Sifre Ioannis Antonoglou

Nature

January 2016

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play.

View Article and Find Full Text PDF

Human-level control through deep reinforcement learning.

Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Ioannis Antonoglou

Nature

February 2015

The theory of reinforcement learning provides a normative account, deeply rooted in psychological and neuroscientific perspectives on animal behaviour, of how agents may optimize their control of an environment. To use reinforcement learning successfully in situations approaching real-world complexity, however, agents are confronted with a difficult task: they must derive efficient representations of the environment from high-dimensional sensory inputs, and use these to generalize past experience to new situations. Remarkably, humans and other animals seem to solve this problem through a harmonious combination of reinforcement learning and hierarchical sensory processing systems, the former evidenced by a wealth of neural data revealing notable parallels between the phasic signals emitted by dopaminergic neurons and temporal difference reinforcement learning algorithms.

View Article and Find Full Text PDF