Many potential applications of artificial intelligence involve making real-time decisions in physical systems while interacting with humans. Automobile racing represents an extreme example of these conditions; drivers must execute complex tactical manoeuvres to pass or block opponents while operating their vehicles at their traction limits. Racing simulations, such as the PlayStation game Gran Turismo, faithfully reproduce the non-linear control challenges of real race cars while also encapsulating the complex multi-agent interactions. Here we describe how we trained agents for Gran Turismo that can compete with the world's best e-sports drivers. We combine state-of-the-art, model-free, deep reinforcement learning algorithms with mixed-scenario training to learn an integrated control policy that combines exceptional speed with impressive tactics. In addition, we construct a reward function that enables the agent to be competitive while adhering to racing's important, but under-specified, sportsmanship rules. We demonstrate the capabilities of our agent, Gran Turismo Sophy, by winning a head-to-head competition against four of the world's best Gran Turismo drivers. By describing how we trained championship-level racers, we demonstrate the possibilities and challenges of using these techniques to control complex dynamical systems in domains where agents must respect imprecisely defined human norms.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41586-021-04357-7DOI Listing

Publication Analysis

Top Keywords

gran turismo
20
turismo drivers
8
deep reinforcement
8
reinforcement learning
8
world's best
8
gran
5
turismo
5
outracing champion
4
champion gran
4
drivers
4

Similar Publications

Modelling the heterogeneity of tourist spending in a mature destination: An approach through infinite mixture.

Heliyon

October 2024

Department of Quantitative Methods, University of Las Palmas de G.C. and TIDES Institute, Campus Universitario de Tafira, Las Palmas de Gran Canaria, 35017, Las Palmas, Spain.

Identifying tourists' preferences is essential for stakeholders to provide better products and services. Among the tools to classify such choices, expenditure segmentation is valuable for separating tourist groups with shared interests. The underlying idea of the (infinite) mixture model is that tourists spend on a specific activity depending on their preferences.

View Article and Find Full Text PDF

Bioaccumulation is the process by which living organisms accumulate substances, such as pesticides, heavy metals, and other pollutants, from their environment. These substances can accumulate in the organism's tissues over time, leading to potential health risks. Bioaccumulation can occur in both aquatic and terrestrial ecosystems, and can have a significant impact on the health of both humans and wildlife.

View Article and Find Full Text PDF

In the last few years, esports have become popular among older individuals. Although participation in esports can become a novel activity for older adults, evidence on their effects is limited to young individuals. This study investigated the effects of esports participation on the emotional and physiological states of older adults.

View Article and Find Full Text PDF

Many potential applications of artificial intelligence involve making real-time decisions in physical systems while interacting with humans. Automobile racing represents an extreme example of these conditions; drivers must execute complex tactical manoeuvres to pass or block opponents while operating their vehicles at their traction limits. Racing simulations, such as the PlayStation game Gran Turismo, faithfully reproduce the non-linear control challenges of real race cars while also encapsulating the complex multi-agent interactions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!