Publications by Yuval Tassa

Publications by authors named "Yuval Tassa"

Page 1 of 1

A virtual rodent predicts the structure of neural activity across behaviours.

Diego Aldarondo Josh Merel Jesse D Marshall Leonard Hasenclever Ugne Klibaite Yuval Tassa

Nature

August 2024

Animals have exquisite control of their bodies, allowing them to perform a diverse range of behaviours. How such control is implemented by the brain, however, remains unclear. Advancing our understanding requires models that can relate principles of control to the structure of neural activity in behaving animals.

View Article and Find Full Text PDF

Learning agile soccer skills for a bipedal robot with deep reinforcement learning.

Tuomas Haarnoja Ben Moran Guy Lever Sandy H Huang Dhruva Tirumala Yuval Tassa

Sci Robot

April 2024

We investigated whether deep reinforcement learning (deep RL) is able to synthesize sophisticated and safe movement skills for a low-cost, miniature humanoid robot that can be composed into complex behavioral strategies. We used deep RL to train a humanoid robot to play a simplified one-versus-one soccer game. The resulting agent exhibits robust and dynamic movement skills, such as rapid fall recovery, walking, turning, and kicking, and it transitions between them in a smooth and efficient manner.

View Article and Find Full Text PDF

From motor control to team play in simulated humanoid football.

Siqi Liu Guy Lever Zhe Wang Josh Merel S M Ali Eslami Yuval Tassa

Sci Robot

August 2022

Learning to combine control at the level of joint torques with longer-term goal-directed behavior is a long-standing challenge for physically embodied artificial agents. Intelligent behavior in the physical world unfolds across multiple spatial and temporal scales: Although movements are ultimately executed at the level of instantaneous muscle tensions or joint torques, they must be selected to serve goals that are defined on much longer time scales and that often involve complex interactions with the environment and other agents. Recent research has demonstrated the potential of learning-based approaches applied to the respective problems of complex movement, long-term planning, and multiagent coordination.

View Article and Find Full Text PDF

Least squares solutions of the HJB equation with neural network value-function approximators.

Yuval Tassa Tom Erez

IEEE Trans Neural Netw

July 2007

In this paper, we present an empirical study of iterative least squares minimization of the Hamilton-Jacobi-Bellman (HJB) residual with a neural network (NN) approximation of the value function. Although the nonlinearities in the optimal control problem and NN approximator preclude theoretical guarantees and raise concerns of numerical instabilities, we present two simple methods for promoting convergence, the effectiveness of which is presented in a series of experiments. The first method involves the gradual increase of the horizon time scale, with a corresponding gradual increase in value function complexity.

View Article and Find Full Text PDF