41
/
AIzaSyB4mHJ5NPEv-XzF7P6NDYXjlkCWaeKw5bc
May 31, 2026
3431534
69881
1
Public Timelines
FAQ

jan 1, 2017 - Google Deepmind, Emergence of Locomotion Behaviours in Rich Environments

Description:

The research explores how reinforcement learning (or RL) can be used to teach a computer to navigate unfamiliar and complex environments.
Everything the stick figure is doing in this video is self-taught. The jumping, the limboing, the leaping — all of these are behaviors that the computer has devised itself as the best way of getting from A to B. All DeepMind’s programmers have done is give the agent a set of virtual sensors (so it can tell whether it’s upright or not, for example) and then incentivize to move forward. The computer works the rest out for itself, using trial and error to come up with different ways of moving.

Added to timeline:

Date:

jan 1, 2017
Now
~ 9 years and 5 months ago

Images:

YouTube: