1 gen 2017 anni - Google Deepmind, Emergence of Locomotion Behaviours in Rich Environments
Descrizione:
The research explores how reinforcement learning (or RL) can be used to teach a computer to navigate unfamiliar and complex environments. Everything the stick figure is doing in this video is self-taught. The jumping, the limboing, the leaping — all of these are behaviors that the computer has devised itself as the best way of getting from A to B. All DeepMind’s programmers have done is give the agent a set of virtual sensors (so it can tell whether it’s upright or not, for example) and then incentivize to move forward. The computer works the rest out for itself, using trial and error to come up with different ways of moving.