OpenAI Evolved Policy Gradients

Post by admin

We’re releasing an experimental metalearning approach called Evolved Policy Gradients (EPG), which evolves the loss function of learning agents so that they can train quickly on novel tasks. Agents trained with EPG can succeed at basic test-time tasks that fall outside their training regime, such as learning to navigate to an object placed on a different side of the room than it was during training.
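
To make the idea of "evolving a loss function" concrete, here is a toy sketch in Python. It is not OpenAI's implementation. It assumes a one-parameter agent, a three-coefficient quadratic loss that is allowed to see the goal directly, and a plain evolution-strategies update in the outer loop. The names inner_loop, mean_return, and evolve_loss, and all hyperparameters, are hypothetical and chosen only for illustration.

```python
import numpy as np


def inner_loop(phi, goal, steps=10, lr=0.1):
    """Train a one-parameter agent by gradient descent on the LEARNED loss.

    Toy assumption: the learned loss is
        L_phi(theta) = phi[0]*(theta - goal)**2 + phi[1]*theta**2 + phi[2]*theta
    so its gradient with respect to theta is available in closed form.
    Returns the true task return, which the agent never optimizes directly.
    """
    theta = 0.0
    for _ in range(steps):
        grad = 2.0 * phi[0] * (theta - goal) + 2.0 * phi[1] * theta + phi[2]
        theta -= lr * grad
    return -(theta - goal) ** 2


def mean_return(phi, goals):
    """Average true return of agents trained with loss parameters phi."""
    return float(np.mean([inner_loop(phi, g) for g in goals]))


def evolve_loss(generations=150, population=32, sigma=0.1, outer_lr=0.02, seed=0):
    """Outer loop: evolution strategies over the loss parameters phi."""
    rng = np.random.default_rng(seed)
    phi = np.zeros(3)  # the all-zero loss teaches the agent nothing
    for _ in range(generations):
        goals = rng.uniform(-1.0, 1.0, size=8)         # fresh tasks each generation
        noise = rng.standard_normal((population, 3))   # one perturbation per worker
        returns = np.array([mean_return(phi + sigma * n, goals) for n in noise])
        # Normalize returns so the step size does not depend on their scale.
        adv = (returns - returns.mean()) / (returns.std() + 1e-8)
        # ES estimate of the gradient of expected return with respect to phi.
        phi = phi + (outer_lr / (population * sigma)) * (noise.T @ adv)
    return phi


if __name__ == "__main__":
    evolved = evolve_loss()
    test_goals = np.linspace(-1.0, 1.0, 9)
    print("evolved loss parameters:", evolved)
    print("return with zero loss:   ", mean_return(np.zeros(3), test_goals))
    print("return with evolved loss:", mean_return(evolved, test_goals))
```

The inner loop only ever follows the gradient of the learned loss. The true return is used solely by the outer loop to score each perturbed loss, which is the split the post describes. A real setup would replace the closed-form loss with a neural network over the agent's experience, and the one-parameter agent with a policy.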

Source: https://openai.com/index/evolved-policy-gradients