OPEN AI Gathering human feedback

Post Reply
admin
Site Admin
Articles: 0
Posts: 1162
Joined: Sat May 02, 2026 10:05 am

OPEN AI Gathering human feedback

Post by admin »

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

Source: https://openai.com/index/gathering-human-feedback
Post Reply