OpenAI: Learning from human preferences

Post by admin

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.
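
To make "being told which of two proposed behaviors is better" a bit more concrete, here is a minimal sketch of fitting a reward from pairwise comparisons. It is not the algorithm from the linked post. It assumes a Bradley-Terry style logistic model, a linear reward over hand-made feature vectors, and a simulated "human" who always prefers the behavior with the higher hidden reward. All names (`fit_reward`, `prefers_a`, `make_demo_pairs`) and the hidden reward itself are hypothetical, chosen only for illustration.

```python
import math
import random


def reward(w, features):
    """Linear reward estimate: dot product of weights and behavior features."""
    return sum(wi * xi for wi, xi in zip(w, features))


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_reward(pairs, dim, lr=0.1, epochs=200):
    """Fit reward weights from comparisons (features_a, features_b, a_preferred).

    Assumed model: P(a preferred over b) = sigmoid(reward(a) - reward(b)).
    Each update is a stochastic gradient ascent step on the log-likelihood.
    """
    w = [0.0] * dim
    for _ in range(epochs):
        for fa, fb, a_preferred in pairs:
            p_a = sigmoid(reward(w, fa) - reward(w, fb))
            err = (1.0 if a_preferred else 0.0) - p_a
            for i in range(dim):
                w[i] += lr * err * (fa[i] - fb[i])
    return w


def prefers_a(w, fa, fb):
    """Predicted preference under the learned reward."""
    return reward(w, fa) > reward(w, fb)


def make_demo_pairs(n, seed=0):
    """Simulated comparisons. The hidden reward x0 - 0.5 * x1 is made up."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        fa = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        fb = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
        hidden_a = fa[0] - 0.5 * fa[1]
        hidden_b = fb[0] - 0.5 * fb[1]
        pairs.append((fa, fb, hidden_a > hidden_b))
    return pairs


if __name__ == "__main__":
    demo_pairs = make_demo_pairs(200)
    weights = fit_reward(demo_pairs, dim=2)
    print("learned reward weights:", weights)
```

The learner never sees the hidden reward, only which of two behaviors was preferred, yet the fitted weights should end up pointing in roughly the same direction as the hidden ones. A real system would replace the hand-made features and the simulated labels with observed behavior and actual human judgments.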

Source: https://openai.com/index/learning-from- ... references