Gathering human feedback

📰 OpenAI News

OpenAI introduces RL-Teacher, an open-source tool for training AIs with human feedback

intermediate Published 3 Aug 2017

Action Steps

Explore the RL-Teacher open-source implementation
Understand the underlying technique for training AIs with human feedback
Apply RL-Teacher to reinforcement learning problems with hard-to-specify rewards

Who Needs to Know This

AI engineers and researchers can utilize RL-Teacher to develop more efficient and safe AI systems, while product managers can leverage it to improve Reinforcement Learning models

Key Insight

💡 RL-Teacher enables training AIs with occasional human feedback, reducing reliance on hand-crafted reward functions

Key Takeaways

OpenAI introduces RL-Teacher, an open-source tool for training AIs with human feedback

Full Article

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.

Read full article → ← Back to Reads