Learning from human preferences

📰 OpenAI News

OpenAI and DeepMind developed an algorithm that infers human preferences by comparing two proposed behaviors

advanced Published 13 Jun 2017
Action Steps
  1. Collaborate with safety teams to identify complex goals
  2. Develop algorithms that can infer human preferences from comparisons
  3. Test and refine the algorithm with human feedback
  4. Integrate the algorithm into AI systems to improve safety and alignment
Who Needs to Know This

AI researchers and engineers on a team can benefit from this algorithm to build safer AI systems, and product managers can use it to develop more aligned AI products

Key Insight

💡 Inferring human preferences from comparisons can help build safer AI systems

Share This
🤖 New algorithm infers human preferences from comparisons! 🚀
Read full article → ← Back to News