DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL

📰 Hacker News · gradus_ad

1351 points and 1056 comments on Hacker News.

Published 25 Jan 2025