LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

📰 Hacker News · gpjt

121 comments, 540 points on Hacker News.

Published 2 Dec 2025