LLM from scratch, part 28 – training a base model from scratch on an RTX 3090
📰 Hacker News · gpjt
121 comments, 540 points on Hacker News.