RedPajama v2 Open Dataset with 30T Tokens for Training LLMs

📰 Hacker News · programd

60 comments, 236 points on Hacker News.

Published 30 Oct 2023