Pretraining LLMs

External: Coursera Courses ↗ · Coursera

Open Course on External: Coursera

Free to audit · Opens on External: Coursera

Pretraining LLMs

Coursera · Advanced ·🧠 Large Language Models ·2mo ago
In Pretraining LLMs you’ll explore the first step of training large language models using a technique called pretraining. You’ll learn the essential steps to pretrain an LLM, understand the associated costs, and discover how starting with smaller, existing open source models can be more cost-effective. Pretraining involves teaching an LLM to predict the next token using vast text datasets, resulting in a base model, and this base model requires further fine-tuning for optimal performance and safety. In this course, you’ll learn to pretrain a model from scratch and also to take a model that’s already been pretrained and continue the pretraining process on your own data. In detail: 1. Explore scenarios where pretraining is the optimal choice for model performance. Compare text generation across different versions of the same model to understand the performance differences between base, fine-tuned, and specialized pre-trained models. 2. Learn how to create a high-quality training dataset using web text and existing datasets, which is crucial for effective model pretraining. 3. Prepare your cleaned dataset for training. Learn how to package your training data for use with the Hugging Face library. 4. Explore ways to configure and initialize a model for training and see how these choices impact the speed of pretraining. 5. Learn how to configure and execute a training run, enabling you to train your own model. 6. Learn how to assess your trained model’s performance and explore common evaluation strategies for LLMs, including important benchmark tasks used to compare different models’ performance. After taking this course, you’ll be equipped with the skills to pretrain a model—from data preparation and model configuration to performance evaluation.
Watch on External: Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Whether Artificial General Intelligence Will Arise Spontaneously Or Via Slow Roll
Learn about the potential emergence of Artificial General Intelligence (AGI) and its implications, whether it arises spontaneously or through a slow roll, and why it matters for professionals and leaders in the tech industry
Forbes Innovation
5 Mistakes That Destroy Your AI Discoverability
Learn the 5 mistakes that destroy AI discoverability and how to improve your brand's visibility in generative search answers
Medium · LLM
When AI Remembers You Better Than You Remember Yourself
Learn how AI-powered personalized assistants can remember and influence your past, present, and future interactions, and what this means for your digital identity
Medium · AI
Revolutionising EMR Note Summarisation with LLM: A Practical Guide
Learn how to revolutionize EMR note summarization using Large Language Models (LLMs) to improve healthcare technology
Medium · Startup
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →