Accelerate LLM post training with W&B Serverless SFT
W&B Training offers Serverless SFT powered by CoreWeave to help AI engineers fine-tune large language models for agentic tasks without managing infrastructure. In this video, we show how Serverless SFT makes it faster to customize model output format and style, distill knowledge from curated datasets, and warm-start models for reinforcement learning in a unified post-training workflow. We also demonstrate how fine-tuned LoRA adapters can be served using W&B Inference for evaluation and deployment.
https://wandb.ai/site/serverless-sft
⏳Timestamps:
0:00 Introducing W&B Training Serverless SFT powered by CoreWeave
0:25 AI applications are hard to productionize
1:23 Post-training LLMs with SFT and RL
2:16 Why switching between SFT and RL is difficult
2:46 Using SFT and RL in a unified workflow with W&B Training
3:51 Simple coding agent example
4:39 Evaluating coding agent LLMs
5:32 Getting started with Serverless SFT
6:09 Fine-tuning a Qwen model using Serverless SFT
7:36 Running Weave Evaluations during SFT
8:33 Post-training using SFT and RL together
9:32 Serving fine-tuned models using W&B Inference
9:55 Testing our fine-tuned model in the Weave Playground
10:27 Recap, conclusion, and invitation to try the Weights & Biases AI developer platform