Your Coding Agent Should Do AI System Engineering — Ben Burtenshaw, Hugging Face

AI Engineer · Advanced ·🤖 AI Agents & Automation ·1h ago
An agent written RMSNorm kernel hit 1.88x speedups on H100s. A finetuned Qwen3 0.6B hit 35% on LiveCodeBench. Neither result required a systems engineer. Just coding agents with the right skills loaded. Ben Burtenshaw from Hugging Face walks through three levels: using Claude Code interactively to write and benchmark CUDA kernels distributed as versioned repos on the Hub, a zero-shot task where an agent finetunes a model end to end from a single prompt, and a multi agent research lab running parallel experiments overnight on Hub compute while a reporter agent pushes results to a live Trackio dashboard. The through line is skills: file based context that turns a zero shot failure into a few shot workflow. CUDA programming and ML training pipelines were deep specializations that took years. Skills compress that timeline to hours. Speaker info: - https://x.com/ben_burtenshaw - https://www.linkedin.com/in/ben-burtenshaw/ - https://github.com/burtenshaw
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

AI Is Quietly Learning Human Behavior In Ways Most People Never Notice
AI is learning human behavior in subtle ways, predicting habits, emotions, and daily decisions, which is crucial for developing more human-centric AI systems
Medium · AI
Why AI Systems Need Memory Now
AI systems need memory to learn from past experiences and make progress, otherwise they're limited to repetitive and inefficient behavior
Dev.to AI
Building Persistent Memory Into AI Tutoring: The Evenfield Architecture
Learn how the Evenfield Architecture builds persistent memory into AI tutoring for more effective learning experiences
Dev.to AI
AI Receptionist for Australian Dental Practices: The Honest 2026 Guide
Learn how AI receptionists can automate tasks for Australian dental practices, reducing front-desk call volume by up to 60% and capturing new patient enquiries 24/7
Dev.to AI
Up next
How to Build a Self-Improving Company with AI
Y Combinator
Watch →