Sponsored Session: TorchTPU: Expanding TPU Programmabil... Kat Ko, Claudio Basile & Jana van Greunen

PyTorch · Advanced ·🧬 Deep Learning ·1mo ago
Sponsored Session: TorchTPU: Expanding TPU Programmability to Pytorch - Kat Ko & Claudio Basile, Google; Jana van Greunen, Meta Google Tensor Processing Units (TPUs) are designed for ML at massive scale, offering significant benefits in performance, energy, and cost. While TPUs have historically been associated with the TensorFlow and JAX ecosystems, we introduce TorchTPU: a new Google effort to expand TPU programmability to PyTorch. This talk charts TorchTPU’s evolution, from the initial RFC to establishing a native, eager-first PyTorch backend. We will outline the core technical challenges overcome during this transition—particularly the complexities of translating dynamic, eager execution into highly optimized TPU computations. We’ll highlight current milestones, including native integration with torch.compile, DTensor, and robust support for the latest Ironwood (TPU v7) architecture. These advancements collectively enable multi-billion parameter models to run on TPUs with minimal code changes, while retaining the ability for users to apply model-specific optimizations (e.g., custom kernels, quantization, sharding) to reach peak performance. Finally, we’ll provide a sneak peek at our roadmap for 2026.
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Day 3 — The Transformer Architecture Deep Dive
Learn the fundamentals of the Transformer architecture and its key components, including self-attention and residual connections, to improve your deep learning skills
Medium · Deep Learning
what is a optimizer in deep learning? what does it actually do? raw thoughts and intuition.
Learn what an optimizer is in deep learning and its role in improving model performance
Medium · Deep Learning
The Deep Learning Compendium: From Mathematical Foundations to Modern Architectures
Learn the fundamentals of Deep Learning, from mathematical foundations to modern architectures, to drive adaptive intelligence in the tech sector
Medium · AI
The Deep Learning Compendium: From Mathematical Foundations to Modern Architectures
Learn the fundamentals of Deep Learning, from mathematical foundations to modern architectures, to stay ahead in the tech industry
Medium · Machine Learning
Up next
Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 4 - Latent Space & Guidance
Stanford Online
Watch →