A year in, Google wants its Axion processors to feel like a scheduling decision

The New Stack · Intermediate ·🏗️ Systems Design & Architecture ·3w ago
At KubeCon Europe, Google Cloud’s Jago Macleod and Abdel Sghiouar argued that adopting Arm for Kubernetes workloads has shifted from a complex migration to a practical, low-friction choice. After a year of production use, Google’s custom Arm-based Axion processors—powering C4A and N4A instances—are positioned as broadly viable for most containerized applications, offering strong gains in performance, cost efficiency, and energy usage compared to x86. Rather than requiring a full overhaul, moving to Arm typically involves recompiling containers for a multi-architecture target and gradually rolling out via Kubernetes practices like canary deployments. While edge cases exist, they are relatively uncommon. A key enabler is GKE’s compute classes, which allow workloads to express preferences across VM types, turning infrastructure decisions into automated scheduling choices rather than manual provisioning. Ultimately, the conversation points to a larger constraint: energy. As AI workloads grow, efficiency—measured in “tokens per watt”—is emerging as the defining metric, with cost savings translating directly into greater compute capacity. Here's the full article to go along with the video: https://thenewstack.io/google-axion-kubernetes-arm/ Learn more from The New Stack about the latest developments around Google’s work with Axion: Arm: See a Demo About Migrating a x86-Based App to ARM64 https://thenewstack.io/arm-see-a-demo-about-migrating-a-x86-based-app-to-arm64/ Do All Your AI Workloads Actually Require Expensive GPUs? https://thenewstack.io/do-all-your-ai-workloads-actually-require-expensive-gpus/ Join our community of newsletter subscribers to stay on top of the news and at the top of your game. https://thenewstack.io/newsletter
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

The Real Reason Your System Is Slow Has Nothing to Do With Your Language
The slow performance of a system is often not due to the programming language used, but rather other factors such as poor design, inefficient algorithms, and inadequate optimization
Medium · Programming
Next.js vs Vite in 2026: What you should actually use
Learn when to use Next.js vs Vite in 2026 for optimal performance and ideal use cases
Dev.to AI
What an architecture audit actually finds (a story in eight bugs)
Learn how an architecture audit of a 29-microservice platform revealed unexpected bugs and areas for improvement
Medium · DevOps
What Happens in Your Brain When the Interviewer Says “Design YouTube” — and How to Stop It
Learn to overcome interview freeze and decision paralysis when designing complex systems like YouTube
Medium · Python
Up next
Scaling Continuous Delivery, GitOps & AI-Powered Deployments with Octopus Deploy
The New Stack
Watch →