GKE for gen AI Apps: How HubX achieves scale, speed and agility

Google Cloud · Beginner ·🛠️ AI Tools & Apps ·4d ago
Learn more → https://goo.gle/4maopln Discover how HubX, a leading AI-powered mobile app company with flagship apps such as Nova, DaVinci and Momo is building and serving their gen AI apps on Google Kubernetes Engine (GKE) and TPUs. With GKE and Trillium TPUs, HubX has cut down their inference latency by 66% - users get responses in under 10 seconds instead of waiting up to 30 seconds.
Watch on YouTube ↗ (saves to browser)
Perplexity “Computer” Explained
Next Up
Perplexity “Computer” Explained
Full Disclosure