GKE for gen AI Apps: How HubX achieves scale, speed and agility
Learn more → https://goo.gle/4maopln
Discover how HubX, a leading AI-powered mobile app company with flagship apps such as Nova, DaVinci and Momo is building and serving their gen AI apps on Google Kubernetes Engine (GKE) and TPUs. With GKE and Trillium TPUs, HubX has cut down their inference latency by 66% - users get responses in under 10 seconds instead of waiting up to 30 seconds.
Watch on YouTube ↗
(saves to browser)
DeepCamp AI