New ways to balance cost and reliability in the Gemini API

📰 Google AI Blog

Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency.

Published 2 Apr 2026
Read full article → ← Back to News