gpt.buzz

New ways to balance cost and reliability in the Gemini API

April 2, 2026

Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency.

Source: blog.google
