ModelMar 3, 2026
Google

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Why It Matters

Gemini 3.1 Flash-Lite offers enhanced performance and cost-efficiency, making it ideal for high-volume and complex workloads, enabling more developers to leverage AI at scale.

Release Summary

  • Gemini 3.1 Flash-Lite is Google's fastest and most cost-efficient Gemini 3 series model.

  • Available in preview via the Gemini API in Google AI Studio and for enterprises via Vertex AI.

  • Priced at $0.25/1M input tokens and $1.50/1M output tokens.

  • Outperforms previous models with a 2.5X faster Time to First Answer Token.

  • Achieves high scores on benchmarks like GPQA Diamond and MMMU Pro.

This entry is based on publicly available announcements. AI Product Release Radar is not affiliated with Google. No guarantee of accuracy. Not financial advice.

AD_SLOT