Model · Mar 3, 2026

Gemini 3.1 Flash-Lite: Built for intelligence at scale

Why It Matters

Gemini 3.1 Flash-Lite improves on both performance and cost-efficiency, which suits it to high-volume and complex workloads and lets more developers use AI at scale.

Release Summary

Gemini 3.1 Flash-Lite is Google's fastest and most cost-efficient Gemini 3 series model.
Available in preview via the Gemini API in Google AI Studio and for enterprises via Vertex AI.
Priced at $0.25/1M input tokens and $1.50/1M output tokens.
Outperforms previous models with a 2.5X faster Time to First Answer Token.
Achieves high scores on benchmarks like GPQA Diamond and MMMU Pro.
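
To make the quoted pricing concrete, here is a minimal cost-estimation sketch. It assumes the two per-token rates above are the only charges and apply uniformly; any tiering, caching discounts, or other billing rules are not covered by this summary and are ignored. The function name `estimate_cost` and the workload figures are hypothetical, chosen only for illustration.

```python
# Preview prices quoted in the release summary (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.25
OUTPUT_PRICE_PER_M = 1.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for the given token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 10,000 requests, each with 2,000 input
# tokens and 500 output tokens.
requests = 10_000
cost = estimate_cost(requests * 2_000, requests * 500)
print(f"${cost:.2f}")  # prints "$12.50"
```

Under these assumptions, output tokens dominate the bill ($7.50 of the $12.50) even though there are four times as many input tokens, because the output rate is six times the input rate.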

Source Links

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3
