-
Highly efficient
Better quality than 1.5 Flash, at the same speed and cost.
-
More context
A 1 million token context window and multimodal input.
Model information
Model deployment status
Public preview
Supported data types for input
Text, Image, Video, Audio
Supported data types for output
Text
Supported # tokens for input
1M
Supported # tokens for output
8k
Knowledge cutoff
June 2024
Best for
Low-cost workflows
Availability
Google AI Studio
Gemini API
Vertex AI
Try Gemini Flash
General availability
2.0 Flash
Our powerful workhorse model with low latency and enhanced performance, built to power agentic experiences
Experimental
2.0 Flash Thinking
Our enhanced reasoning model, capable of showing its thoughts to improve performance and explainability.
Public preview
2.0 Flash-Lite
Our most cost-efficient model yet.
General availability
1.5 Flash
Our lightweight model, optimized for tasks where speed and efficiency matter the most.
General availability
1.5 Flash-8B
Our smaller, faster and most cost-efficient Flash model.