Gemini 3 Flash

Best for frontier intelligence at speed

Our latest Gemini 3 model that helps you bring any idea to life, faster.

Intelligence in a Flash

Get Gemini 3 Pro’s level of reasoning with Flash-level latency and efficiency – at a fraction of the cost. Gemini 3 Flash is our most impressive model for agentic workflows.

Speed and scale don’t have to come at the cost of intelligence.

Frontier-level multimodal understanding across text, audio, images, code, and video.

High-quality intelligence at a fraction of the cost.


Hands-on

Efficiency doesn’t come at the cost of intelligence. Here are just a few ways you can use Gemini 3 Flash’s multimodal, frontier-level, real-time reasoning capabilities at speed.


Performance

Gemini 3 Flash demonstrates that speed and scale don’t have to come at the cost of intelligence.

Benchmark | Notes | Gemini 3 Flash Thinking | Gemini 3 Pro Thinking | Gemini 2.5 Flash Thinking | Gemini 2.5 Pro Thinking | Claude Sonnet 4.5 Thinking | GPT-5.2 Extra high | Grok 4.1 Fast Reasoning
Input price | $/1M tokens | $0.50 | $2.00; $4.00 > 200k tokens | $0.30 | $1.25; $2.50 > 200k tokens | $3.00; $6.00 > 200k tokens | $1.75 | $0.20
Output price | $/1M tokens | $3.00 | $12.00; $18.00 > 200k tokens | $2.50 | $10.00; $15.00 > 200k tokens | $15.00; $22.50 > 200k tokens | $14.00 | $0.50
Humanity's Last Exam (academic reasoning; full set, text + MM) | No tools | 33.7% | 37.5% | 11.0% | 21.6% | 13.7% | 34.5% | 17.6%
Humanity's Last Exam | With search and code execution | 43.5% | 45.8% | - | - | - | 45.5% | -
ARC-AGI-2 (visual reasoning puzzles) | ARC Prize Verified | 33.6% | 31.1% | 2.5% | 4.9% | 13.6% | 52.9% | -
GPQA Diamond (scientific knowledge) | No tools | 90.4% | 91.9% | 82.8% | 86.4% | 83.4% | 92.4% | 84.3%
AIME 2025 (mathematics) | No tools | 95.2% | 95.0% | 72.0% | 88.0% | 87.0% | 100% | 91.9%
AIME 2025 | With code execution | 99.7% | 100% | 75.7% | - | 100% | - | -
MMMU-Pro (multimodal understanding and reasoning) | - | 81.2% | 81.0% | 66.7% | 68.0% | 68.0% | 79.5% | 63.0%
ScreenSpot-Pro (screen understanding) | No tools unless specified | 69.1% | 72.7% | 3.9% | 11.4% | 36.2% | 86.3% with python | -
CharXiv Reasoning (information synthesis from complex charts) | No tools | 80.3% | 81.4% | 63.7% | 69.6% | 68.5% | 82.1% | -
OmniDocBench 1.5 (OCR) | Overall Edit Distance, lower is better | 0.121 | 0.115 | 0.154 | 0.145 | 0.145 | 0.143 | -
Video-MMMU (knowledge acquisition from videos) | - | 86.9% | 87.6% | 79.2% | 83.6% | 77.8% | 85.9% | -
LiveCodeBench Pro (competitive coding problems from Codeforces, ICPC, and IOI) | Elo Rating, higher is better | 2316 | 2439 | 1143 | 1775 | 1418 | 2393 | -
Terminal-Bench 2.0 (agentic terminal coding) | Terminus-2 harness | 47.6% | 54.2% | 16.9% | 32.6% | 42.8% | - | -
SWE-bench Verified (agentic coding) | Single attempt | 78.0% | 76.2% | 60.4% | 59.6% | 77.2% | 80.0% | 50.6%
τ2-bench (agentic tool use) | - | 90.2% | 90.7% | 79.5% | 77.8% | 87.2% | - | -
Toolathlon (long horizon real-world software tasks) | - | 49.4% | 36.4% | 3.7% | 10.5% | 38.9% | 46.3% | -
MCP Atlas (multi-step workflows using MCP) | - | 57.4% | 54.1% | 3.4% | 8.8% | 43.8% | 60.6% | -
Vending-Bench 2 (agentic long term coherence) | Net worth (mean), higher is better | $3,635 | $5,478 | $549 | $574 | $3,839 | $3,952 | $1,107
FACTS Benchmark Suite (factuality benchmark across grounding, parametric, search, and MM) | - | 61.9% | 70.5% | 50.4% | 63.4% | 48.9% | 61.4% | 42.1%
SimpleQA Verified (parametric knowledge) | - | 68.7% | 72.1% | 28.1% | 54.5% | 29.3% | 38.0% | 19.5%
MMMLU (multilingual Q&A) | - | 91.8% | 91.8% | 86.6% | 89.5% | 89.1% | 89.6% | 86.8%
Global PIQA (commonsense reasoning across 100 Languages and Cultures) | - | 92.8% | 93.4% | 90.2% | 91.5% | 90.1% | 91.2% | 85.6%
MRCR v2 (8-needle) (long context performance) | 128k (average) | 67.2% | 77.0% | 54.3% | 58.0% | 47.1% | 81.9% | 54.6%
MRCR v2 (8-needle) | 1M (pointwise) | 22.1% | 26.3% | 21.0% | 16.4% | not supported | not supported | 6.1%

For details on our evaluation methodology, please see deepmind.google/models/evals-methodology/gemini-3-flash
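To make "a fraction of the cost" concrete, the input and output price rows in the table above can be turned into a rough per-request estimate. The sketch below is illustrative only: the `Price` class, the `request_cost` function, and the `PRICES` dictionary are hypothetical names, not part of any Gemini SDK. It also assumes that when a prompt exceeds 200k tokens, the higher listed rate applies to every token in the request rather than only to the excess. The table does not spell that rule out, so check the official pricing page before relying on it.

```python
from dataclasses import dataclass
from typing import Optional

# Boundary taken from the "> 200k tokens" entries in the price rows.
LONG_PROMPT_THRESHOLD = 200_000


@dataclass(frozen=True)
class Price:
    """List prices in USD per 1M tokens (hypothetical helper type)."""
    input_usd: float
    output_usd: float
    # Higher rates listed for prompts above 200k tokens (None if not listed).
    long_input_usd: Optional[float] = None
    long_output_usd: Optional[float] = None


# Prices copied from the table's input and output price rows.
PRICES = {
    "Gemini 3 Flash": Price(0.50, 3.00),
    "Gemini 3 Pro": Price(2.00, 12.00, long_input_usd=4.00, long_output_usd=18.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a single request.

    Assumption: a prompt above the threshold moves the whole request
    to the higher rates, where such rates are listed.
    """
    price = PRICES[model]
    in_rate, out_rate = price.input_usd, price.output_usd
    if input_tokens > LONG_PROMPT_THRESHOLD:
        if price.long_input_usd is not None:
            in_rate = price.long_input_usd
        if price.long_output_usd is not None:
            out_rate = price.long_output_usd
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    # Same workload on both models: 100k input tokens, 10k output tokens.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 100_000, 10_000):.2f}")
```

Under these assumptions, a request with 100k input tokens and 10k output tokens comes to about $0.08 on Gemini 3 Flash and $0.32 on Gemini 3 Pro, which is the 4x gap implied by the list prices.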

Model information

Name: 3 Flash
Status: Preview
Input:
  • Text
  • Image
  • Video
  • Audio
  • PDF
Output:
  • Text
Input tokens: 1M
Output tokens: 64k
Knowledge cutoff: January 2025
Tool use:
  • Function calling
  • Structured output
  • Search as a tool
  • Code execution
Best for:
  • Everyday tasks
  • Agentic coding
  • Advanced reasoning
  • Multimodal understanding
  • Long context understanding
Availability:
  • Gemini App
  • Google Cloud / Vertex AI
  • Google AI Studio
  • Gemini API
  • Gemini CLI
  • Gemini Enterprise
  • Google AI Mode
  • Google Antigravity
  • Android Studio