This is a previous-generation family. Most teams should look at Gemini 3: Gemini 3.1 Pro, Flash, Lite Compared instead.

The variants on this page still work and are still listed, but pricing, capabilities, and benchmarks below describe the older generation. Use this page for migration planning, not as a starting point.

Google family

Gemini 2

Gemini 2.5 Flash ships at $0.30/$2.50 per 1M with 1M-token context. When 2.5 Pro and the 2.0 family beat upgrading to Gemini 3 on cost or workload.

Top in this family

Gemini 2.5 Pro ranks #68 of 186 on overall quality (QS 79.9) at $1.25/$10 per 1M tokens.

Practical pick

Gemini 2.5 Flash (Thinking) at $0.3/$2.5 per 1M tokens (rank #112 of 186).

Variants
6
License
Closed weights
Provider
Google

Best variant by workload

One pick per common job. Pick by what you need to ship — not by which variant has the highest score on a leaderboard you don't use.

Note — picks are framed for direct API usage where cost per million tokens is load-bearing. If you're inside an agent harness (Claude Code, Cursor, etc.) the calculus changes: the harness sets the model, the per-task cost is usually negligible, and the flagship variant tends to win. See our piece on Claude Code for the harness-vs-API framing.
WorkloadBest pickWhy
General API workhorse
Google Gemini 2.5 Flash
Thinking
$0.300/1M / $2.50/1M
Previous-generation default for chat-and-tooling workloads. Choose when the cost delta to Gemini 3 Flash is the deciding factor.
High-volume chat
gemini-2.5-flash-lite-preview-09-2025
Preview 09 2025 No Thinking
$0.100/1M / $0.400/1M
Cheapest 2.5-tier option at usable quality. Use for high-volume chat where per-token cost compounds.

All variants

20 variants across 6 models (+ 2 cross-family for context). Sorted by quality score (descending).

VariantQSGPQAHLESWESWE-ProTerminalTauMCPAIMEIn $/MOut $/MContextReleasedLic.
Gemini 2.5 ProPrevious
Gemini 2.5 Pro
79.9
#68/186
86.421.659.632.667.08.888.0$1.25$10Jun 17, 2025
ThinkingPrevious
Gemini 2.5 Flash
71.1
#112/186
82.811.060.416.93.472.0$0.3$2.51.0MJun 17, 2025
Non-thinkingPrevious
Gemini 2.0 Pro
70.1
#116/186
64.7Feb 5, 2025
LatestPrevious
gemini-2.5-flash-lite-preview-09-2025
68.5
#122/186
66.76.9$0.1$0.4Sep 25, 2025
Non-ThinkingPrevious
Gemini 2.5 Flash
67.6
#126/186
68.264.346.6$0.3$2.51.0MJun 17, 2025
Non-thinkingPrevious
Gemini 2.0 Flash-Lite
63.5
#141/186
51.5$0.075$0.3Feb 5, 2025
2.0Previous
Gemini 2.0 Flash
63.0
#144/186
60.1$0.1$0.4Feb 5, 2025
ReasoningPrevious
Gemini 2.0 Flash
61.4
#150/186
$0.1$0.4Feb 5, 2025
Max ThinkingPrevious
Gemini 2.5 Flash
$0.3$2.51.0MJun 17, 2025
Max Thinking 2025 06 17Previous
gemini-2.5-flash-lite-preview-09-2025
$0.1$0.4Sep 25, 2025
Max Thinking 2025 09 25Previous
gemini-2.5-flash-lite-preview-09-2025
$0.1$0.4Sep 25, 2025
Preview 06 17 ThinkingPrevious
gemini-2.5-flash-lite-preview-09-2025
$0.1$0.4Sep 25, 2025
Preview 09 2025 No ThinkingPrevious
gemini-2.5-flash-lite-preview-09-2025
$0.1$0.4Sep 25, 2025
Preview 01 01Previous
Gemini 2.0 Flash
6.6$0.1$0.4Feb 5, 2025
V4 Pro Thinkingcross-family
DeepSeek V4
98.0
#15/186
90.137.780.655.473.6$0.435$0.871.0MApr 24, 2026
V4 Flash Thinkingcross-family
DeepSeek V4
92.0
#27/186
88.134.879.052.669.0$0.098$0.1971.0MApr 24, 2026
3.0cross-family
Gemini 3 Flash
88.9
#32/186
90.433.778.049.647.662.0$0.5$3Dec 17, 2025
Previewcross-family
Gemini 3 Flash
87.3
#36/186
62.0$0.5$3Dec 17, 2025
V4 Procross-family
DeepSeek V4
80.9
#61/186
72.97.773.652.169.4$0.435$0.871.0MApr 24, 2026
V4 Flashcross-family
DeepSeek V4
78.1
#78/186
71.28.173.749.164.0$0.098$0.1971.0MApr 24, 2026

Benchmark evidence

Every benchmark we track for this family, across capabilities. The headline Quality Score draws from a deliberately narrow, governed panel (72 of 165 rows here feed it); the rest is tracked evidence — recorded and comparable, but not folded into one synthetic score.

Model / VariantBenchmarkScoreRankScoring
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveBench82.41 / 110In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2024_08_2025_0577.11 / 17In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2024_single75.61 / 2In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2024_07_2025_0180.12 / 8In Quality Score
Google Gemini 2.0 Pro · Non-thinkingLiveCodeBench · 2024_10_01_to_2025_02_01362 / 9In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProAider (Polyglot)83.13 / 45In Quality Score
Google Gemini 2.0 Flash · 2.0LiveCodeBench · 2024_10_01_to_2025_02_0134.53 / 9In Quality Score
Google Gemini 2.5 Flash · ThinkingAIME 2025 · code_exec75.74 / 4In Quality Score
Show all benchmark evidence (165 rows)

Reasoning

Model / VariantBenchmarkScoreRankScoring
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveBench82.41 / 110In Quality Score
Google Gemini 2.5 Flash · ThinkingAIME 2025 · code_exec75.74 / 4In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProSimpleBench62.49 / 61In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProAIME 2025 · no_tools8810 / 15In Quality Score
Google Gemini 2.5 Flash · ThinkingAIME 2025 · no_tools7213 / 15In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProAIME 20258816 / 88In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProMMLU Pro8617 / 86In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProHumanity's Last Exam · hle_text18.421 / 56In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProGPQA Diamond86.428 / 143In Quality Score
Google Gemini 2.5 Flash · ThinkingHumanity's Last Exam · hle_text12.628 / 56In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · LatestSimpleBench41.233 / 61In Quality Score
Google Gemini 2.5 Flash · ThinkingAIME 20257239 / 88In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProHumanity's Last Exam · hle21.639 / 90In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProArena Elo144641 / 158In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingLiveBench67.843 / 110In Quality Score
Google Gemini 2.0 Flash · ReasoningSimpleBench30.743 / 61In Quality Score
Google Gemini 2.5 Flash · ThinkingGPQA Diamond82.844 / 143In Quality Score
Google Gemini 2.0 Flash · ReasoningHumanity's Last Exam · hle_text6.544 / 56In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingHumanity's Last Exam · hle_text5.649 / 56In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingMMLU Pro79.452 / 86In Quality Score
Google Gemini 2.0 Pro · Non-thinkingMMLU Pro79.153 / 86In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingAIME 202546.653 / 88In Quality Score
Google Gemini 2.0 Flash · 2.0MMLU Pro77.655 / 86In Quality Score
Google Gemini 2.0 Flash · 2.0SimpleBench18.958 / 61In Quality Score
Google Gemini 2.5 Flash · ThinkingHumanity's Last Exam · hle1160 / 90In Quality Score
Gemini 2.0 Flash-Lite · Non-thinkingMMLU Pro71.663 / 86In Quality Score
Google Gemini 2.5 Flash · Max ThinkingLiveBench53.177 / 110In Quality Score
Google Gemini 2.5 Flash · ThinkingArena Elo141180 / 158In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · LatestHumanity's Last Exam · hle6.980 / 90In Quality Score
Google Gemini 2.0 Flash · Preview 01 01Humanity's Last Exam · hle6.683 / 90In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingGPQA Diamond68.284 / 143In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · LatestGPQA Diamond66.788 / 143In Quality Score
Google Gemini 2.0 Pro · Non-thinkingGPQA Diamond64.794 / 143In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · Max Thinking 2025 06 17LiveBench42.695 / 110In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · Max Thinking 2025 09 25LiveBench42.496 / 110In Quality Score
Google Gemini 2.0 Flash · 2.0GPQA Diamond60.199 / 143In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · Preview 09 2025 No ThinkingArena Elo1380106 / 158In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · Preview 06 17 ThinkingArena Elo1375107 / 158In Quality Score
Gemini 2.0 Flash-Lite · Non-thinkingGPQA Diamond51.5111 / 143In Quality Score
Google Gemini 2.0 Flash · 2.0Arena Elo1360117 / 158In Quality Score
Gemini 2.0 Flash-Lite · Non-thinkingArena Elo1353121 / 158In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProArena-Hard96.41 / 40Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMRCR · v1_average931 / 1Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMRCR · v1_pointwise82.91 / 1Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMulti-IF77.81 / 32Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMRCR · v2_average581 / 6Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMRCR · v2_pointwise16.41 / 1Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMATH 50098.82 / 55Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProGlobalPIQA91.52 / 4Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProFACTS Benchmark Suite63.42 / 12Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProVendingBench2573.64 / 4Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProAIME 2024924 / 69Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProGlobal PIQA91.54 / 26Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMMMU · mmmu_single79.64 / 22Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProSimpleQA54.54 / 40Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMMMLU89.55 / 38Tracked evidence
Google Gemini 2.5 Flash · Non-ThinkingAceBench74.55 / 7Tracked evidence
Google Gemini 2.5 Flash · ThinkingFACTS Benchmark Suite50.45 / 12Tracked evidence
Google Gemini 2.5 Flash · ThinkingGlobal PIQA90.26 / 26Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMathArenaApex0.57 / 8Tracked evidence
Google Gemini 2.5 Flash · ThinkingMRCR · v2_1m219 / 14Tracked evidence
Google Gemini 2.5 Flash · Non-ThinkingMMLU90.110 / 33Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMRCR · v2_1m16.410 / 14Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestFACTS Benchmark Suite17.912 / 12Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMMLU89.513 / 33Tracked evidence
Google Gemini 2.5 Flash · ThinkingMMMLU86.613 / 38Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMRCR · v2_128k5813 / 23Tracked evidence
Google Gemini 2.5 Flash · ThinkingAIME 202482.314 / 69Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMRCR · v2_1m5.414 / 14Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProSciCode42.815 / 24Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProFinanceAgent29.415 / 15Tracked evidence
Google Gemini 2.5 Flash · ThinkingMRCR · v2_128k54.316 / 23Tracked evidence
Google Gemini 2.5 Flash · ThinkingSimpleQA28.117 / 40Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMMMLU84.519 / 38Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMRCR · v2_128k30.621 / 23Tracked evidence
Google Gemini 2.5 Flash · Non-ThinkingSimpleQA23.322 / 40Tracked evidence
Google Gemini 2.0 Pro · Non-thinkingMATH 50091.825 / 55Tracked evidence
Google Gemini 2.5 Flash · ThinkingHMMT Feb 202564.227 / 44Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProBFCL v362.927 / 49Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProMMMU PRO6830 / 52Tracked evidence
Google Gemini 2.5 Flash · ThinkingMMMU PRO66.731 / 52Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestSimpleQA11.532 / 40Tracked evidence
Google Gemini 2.5 Flash · Non-ThinkingAIME 202461.333 / 69Tracked evidence
Google Gemini 2.5 Flash · Non-ThinkingHMMT Feb 202534.737 / 44Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMMMU PRO5142 / 52Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProBrowseComp7.644 / 51Tracked evidence

Coding

Model / VariantBenchmarkScoreRankScoring
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2024_08_2025_0577.11 / 17In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2024_single75.61 / 2In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2024_07_2025_0180.12 / 8In Quality Score
Google Gemini 2.0 Pro · Non-thinkingLiveCodeBench · 2024_10_01_to_2025_02_01362 / 9In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProAider (Polyglot)83.13 / 45In Quality Score
Google Gemini 2.0 Flash · 2.0LiveCodeBench · 2024_10_01_to_2025_02_0134.53 / 9In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingSWE-bench Verified · single_agentless32.67 / 7In Quality Score
Google Gemini 2.5 Flash · ThinkingLiveCodeBench · 2024_08_2025_0562.38 / 17In Quality Score
Gemini 2.0 Flash-Lite · Non-thinkingLiveCodeBench · 2024_10_01_to_2025_02_0128.98 / 9In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench · 2025_01_2025_05_single699 / 11In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProSWE-bench Verified · multiple67.29 / 10In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProLiveCodeBench70.413 / 69In Quality Score
Google Gemini 2.5 Flash · ThinkingLiveCodeBench62.620 / 69In Quality Score
Google Gemini 2.5 Flash · ThinkingAider (Polyglot)55.122 / 45In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProGSO (Global Software Optimization) · opt_at_1023 / 24In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingAider (Polyglot)4432 / 45In Quality Score
Google Gemini 2.5 Flash · Non-ThinkingLiveCodeBench · v644.734 / 40In Quality Score
Google Gemini 2.0 Flash · 2.0Aider (Polyglot)22.238 / 45In Quality Score
Google Gemini 2.0 Flash · ReasoningAider (Polyglot)18.240 / 45In Quality Score
gemini-2.5-flash-lite-preview-09-2025 · LatestLiveCodeBench34.341 / 69In Quality Score
Google Gemini 2.5 Flash · ThinkingSWE-bench Verified60.452 / 68In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProSWE-bench Verified59.653 / 68In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProCodeforces200111 / 47Tracked evidence
Google Gemini 2.5 Flash · Non-ThinkingOJ-Bench19.516 / 19Tracked evidence

Agentic

Model / VariantBenchmarkScoreRankScoring
Google Gemini 2.5 Flash · Thinkingτ²-bench · average79.515 / 30In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 Proτ²-bench · average77.819 / 30In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 Proτ²-bench · airline5021 / 29In Quality Score
Google Gemini 2.5 Flash · Non-Thinkingτ²-bench · airline42.525 / 29In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 Proτ²-bench · retail6726 / 34In Quality Score
Google Gemini 2.5 Flash · Non-Thinkingτ²-bench · retail64.328 / 34In Quality Score
Google Gemini 2.5 Flash · Non-Thinkingτ²-bench · telecom16.928 / 28In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProMCP Atlas8.832 / 33In Quality Score
Google Gemini 2.5 Flash · ThinkingMCP Atlas3.433 / 33In Quality Score
Google Gemini 2.5 Pro · Gemini 2.5 ProVendingBench · v25746 / 7Tracked evidence
Google Gemini 2.5 Flash · ThinkingVendingBench · v25497 / 7Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProToolathlon10.530 / 31Tracked evidence
Google Gemini 2.5 Flash · ThinkingToolathlon3.731 / 31Tracked evidence

Multimodal

Model / VariantBenchmarkScoreRankScoring
Google Gemini 2.5 Pro · Gemini 2.5 ProVideoMME86.92 / 4Tracked evidence
Google Gemini 2.0 Flash · 2.0ChartQA88.33 / 9Tracked evidence
Gemini 2.0 Flash-Lite · Non-thinkingChartQA739 / 9Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProVideo-MMMU83.610 / 28Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestLVBench60.912 / 18Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestVLMs Are Blind68.414 / 18Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMMVU65.314 / 20Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestLingoQA17.815 / 16Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMLVU · mavg78.516 / 22Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestVideoMME · without_sub72.716 / 21Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestAI2D · test85.717 / 33Tracked evidence
Google Gemini 2.5 Flash · ThinkingVideo-MMMU79.217 / 28Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestSLAKE6517 / 22Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMathVision52.117 / 17Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestZEROBench · sub19.217 / 23Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestRealWorldQA72.218 / 24Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMMBench · en_dev_v1_182.719 / 24Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestVideoMME · with_sub74.619 / 22Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestDynaMath69.919 / 23Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestBabyVision17.519 / 22Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestRefSpatialBench11.219 / 21Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestZEROBench119 / 27Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestV*69.620 / 23Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestHallusionBench64.520 / 33Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestSimpleVQA54.120 / 29Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestCountBench79.221 / 23Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMedXpertQA · mm35.321 / 31Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestEmbSpatialBench66.122 / 24Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestERQA44.322 / 27Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProScreenSpot-Pro11.422 / 24Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMMStar69.123 / 33Tracked evidence
Google Gemini 2.5 Flash · ThinkingScreenSpot-Pro3.923 / 24Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestVideo-MMMU60.726 / 28Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProCharXiv Reasoning69.627 / 48Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMathVista · mini72.829 / 36Tracked evidence
Google Gemini 2.5 Flash · ThinkingCharXiv Reasoning63.733 / 48Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestCharXiv Reasoning55.539 / 48Tracked evidence

Document/OCR

Model / VariantBenchmarkScoreRankScoring
Google Gemini 2.5 Flash · ThinkingOmniDocBench · v1_50.21 / 6Tracked evidence
Google Gemini 2.5 Pro · Gemini 2.5 ProOmniDocBench · v1_50.13 / 6Tracked evidence
Gemini 2.0 Flash-Lite · Non-thinkingDocVQA91.25 / 8Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestMMLongBench-Doc46.514 / 22Tracked evidence
gemini-2.5-flash-lite-preview-09-2025 · LatestOCRBench82.519 / 35Tracked evidence

Where this family sits in the market

Gemini 2.5 Flash Lite and 2.0 Flash Lite take the family's cost-efficiency frontier across all served Gemini tiers.

AnthropicCohereDeepSeekGoogleMetaMicrosoftMiniMaxMistralMoonshotnvidiaOpenAIQwenxAIZhipu

Dashed line = Pareto frontier (no model both cheaper and better). Thinking/non-thinking pairs of the same model are connected — line length = cost of reasoning. Hover any dot for details.

Alternatives to consider

Peer families that solve overlapping problems. Pick by your binding constraint (cost, latency, open weights, vendor lock-in), not by leaderboard order.

Caveats

What this page does not tell you, listed honestly.

  • No tracked API pricing for: Google Gemini 2.0 Pro. Variants without hosted-provider pricing are listed for completeness; cost columns show a dash.
  • Context window not declared for: Google Gemini 2.5 Pro, gemini-2.5-flash-lite-preview-09-2025, Google Gemini 2.0 Pro, Google Gemini 2.0 Flash, Gemini 2.0 Flash-Lite.
  • Cross-family models (marked "cross-family" in the variants table) are shown for context only. Their canonical page lives on the family that owns them.

Editor's notes

By borisLast verified AI-assisted, human-reviewed

If you are already on Gemini 2

If you have a working Vertex or AI Studio deployment pinned to a Gemini 2-era SKU, the question is when staying is defensible and where the data is thin enough to verify before committing.

The one fact that complicates the headline migration call: Gemini 2.5 Flash sits at $0.3 input / $2.5 output per million with a verified 1M-token context window. Gemini 3 Flash is more expensive on both axes ($0.5 / $3) and our index has a coverage gap on its context window. The trade is a Quality Score lift from 67.6 (2.5 Flash thinking) to 88.9 (3 Flash) for higher unit cost and an unverified context profile. For workloads that picked 2.5 Flash specifically for cheap long-context, the migration is not a unit-economics win. Verify the workload tolerates the quality gap before paying more per call.

Reasons to stay on the previous generation that are defensible:

  • Vertex routing or fine-tunes pinned to 2.5. If a deployment is going through Vertex with model-specific routing, fine-tunes, or enterprise SLAs tied to a 2.5 SKU, the migration cost includes the Vertex-side work. Plan it; do not assume the SDK is the only thing to update.
  • 2.0 Flash is genuinely cheap and your workload is tolerant. At $0.1 input / $0.4 output per million with 1M context, 2.0 Flash (and its reasoning preview variants) sit at one of the cheapest 1M-context price points in our index. Quality Score around 63 puts the line well below 3 Flash, but for repetitive low-stakes turns with long context the cost-per-call advantage compounds.
  • 2.5 Pro is in production and your evals qualified it. 2.5 Pro (thinking mode) at QS 79.9 with Arena ELO rank 38 is not a bad model; it just sits below 3 Pro 3.1's tier. If your workload is qualified on 2.5 Pro output behaviour, treat the migration to 3 Pro as a re-qualification exercise, not a drop-in.

Where the data is weak

  • Context window declarations are partial. Several 2-era variants list a 1M context window in our index (2.5 Flash, 2.0 Flash preview), while others (2.5 Pro, 2.5 Flash Lite, 2.0 Pro) show the field as unset. That is a coverage gap; verify on the deployment surface you actually use before committing for a long-document workload.
  • The 2.5 Flash Lite preview SKUs have multiple variants (2025-06-17, 2025-09-25) with similar pricing but distinct behaviour. Pin the variant identifier you are calling explicitly rather than relying on latest-style aliases for production.
  • Pricing on this page is the published list price. Vertex AI routing, batch pricing, and enterprise agreements change the unit economics; list price is a calibration anchor.

When to look outside this era

  • Gemini 3 family (/en/ai/llm/gemini-3) is the natural successor for every tier on this page. If the migration question is still open, that surface is the comparison to read.
  • Cheapest competent long-context API outside Google: DeepSeek V4 Flash ($0.098 / $0.197 with QS 78.1 and 1M context) is the price anchor to beat at the workhorse-with-long-context tier. Gemini 2.5 Flash already loses to it on both quality and pricing.

Sources worth reading

How we score

Quality scores combine multiple public benchmarks (LMArena, LiveBench, SWE-bench, Aider and others) into a single comparable number. Pricing is the published API list price; self-hosted cost depends on your own hardware. We do not accept paid placements.

Author: Boris. Read the full methodology.

Get the next Gemini 2 update

New variants, repriced models, and recommendation changes, in plain English. No spam, no paid placements.

Subscribe →

Need help picking for production?

Independent evaluation against your real workload, your real data, and your real cost ceiling. No vendor incentives.

See services →