Qwen family

Qwen3

Qwen3: Qwen 3.7 Max Preview ranks #9/186 with 262K context at $0.78/$3.9 per 1M. Compare Qwen3, 3.5, 3.6 by workload.

Top in this family

Qwen 3.7 Max Preview ranks #9 of 186 on overall quality (QS 100.9) at $0.78/$3.9 per 1M tokens.

Practical pick

Qwen 3.6 35B-A3B (Thinking) at $0.14/$1 per 1M tokens (rank #54 of 186).

Variants: 30
License: Open weights
Provider: Qwen

★ Most teams should start here

Qwen 3.6 35B-A3B

Variant: Thinking

The family's value champion. Mixture-of-experts: 35B total parameters, only ~3B active per token, so it costs and serves like a small model on capable hardware. Pick this unless you have a specific reason not to.

Quality Score: 82.0
Input: $0.140/1M
Output: $1.00/1M
Context: 262K
License: Open weights
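The 35B-total / ~3B-active split describes compute per token, not weight storage: every expert still has to be resident in memory. A minimal sketch of weights-only sizing follows. The bytes-per-parameter figures are the standard sizes for each precision; the function name and the choice of precisions are illustrative assumptions, and KV cache, activations, and runtime overhead are deliberately left out.

```python
# Weights-only memory estimate for a mixture-of-experts model.
# Assumption: KV cache, activations, and framework overhead are ignored,
# so real deployments need headroom on top of these figures.

# Standard storage size per parameter for common weight precisions.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str) -> float:
    """Approximate weight memory in GB (1 GB = 1e9 bytes)."""
    return params_billions * 1e9 * BYTES_PER_PARAM[precision] / 1e9


TOTAL_PARAMS_B = 35.0   # total parameters, from the text above
ACTIVE_PARAMS_B = 3.0   # approximate active parameters per token

for precision in BYTES_PER_PARAM:
    resident = weight_memory_gb(TOTAL_PARAMS_B, precision)
    active = weight_memory_gb(ACTIVE_PARAMS_B, precision)
    print(f"{precision}: ~{resident:.1f} GB must be resident "
          f"(the active subset alone would be ~{active:.1f} GB)")
```

The gap between the two numbers is the point: throughput scales with the active subset, but the GPU has to hold the full parameter count.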

Best variant by workload

One pick per common job. Pick by what you need to ship, not by which variant has the highest score on a leaderboard you don't use.

Note: these picks assume direct API usage, where cost per million tokens drives the decision. Inside an agent harness (Claude Code, Cursor, etc.) the calculus changes: the harness sets the model, the per-task cost is usually negligible, and the flagship variant tends to win. See our piece on Claude Code for the harness-vs-API framing.
Workload | Best pick | Variant | Price (input / output) | Why
Coding agents | Qwen 3 Coder 480B A35B Instruct | Non-thinking | $0.220/1M / $1.80/1M | Purpose-built coder variant. Strongest coding-focused option in the family. Use when agentic coding throughput matters more than the price gap to the value pick.
General API workhorse | Qwen 3.6 35B-A3B | Thinking | $0.140/1M / $1.00/1M | Best quality-per-dollar for chat-and-tooling at API scale. Because only ~3B parameters are active per token, most providers bill it closer to a 3B model than a 35B one.
Long-context RAG | Qwen 3.6 Plus | Thinking | $0.325/1M / $1.95/1M | Largest context window in the family (1.0M tokens, a size the Flash and Coder variants also list). Prefer when document scale dominates the workload and recall over long inputs is the binding constraint.
Self-host on 1 GPU | Qwen 3.6 35B-A3B | Thinking | $0.140/1M / $1.00/1M | Mixture-of-experts means the active-param compute footprint is closer to a 3B model than a 35B one. The trade-off is memory: total weights still need to fit, so plan for the full parameter count when sizing GPU memory, not the active subset.
Edge / on-device | Qwen3-1.7B (Thinking) | Non-thinking | - | Smallest open-weights variant with usable quality. Fits CPU + small-GPU inference for local or on-device deployment when round-trip latency rules out hosted APIs.
Document AI / OCR | Qwen3 VL 32B | Thinking | $0.104/1M / $0.416/1M | VL variant in the mid-size band. Vision-language coverage is strong enough for layout-aware OCR and table extraction.
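Since these picks hinge on cost per million tokens, it can help to turn the listed prices into a per-request and per-month figure. The sketch below uses the prices from the workload table; the token counts and request volume are hypothetical placeholders, and the `Pricing` class and `request_cost` helper are illustrative names rather than part of any provider SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


# Prices copied from the workload table above.
PRICES = {
    "Qwen 3 Coder 480B A35B Instruct": Pricing(0.220, 1.80),
    "Qwen 3.6 35B-A3B": Pricing(0.140, 1.00),
    "Qwen 3.6 Plus": Pricing(0.325, 1.95),
    "Qwen3 VL 32B": Pricing(0.104, 0.416),
}


def request_cost(p: Pricing, input_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at per-million-token pricing."""
    return (input_tokens * p.input_per_m
            + output_tokens * p.output_per_m) / 1_000_000


# Hypothetical workload: replace with your own traffic profile.
INPUT_TOKENS = 20_000
OUTPUT_TOKENS = 2_000
REQUESTS_PER_DAY = 10_000

for name, pricing in PRICES.items():
    per_request = request_cost(pricing, INPUT_TOKENS, OUTPUT_TOKENS)
    per_month = per_request * REQUESTS_PER_DAY * 30
    print(f"{name}: ${per_request:.4f}/request, ~${per_month:,.0f}/month")
```

Output-heavy workloads shift the comparison toward the output price, so it is worth rerunning with a token split that matches real traffic.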

All variants

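56 variants across 30 models. Current variants come first, then previous-generation ones; each group is sorted by quality score (descending), with unscored variants last.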

Model | Variant | Status | QS | Rank | Scores (GPQA, HLE, SWE, SWE-Pro, Terminal, Tau, MCP, AIME; empty columns omitted) | In $/M | Out $/M | Context | Released
Qwen3 Max | Qwen 3.7 Max Preview | - | 100.9 | #9/186 | - | $0.78 | $3.9 | 262K | -
Qwen3 Max | Qwen 3.6 Max Preview | - | 96.6 | #17/186 | - | $1.04 | $6.24 | 262K | -
3.6 Plus | Thinking | - | 92.6 | #26/186 | 90.4, 28.8, 56.6, 61.6 | $0.325 | $1.95 | 1.0M | -
Qwen3 Max | Qwen3 Max | - | 89.4 | #30/186 | 30.2, 75.3 | $0.78 | $3.9 | 262K | -
3.6 27B | Thinking | - | 87.5 | #35/186 | 87.8, 24.0, 77.2, 53.5 | $0.29 | $3.2 | 262K | Apr 29, 2026
3.6 35B-A3B | Thinking | - | 82.0 | #54/186 | 86.0, 21.4, 73.4, 49.5, 62.8 | $0.14 | $1 | 262K | Apr 29, 2026
3 Coder 480B A35B Instruct | Non-thinking | - | 64.9 | #137/186 | 38.7, 23.9 | $0.22 | $1.8 | 1.0M | Jul 22, 2025
3.6 Flash | Thinking | - | - | - | - | $0.188 | $1.125 | 1.0M | -
Qwen3 Max | Qwen 3.5 Max Preview | - | - | - | - | $0.78 | $3.9 | 262K | -
Qwen3 VL 2B | Thinking | - | - | - | - | - | - | - | -
Qwen3 VL 4B | Thinking | - | - | - | - | - | - | - | -
Qwen3 VL 8B | Thinking | - | - | - | - | $0.117 | $1.365 | 256K | -
Qwen3 VL 8B | Non-Thinking | - | - | - | - | $0.08 | $0.5 | 256K | -
Qwen3 VL 30B A3B | Thinking | - | - | - | - | $0.13 | $1.56 | 131K | -
Qwen3 VL 32B | Thinking | - | - | - | - | $0.104 | $0.416 | 262K | -
Qwen3 VL 32B | Non-Thinking | - | - | - | - | $0.104 | $0.416 | 262K | -
Qwen3 VL 235B A22B | Thinking | - | - | - | - | $0.26 | $2.6 | 131K | -
3.5 122b A10b | Non-Thinking | Previous | 92.7 | #24/186 | 25.3 | $0.26 | $2.08 | 262K | -
3.5 397b A17b | Thinking | Previous | 92.7 | #25/186 | 28.7, 76.4, 83.1 | $0.39 | $2.34 | 262K | -
3.5 122b A10b | Thinking | Previous | 86.4 | #40/186 | 86.6, 25.3, 72.0 | $0.26 | $2.08 | 262K | -
3.5 27b | Thinking | Previous | 84.7 | #46/186 | 85.5, 24.3, 75.0 | $0.195 | $1.56 | 262K | -
3 30b A3b | Thinking | Previous | 83.1 | #51/186 | 65.8, 70.9 | $0.08 | $0.4 | 131K | Apr 29, 2025
3.5 9B | Thinking | Previous | 81.9 | #55/186 | 81.7 | $0.04 | $0.15 | 262K | -
3 14b | Thinking | Previous | 81.0 | #60/186 | 64.0, 70.4 | $0.1 | $0.24 | 132K | Apr 29, 2025
3.5 35b A3b | Thinking | Previous | 80.8 | #62/186 | 84.2, 22.4, 69.2 | $0.14 | $1 | 262K | -
Qwen3 Next 80B A3B | Thinking | Previous | 80.4 | #66/186 | 77.2 | $0.098 | $0.78 | 262K | Sep 11, 2025
3 8b | Thinking | Previous | 77.8 | #81/186 | 62.0, 67.3 | $0.05 | $0.4 | 131K | Apr 29, 2025
3.5 4B | Thinking | Previous | 75.4 | #91/186 | 76.2 | - | - | - | -
3 235b A22b | thinking-2507 | Previous | 75.4 | #92/186 | 15.8, 71.9 | $0.1 | $0.1 | 262K | Apr 29, 2025
3 235b A22b | Thinking | Previous | 72.5 | #105/186 | 71.1, 7.6, 34.4, 21.4, 58.6, 81.5 | $0.455 | $1.82 | 131K | Apr 29, 2025
3 32b | Thinking | Previous | 72.5 | #106/186 | 68.4, 72.9 | $0.08 | $0.28 | 131K | Apr 29, 2025
Qwen3-4B (Thinking) | Thinking | Previous | 72.5 | #107/186 | 55.9, 65.6 | - | - | - | Apr 29, 2025
3 235b A22b | non-thinking-2507 | Previous | 70.8 | #113/186 | - | $0.071 | $0.1 | 262K | Apr 29, 2025
3 30b A3b | Thinking (2507) | Previous | 68.4 | #124/186 | 73.4, 9.8, 22.0, 85.0 | $0.09 | $0.45 | 131K | Apr 29, 2025
3 14b | Non-thinking | Previous | 67.2 | #127/186 | 54.8, 23.3 | $0.1 | $0.24 | 132K | Apr 29, 2025
3 32b | Non-thinking | Previous | 66.3 | #133/186 | 54.6, 20.2 | $0.08 | $0.28 | 131K | Apr 29, 2025
3 30b A3b | Non-thinking | Previous | 65.4 | #135/186 | 54.8, 21.6 | $0.09 | $0.45 | 131K | Apr 29, 2025
3 235b A22b | Non-Thinking | Previous | 64.0 | #140/186 | 62.9, 34.4, 57.0, 24.7 | $0.455 | $1.82 | 131K | Apr 29, 2025
Qwen3-1.7B (Thinking) | Thinking | Previous | 62.8 | #146/186 | 40.1, 36.8 | - | - | - | Apr 29, 2025
3 8b | Non-thinking | Previous | 60.9 | #152/186 | 39.3, 20.9 | $0.05 | $0.4 | 131K | Apr 29, 2025
Qwen3-4B (Thinking) | Non-thinking | Previous | 60.0 | #158/186 | 41.7, 19.1 | - | - | - | Apr 29, 2025
Qwen3-0.6B (Thinking) | Thinking | Previous | 50.0 | #177/186 | 27.9, 15.1 | - | - | - | Apr 29, 2025
Qwen3-1.7B (Thinking) | Non-thinking | Previous | 48.0 | #180/186 | 28.6, 9.8 | - | - | - | Apr 29, 2025
Qwen3-0.6B (Thinking) | Non-thinking | Previous | 37.2 | #185/186 | 22.9, 2.6 | - | - | - | Apr 29, 2025
3 30b A3b | Non-Thinking (2507) | Previous | - | - | - | $0.043 | $0.172 | 131K | Apr 29, 2025
3 235b A22b | vl-235b-a22b-instruct | Previous | - | - | - | $0.455 | $1.82 | 131K | Apr 29, 2025
3 235b A22b | vl-235b-a22b-thinking | Previous | - | - | - | $0.455 | $1.82 | 131K | Apr 29, 2025
3.5 0.8B | Thinking | Previous | - | - | - | - | - | - | -
3.5 0.8B | Non-Thinking | Previous | - | - | - | - | - | - | -
3.5 2B | Thinking | Previous | - | - | - | - | - | - | -
3.5 2B | Non-Thinking | Previous | - | - | - | - | - | - | -
3.5 4B | Non-Thinking | Previous | - | - | - | - | - | - | -
3.5 9B | Non-Thinking | Previous | - | - | - | $0.04 | $0.15 | 262K | -
3.5 27b | Non-Thinking | Previous | - | - | - | $0.195 | $1.56 | 262K | -
3.5 35b A3b | Non-Thinking | Previous | - | - | - | $0.14 | $1 | 262K | -
3.5 Flash | Thinking | Previous | - | - | - | $0.065 | $0.26 | 1.0M | -
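One rough way to read the table for value is quality score per blended dollar. The sketch below is an illustration under stated assumptions, not the site's own ranking method: it takes the current-generation rows that list both a quality score and prices, and blends input and output price with a hypothetical 75/25 token split. The `blended_price` and `rank_by_value` helpers are names introduced here.

```python
# (variant, quality score, input $/M, output $/M), copied from the table above.
# Only current-generation rows with both a score and prices are included.
ROWS = [
    ("Qwen 3.7 Max Preview", 100.9, 0.78, 3.9),
    ("Qwen 3.6 Max Preview", 96.6, 1.04, 6.24),
    ("3.6 Plus (Thinking)", 92.6, 0.325, 1.95),
    ("Qwen3 Max", 89.4, 0.78, 3.9),
    ("3.6 27B (Thinking)", 87.5, 0.29, 3.2),
    ("3.6 35B-A3B (Thinking)", 82.0, 0.14, 1.0),
    ("3 Coder 480B A35B Instruct", 64.9, 0.22, 1.8),
]

# Assumption: 75% of billed tokens are input, 25% are output.
INPUT_SHARE = 0.75


def blended_price(in_price: float, out_price: float,
                  input_share: float = INPUT_SHARE) -> float:
    """Weighted USD price per 1M tokens for a given input/output mix."""
    return in_price * input_share + out_price * (1.0 - input_share)


def rank_by_value(rows):
    """Sort variants by quality score per blended dollar, best first."""
    scored = [(name, qs / blended_price(inp, out))
              for name, qs, inp, out in rows]
    return sorted(scored, key=lambda item: item[1], reverse=True)


for name, value in rank_by_value(ROWS):
    print(f"{name}: {value:.1f} QS points per blended $/M")
```

The ordering is sensitive to the assumed token mix and ignores whether a variant clears the quality bar a workload needs, so treat it as a first filter rather than a verdict.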

Benchmark evidence

Every benchmark we track for this family, across capabilities. The headline Quality Score draws from a deliberately narrow, governed panel (203 of 1031 rows here feed it); the rest is tracked evidence — recorded and comparable, but not folded into one synthetic score.

Model / Variant | Benchmark | Score | Rank | Scoring
Qwen 3.6 Plus · Thinking | MCP Atlas · public_set | 74.1 | 1 / 13 | In Quality Score
Qwen 3.5 397b A17b · Thinking | MMLU Pro | 87.8 | 3 / 86 | In Quality Score
Qwen 3.5 122b A10b · Non-Thinking | LiveCodeBench · v5 | 78.9 | 3 / 5 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | LiveCodeBench · 2024_07_2025_01 | 78.2 | 4 / 8 | In Quality Score
Qwen3 Max · Qwen 3.7 Max Preview | SimpleBench | 70.4 | 4 / 61 | In Quality Score
Qwen 3 235b A22b · Thinking | LiveCodeBench · 2024_08_2025_05 | 66.5 | 4 / 17 | In Quality Score
Qwen 3.5 397b A17b · Thinking | Humanity's Last Exam · verified | 37.6 | 4 / 5 | In Quality Score
Qwen3 Max · Qwen3 Max | LiveCodeBench · v6 | 85.9 | 5 / 40 | In Quality Score

Reasoning

Model / Variant | Benchmark | Score | Rank | Scoring
Qwen 3.5 397b A17b · Thinking | MMLU Pro | 87.8 | 3 / 86 | In Quality Score
Qwen3 Max · Qwen 3.7 Max Preview | SimpleBench | 70.4 | 4 / 61 | In Quality Score
Qwen 3.5 397b A17b · Thinking | Humanity's Last Exam · verified | 37.6 | 4 / 5 | In Quality Score
Qwen3 Max · Qwen3 Max | Humanity's Last Exam · verified | 37.6 | 5 / 5 | In Quality Score
Qwen 3 235b A22b · Thinking | LiveBench | 77.1 | 6 / 110 | In Quality Score
Qwen 3.5 122b A10b · Non-Thinking | AIME 2025 · no_tools | 90.4 | 7 / 15 | In Quality Score
Qwen3 Max · Qwen 3.6 Max Preview | SimpleBench | 63 | 8 / 61 | In Quality Score
Qwen 3.5 122b A10b · Thinking | MMLU Pro | 86.7 | 10 / 86 | In Quality Score
Qwen 3.5 122b A10b · Non-Thinking | MMLU Pro | 86.7 | 11 / 86 | In Quality Score
Qwen 3.6 Plus · Thinking | Humanity's Last Exam · tools | 50.6 | 11 / 38 | In Quality Score
Qwen3 Max · Qwen 3.7 Max Preview | Arena Elo | 1475 | 13 / 158 | In Quality Score
Qwen 3.6 Plus · Thinking | GPQA Diamond | 90.4 | 13 / 143 | In Quality Score
Qwen 3 32b · Thinking | LiveBench | 74.9 | 14 / 110 | In Quality Score
Qwen3 Max · Qwen3 Max | Humanity's Last Exam · tools | 49.8 | 14 / 38 | In Quality Score
Qwen 3.6 27B · Thinking | MMLU Pro | 86.2 | 15 / 86 | In Quality Score
Qwen 3.5 27b · Thinking | MMLU Pro | 86.1 | 16 / 86 | In Quality Score
Qwen 3.5 27b · Thinking | Humanity's Last Exam · tools | 48.5 | 16 / 38 | In Quality Score
Qwen 3.5 397b A17b · Thinking | Humanity's Last Exam · tools | 48.3 | 17 / 38 | In Quality Score
Qwen3 Max · Qwen3 Max | MMLU Pro | 85.7 | 18 / 86 | In Quality Score
Qwen 3.5 35b A3b · Thinking | MMLU Pro | 85.3 | 19 / 86 | In Quality Score
Qwen 3 30b A3b · Thinking | LiveBench | 74.3 | 19 / 110 | In Quality Score
Qwen 3.5 122b A10b · Thinking | Humanity's Last Exam · tools | 47.5 | 19 / 38 | In Quality Score
Qwen3 Max · Qwen 3.7 Max Preview | LiveBench | 74.3 | 20 / 110 | In Quality Score
Qwen 3.5 35b A3b · Thinking | Humanity's Last Exam · tools | 47.4 | 20 / 38 | In Quality Score
Qwen 3.6 27B · Thinking | GPQA Diamond | 87.8 | 21 / 143 | In Quality Score
Qwen 3.6 35B-A3B · Thinking | MMLU Pro | 85.2 | 21 / 86 | In Quality Score
Qwen3 Max · Qwen3 Max | Humanity's Last Exam · hle | 30.2 | 21 / 90 | In Quality Score
Qwen3 Max · Qwen 3.5 Max Preview | Arena Elo | 1466 | 22 / 158 | In Quality Score
Qwen 3.6 Plus · Thinking | Humanity's Last Exam · hle | 28.8 | 22 / 90 | In Quality Score
Qwen 3.5 397b A17b · Thinking | Humanity's Last Exam · hle | 28.7 | 23 / 90 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | AIME 2025 | 85 | 24 / 88 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | Humanity's Last Exam · hle_text | 15.4 | 24 / 56 | In Quality Score
Qwen 3.5 397b A17b · Thinking | AIME 2025 | 83.1 | 26 / 88 | In Quality Score
Qwen3 Max · Qwen 3.6 Max Preview | Arena Elo | 1459 | 27 / 158 | In Quality Score
Qwen 3.5 122b A10b · Thinking | GPQA Diamond | 86.6 | 27 / 143 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | MMLU Pro | 84.5 | 29 / 86 | In Quality Score
Qwen 3 235b A22b · Thinking | AIME 2025 | 81.5 | 29 / 88 | In Quality Score
Qwen 3.5 122b A10b · Thinking | Humanity's Last Exam · hle | 25.3 | 29 / 90 | In Quality Score
Qwen 3.6 35B-A3B · Thinking | GPQA Diamond | 86 | 30 / 143 | In Quality Score
Qwen 3.5 122b A10b · Non-Thinking | Humanity's Last Exam · hle | 25.3 | 30 / 90 | In Quality Score
Qwen 3 14b · Thinking | LiveBench | 71.3 | 31 / 110 | In Quality Score
Qwen 3.6 Plus · Thinking | LiveBench | 70.8 | 32 / 110 | In Quality Score
Qwen 3.5 27b · Thinking | GPQA Diamond | 85.5 | 34 / 143 | In Quality Score
Qwen 3 235b A22b · Thinking | MMLU Pro | 83 | 35 / 86 | In Quality Score
Qwen 3.5 27b · Thinking | Humanity's Last Exam · hle | 24.3 | 35 / 90 | In Quality Score
Qwen 3.6 27B · Thinking | Humanity's Last Exam · hle | 24 | 36 / 90 | In Quality Score
Qwen3 Next 80B A3B · Thinking | MMLU Pro | 82.7 | 37 / 86 | In Quality Score
Qwen 3.5 35b A3b · Thinking | Humanity's Last Exam · hle | 22.4 | 37 / 90 | In Quality Score
Qwen 3 32b · Thinking | AIME 2025 | 72.9 | 38 / 88 | In Quality Score
Qwen 3.5 9B · Thinking | MMLU Pro | 82.5 | 39 / 86 | In Quality Score
Qwen 3.5 35b A3b · Thinking | GPQA Diamond | 84.2 | 40 / 143 | In Quality Score
Qwen 3 30b A3b · Thinking | AIME 2025 | 70.9 | 40 / 88 | In Quality Score
Qwen 3.6 35B-A3B · Thinking | Humanity's Last Exam · hle | 21.4 | 40 / 90 | In Quality Score
Qwen 3 14b · Thinking | AIME 2025 | 70.4 | 42 / 88 | In Quality Score
Qwen 3.5 397b A17b · Thinking | Arena Elo | 1445 | 43 / 158 | In Quality Score
Qwen 3.6 Plus · Thinking | Arena Elo | 1444 | 44 / 158 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | LiveBench | 67.6 | 44 / 110 | In Quality Score
Qwen 3 8b · Thinking | AIME 2025 | 67.3 | 45 / 88 | In Quality Score
Qwen3-4B (Thinking) · Thinking | AIME 2025 | 65.6 | 46 / 88 | In Quality Score
Qwen 3 8b · Thinking | LiveBench | 67.1 | 47 / 110 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | MMLU Pro | 80.9 | 48 / 86 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | Humanity's Last Exam · hle_text | 5.7 | 48 / 56 | In Quality Score
Qwen 3.5 9B · Thinking | GPQA Diamond | 81.7 | 49 / 143 | In Quality Score
Qwen 3.6 27B · Thinking | LiveBench | 65.6 | 49 / 110 | In Quality Score
Qwen3-4B (Thinking) · Thinking | LiveBench | 63.6 | 50 / 110 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | Humanity's Last Exam · hle | 15.8 | 52 / 90 | In Quality Score
Qwen 3.5 4B · Thinking | MMLU Pro | 79.1 | 54 / 86 | In Quality Score
Qwen3-1.7B (Thinking) · Thinking | AIME 2025 | 36.8 | 55 / 88 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | MMLU Pro | 77.3 | 56 / 86 | In Quality Score
Qwen3 Next 80B A3B · Thinking | GPQA Diamond | 77.2 | 60 / 143 | In Quality Score
Qwen3-4B (Thinking) · Thinking | MMLU Pro | 74 | 61 / 86 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | AIME 2025 | 24.7 | 61 / 88 | In Quality Score
Qwen 3 235b A22b · non-thinking-2507 | Arena Elo | 1423 | 62 / 158 | In Quality Score
Qwen 3.5 4B · Thinking | GPQA Diamond | 76.2 | 62 / 143 | In Quality Score
Qwen 3.6 Flash · Thinking | LiveBench | 60.4 | 62 / 110 | In Quality Score
Qwen 3 14b · Non-thinking | AIME 2025 | 23.3 | 63 / 88 | In Quality Score
Qwen3-4B (Thinking) · Non-thinking | MMLU Pro | 69.6 | 64 / 86 | In Quality Score
Qwen 3 30b A3b · Non-thinking | AIME 2025 | 21.6 | 64 / 88 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | Humanity's Last Exam · hle | 9.8 | 65 / 90 | In Quality Score
Qwen 3 32b · Non-thinking | LiveBench | 59.8 | 66 / 110 | In Quality Score
Qwen 3 8b · Non-thinking | AIME 2025 | 20.9 | 66 / 88 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | GPQA Diamond | 73.4 | 67 / 143 | In Quality Score
Qwen 3 14b · Non-thinking | LiveBench | 59.6 | 67 / 110 | In Quality Score
Qwen 3 32b · Non-thinking | AIME 2025 | 20.2 | 68 / 88 | In Quality Score
Qwen 3 30b A3b · Non-thinking | LiveBench | 59.4 | 69 / 110 | In Quality Score
Qwen3-4B (Thinking) · Non-thinking | AIME 2025 | 19.1 | 69 / 88 | In Quality Score
Qwen 3.5 2B · Thinking | MMLU Pro | 66.5 | 70 / 86 | In Quality Score
Qwen 3.5 122b A10b · Thinking | Arena Elo | 1417 | 71 / 158 | In Quality Score
Qwen3-0.6B (Thinking) · Thinking | AIME 2025 | 15.1 | 73 / 88 | In Quality Score
Qwen 3 235b A22b · vl-235b-a22b-instruct | Arena Elo | 1415 | 74 / 158 | In Quality Score
Qwen 3 235b A22b · Thinking | GPQA Diamond | 71.1 | 74 / 143 | In Quality Score
Qwen3-1.7B (Thinking) · Thinking | MMLU Pro | 56.5 | 76 / 86 | In Quality Score
Qwen 3 8b · Non-thinking | LiveBench | 53.5 | 76 / 110 | In Quality Score
Qwen 3.5 2B · Non-Thinking | MMLU Pro | 55.3 | 77 / 86 | In Quality Score
Qwen 3 235b A22b · Thinking | Humanity's Last Exam · hle | 7.6 | 77 / 90 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | LiveBench | 53.0 | 78 / 110 | In Quality Score
Qwen3-1.7B (Thinking) · Non-thinking | AIME 2025 | 9.8 | 80 / 88 | In Quality Score
Qwen3-1.7B (Thinking) · Thinking | LiveBench | 51.1 | 82 / 110 | In Quality Score
Qwen 3.5 0.8B · Thinking | MMLU Pro | 42.3 | 82 / 86 | In Quality Score
Qwen 3 32b · Thinking | GPQA Diamond | 68.4 | 83 / 143 | In Quality Score
Qwen 3.5 27b · Thinking | Arena Elo | 1408 | 84 / 158 | In Quality Score
Qwen3-1.7B (Thinking) · Non-thinking | MMLU Pro | 40.2 | 84 / 86 | In Quality Score
Qwen 3.5 0.8B · Non-Thinking | MMLU Pro | 29.7 | 85 / 86 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | Arena Elo | 1403 | 86 / 158 | In Quality Score
Qwen3-0.6B (Thinking) · Non-thinking | AIME 2025 | 2.6 | 87 / 88 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | Arena Elo | 1400 | 88 / 158 | In Quality Score
Qwen 3 235b A22b · non-thinking-2507 | LiveBench | 48.8 | 88 / 110 | In Quality Score
Qwen3-4B (Thinking) · Non-thinking | LiveBench | 48.4 | 89 / 110 | In Quality Score
Qwen 3 30b A3b · Thinking | GPQA Diamond | 65.8 | 90 / 143 | In Quality Score
Qwen 3 235b A22b · vl-235b-a22b-thinking | Arena Elo | 1396 | 91 / 158 | In Quality Score
Qwen 3.5 35b A3b · Thinking | Arena Elo | 1396 | 92 / 158 | In Quality Score
Qwen 3.5 Flash · Thinking | Arena Elo | 1396 | 93 / 158 | In Quality Score
Qwen 3 14b · Thinking | GPQA Diamond | 64 | 95 / 143 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | GPQA Diamond | 62.9 | 96 / 143 | In Quality Score
Qwen 3 8b · Thinking | GPQA Diamond | 62 | 98 / 143 | In Quality Score
Qwen 3 Coder 480B A35B Instruct · Non-thinking | Arena Elo | 1388 | 100 / 158 | In Quality Score
Qwen3-1.7B (Thinking) · Non-thinking | LiveBench | 35.6 | 101 / 110 | In Quality Score
Qwen 3 30b A3b · Non-Thinking (2507) | Arena Elo | 1384 | 104 / 158 | In Quality Score
Qwen3-0.6B (Thinking) · Thinking | LiveBench | 30.3 | 105 / 110 | In Quality Score
Qwen3-4B (Thinking) · Thinking | GPQA Diamond | 55.9 | 106 / 143 | In Quality Score
Qwen 3 14b · Non-thinking | GPQA Diamond | 54.8 | 107 / 143 | In Quality Score
Qwen 3 235b A22b · Thinking | Arena Elo | 1375 | 108 / 158 | In Quality Score
Qwen 3 30b A3b · Non-thinking | GPQA Diamond | 54.8 | 108 / 143 | In Quality Score
Qwen 3 32b · Non-thinking | GPQA Diamond | 54.6 | 109 / 143 | In Quality Score
Qwen3-0.6B (Thinking) · Non-thinking | LiveBench | 21.8 | 109 / 110 | In Quality Score
Qwen 3 32b · Non-thinking | Arena Elo | 1347 | 123 / 158 | In Quality Score
Qwen3-4B (Thinking) · Non-thinking | GPQA Diamond | 41.7 | 126 / 143 | In Quality Score
Qwen3-1.7B (Thinking) · Thinking | GPQA Diamond | 40.1 | 130 / 143 | In Quality Score
Qwen 3 8b · Non-thinking | GPQA Diamond | 39.3 | 131 / 143 | In Quality Score
Qwen 3 30b A3b · Non-thinking | Arena Elo | 1327 | 133 / 158 | In Quality Score
Qwen3-1.7B (Thinking) · Non-thinking | GPQA Diamond | 28.6 | 137 / 143 | In Quality Score
Qwen3-0.6B (Thinking) · Thinking | GPQA Diamond | 27.9 | 138 / 143 | In Quality Score
Qwen3-0.6B (Thinking) · Non-thinking | GPQA Diamond | 22.9 | 142 / 143 | In Quality Score
Qwen 3.5 27b · Thinking | IFBench | 76.5 | 1 / 28 | Tracked evidence
Qwen3 Max · Qwen3 Max | HMMT Feb 2025 | 98 | 2 / 44 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | Arena-Hard | 96.1 | 2 / 40 | Tracked evidence
Qwen 3 235b A22b · thinking-2507 | AIME 2024 | 94.1 | 2 / 69 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MAXIFE | 88.2 | 2 / 21 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | IFBench | 76.5 | 2 / 28 | Tracked evidence
Qwen 3 235b A22b · Thinking | Arena-Hard | 95.6 | 3 / 40 | Tracked evidence
Qwen 3.5 27b · Thinking | MAXIFE | 88 | 3 / 21 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | WMT24++ | 78.9 | 3 / 6 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | IFBench | 76.1 | 3 / 28 | Tracked evidence
Qwen 3 14b · Thinking | Multi-IF | 74.8 | 3 / 32 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | BrowseComp_zh | 70.3 | 3 / 20 | Tracked evidence
Qwen 3 32b · Thinking | Arena-Hard | 93.8 | 4 / 40 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MAXIFE | 87.9 | 4 / 21 | Tracked evidence
Qwen 3 32b · Thinking | Multi-IF | 73 | 4 / 32 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | BrowseComp_zh | 69.9 | 4 / 20 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | HMMT Feb 2025 | 94.8 | 5 / 44 | Tracked evidence
Qwen 3 32b · Non-thinking | Arena-Hard | 92.8 | 5 / 40 | Tracked evidence
Qwen 3.6 Plus · Thinking | HMMT Feb 2026 | 87.8 | 5 / 16 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | BrowseComp · context_manage | 78.6 | 5 / 15 | Tracked evidence
Qwen 3 30b A3b · Thinking | Multi-IF | 72.2 | 5 / 32 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | BrowseComp_zh | 69.5 | 5 / 20 | Tracked evidence
Qwen3 Max · Qwen3 Max | HMMT Nov 2025 | 94.7 | 6 / 31 | Tracked evidence
Qwen 3.6 27B · Thinking | HMMT Feb 2025 | 93.8 | 6 / 44 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MAXIFE | 86.6 | 6 / 21 | Tracked evidence
Qwen3 Max · Qwen3 Max | IMO AnswerBench | 83.9 | 6 / 28 | Tracked evidence
Qwen3 Max · Qwen3 Max | WMT24++ | 77.6 | 6 / 6 | Tracked evidence
Qwen 3 235b A22b · Thinking | Multi-IF | 71.9 | 6 / 32 | Tracked evidence
Qwen3 Max · Qwen3 Max | IFBench | 70.9 | 6 / 28 | Tracked evidence
Qwen 3.6 Plus · Thinking | HMMT Nov 2025 | 94.6 | 7 / 31 | Tracked evidence
Qwen 3.6 Plus · Thinking | IMO AnswerBench | 83.8 | 7 / 28 | Tracked evidence
Qwen 3 235b A22b · thinking-2507 | BFCL v3 | 71.9 | 7 / 49 | Tracked evidence
Qwen 3 32b · Non-thinking | Multi-IF | 70.7 | 7 / 32 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | AceBench | 70.5 | 7 / 7 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | IFBench | 70.2 | 7 / 28 | Tracked evidence
Qwen 3 14b · Thinking | Arena-Hard | 91.7 | 8 / 40 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | Global PIQA | 89.8 | 8 / 26 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MMMLU | 88.5 | 8 / 38 | Tracked evidence
Qwen 3.6 27B · Thinking | HMMT Feb 2026 | 84.3 | 8 / 16 | Tracked evidence
Qwen3 Max · Qwen3 Max | MAXIFE | 84 | 8 / 21 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | Multi-IF | 70.2 | 8 / 32 | Tracked evidence
Qwen 3.6 Plus · Thinking | AIME 2026 | 95.1 | 9 / 19 | Tracked evidence
Qwen 3 30b A3b · Thinking | Arena-Hard | 91 | 9 / 40 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | HMMT Feb 2026 | 83.6 | 9 / 16 | Tracked evidence
Qwen 3 235b A22b · Thinking | BFCL v3 | 70.8 | 9 / 49 | Tracked evidence
Qwen 3 235b A22b · thinking-2507 | MATH 500 | 98 | 10 / 55 | Tracked evidence
Qwen 3.5 9B · Thinking | MAXIFE | 83.4 | 10 / 21 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MMMU PRO | 79 | 10 / 52 | Tracked evidence
Qwen 3 14b · Thinking | BFCL v3 | 70.4 | 10 / 49 | Tracked evidence
Qwen 3 8b · Non-thinking | Multi-IF | 69.2 | 10 / 32 | Tracked evidence
Qwen 3 30b A3b · Thinking | MATH 500 | 98 | 11 / 55 | Tracked evidence
Qwen 3.6 27B · Thinking | AIME 2026 | 94.1 | 11 / 19 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | HMMT Nov 2025 | 92.7 | 11 / 31 | Tracked evidence
Qwen 3.5 27b · Thinking | HMMT Feb 2025 | 92 | 11 / 44 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | Global PIQA | 88.4 | 11 / 26 | Tracked evidence
Qwen 3 235b A22b · Thinking | AIME 2024 | 85.7 | 11 / 69 | Tracked evidence
Qwen 3 235b A22b · Thinking | MAXIFE | 83.2 | 11 / 21 | Tracked evidence
Qwen 3 32b · Thinking | BFCL v3 | 70.3 | 11 / 49 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | HMMT Feb 2025 | 91.4 | 12 / 44 | Tracked evidence
Qwen 3 30b A3b · Non-thinking | Arena-Hard | 88 | 12 / 40 | Tracked evidence
Qwen 3.5 27b · Thinking | Global PIQA | 87.5 | 12 / 26 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MMMLU | 86.7 | 12 / 38 | Tracked evidence
Qwen 3 30b A3b · Thinking | BFCL v3 | 69.1 | 12 / 49 | Tracked evidence
Qwen 3 8b · Thinking | MATH 500 | 97.4 | 13 / 55 | Tracked evidence
Qwen3 Max · Qwen3 Max | AIME 2026 | 93.3 | 13 / 19 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | HMMT Feb 2025 | 90.7 | 13 / 44 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | Global PIQA | 86.6 | 13 / 26 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | MAXIFE | 79.9 | 13 / 21 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MMMU PRO | 76.9 | 13 / 52 | Tracked evidence
Qwen3 VL 32B · Thinking | MMMU · mmmu_single | 72.2 | 13 / 22 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | BrowseComp | 69 | 13 / 51 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | Multi-IF | 66.3 | 13 / 32 | Tracked evidence
Qwen 3.5 27b · Thinking | BrowseComp_zh | 62.1 | 13 / 20 | Tracked evidence
Qwen 3.6 27B · Thinking | HMMT Nov 2025 | 90.7 | 14 / 31 | Tracked evidence
Qwen 3 14b · Non-thinking | Arena-Hard | 86.3 | 14 / 40 | Tracked evidence
Qwen3 Max · Qwen3 Max | Global PIQA | 86 | 14 / 26 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | IMO AnswerBench | 80.9 | 14 / 28 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | MMMU · mmmu_single | 70.6 | 14 / 22 | Tracked evidence
Qwen 3 8b · Thinking | BFCL v3 | 68.1 | 14 / 49 | Tracked evidence
Qwen3 Max · Qwen3 Max | BrowseComp_zh | 60.9 | 14 / 20 | Tracked evidence
Qwen 3 235b A22b · thinking-2507 | SciCode | 42.9 | 14 / 24 | Tracked evidence
Qwen 3 32b · Thinking | MATH 500 | 97.2 | 15 / 55 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | AIME 2026 | 92.7 | 15 / 19 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | HMMT Nov 2025 | 90.3 | 15 / 31 | Tracked evidence
Qwen 3 8b · Thinking | Arena-Hard | 85.8 | 15 / 40 | Tracked evidence
Qwen 3 235b A22b · Thinking | Global PIQA | 85.7 | 15 / 26 | Tracked evidence
Qwen 3 32b · Thinking | AIME 2024 | 81.4 | 15 / 69 | Tracked evidence
Qwen 3.6 27B · Thinking | IMO AnswerBench | 80.8 | 15 / 28 | Tracked evidence
Qwen 3.5 4B · Thinking | MAXIFE | 78 | 15 / 21 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | BFCL v3 | 68 | 15 / 49 | Tracked evidence
Qwen3 VL 8B · Thinking | MMMU · mmmu_single | 65.3 | 15 / 22 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | MATH 500 | 97 | 16 / 55 | Tracked evidence
Qwen 3.5 27b · Thinking | AIME 2026 | 92.6 | 16 / 19 | Tracked evidence
Qwen 3.5 27b · Thinking | MMMLU | 85.9 | 16 / 38 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | IMO AnswerBench | 78.9 | 16 / 28 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | MAXIFE | 77.4 | 16 / 21 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | MMMU · mmmu_single | 64.6 | 16 / 22 | Tracked evidence
Qwen 3.5 9B · Thinking | IFBench | 64.5 | 16 / 28 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | AIME 2026 | 91.3 | 17 / 19 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | HMMT Feb 2025 | 89 | 17 / 44 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MMMLU | 85.2 | 17 / 38 | Tracked evidence
Qwen 3 30b A3b · Thinking | AIME 2024 | 80.4 | 17 / 69 | Tracked evidence
Qwen 3.6 27B · Thinking | MMMU PRO | 75.8 | 17 / 52 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | BrowseComp | 63.8 | 17 / 51 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | IFBench | 61.5 | 17 / 28 | Tracked evidence
Qwen 3 14b · Thinking | MATH 500 | 96.8 | 18 / 55 | Tracked evidence
Qwen 3.5 27b · Thinking | HMMT Nov 2025 | 89.8 | 18 / 31 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | Global PIQA | 83.5 | 18 / 26 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | MMMU PRO | 75.3 | 18 / 52 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | MAXIFE | 72.1 | 18 / 21 | Tracked evidence
Qwen 3.5 4B · Thinking | IFBench | 59.2 | 18 / 28 | Tracked evidence
Qwen 3 235b A22b · Thinking | HMMT Nov 2025 | 89.5 | 19 / 31 | Tracked evidence
Qwen 3.5 9B · Thinking | Global PIQA | 83.2 | 19 / 26 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | Multi-IF | 61.3 | 19 / 32 | Tracked evidence
Qwen 3.5 27b · Thinking | BrowseComp | 61 | 19 / 51 | Tracked evidence
Qwen 3.5 2B · Thinking | MAXIFE | 60.6 | 19 / 21 | Tracked evidence
Qwen 3 235b A22b · Thinking | MATH 500 | 96.2 | 20 / 55 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | HMMT Nov 2025 | 89.2 | 20 / 31 | Tracked evidence
Qwen3 Max · Qwen3 Max | MMMLU | 84.4 | 20 / 38 | Tracked evidence
Qwen 3.5 9B · Thinking | HMMT Feb 2025 | 83.2 | 20 / 44 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | Global PIQA | 80.2 | 20 / 26 | Tracked evidence
Qwen 3 14b · Thinking | AIME 2024 | 79.3 | 20 / 69 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | BFCL v3 | 65.9 | 20 / 49 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | BrowseComp | 61 | 20 / 51 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | MAXIFE | 50.7 | 20 / 21 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | HMMT Nov 2025 | 89.1 | 21 / 31 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | MMLU | 87 | 21 / 33 | Tracked evidence
Qwen 3 235b A22b · Thinking | MMMLU | 83.4 | 21 / 38 | Tracked evidence
Qwen 3 8b · Non-thinking | Arena-Hard | 79.6 | 21 / 40 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MMMU PRO | 75.1 | 21 / 52 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MAXIFE | 39.2 | 21 / 21 | Tracked evidence
Qwen 3.5 4B · Thinking | Global PIQA | 78.9 | 22 / 26 | Tracked evidence
Qwen 3 8b · Thinking | AIME 2024 | 76 | 22 / 69 | Tracked evidence
Qwen 3.5 27b · Thinking | MMMU PRO | 75 | 22 / 52 | Tracked evidence
Qwen 3.5 9B · Thinking | HMMT Nov 2025 | 82.9 | 23 / 31 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | MMMLU | 81.3 | 23 / 38 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | Arena-Hard | 76.6 | 23 / 40 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | Global PIQA | 73.5 | 23 / 26 | Tracked evidence
Qwen 3 235b A22b · Thinking | IFBench | 51.7 | 23 / 28 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | Multi-IF | 51.2 | 23 / 32 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | MATH 500 | 93.4 | 24 / 55 | Tracked evidence
Qwen 3.5 9B · Thinking | MMMLU | 81.2 | 24 / 38 | Tracked evidence
Qwen 3.5 4B · Thinking | HMMT Feb 2025 | 74 | 24 / 44 | Tracked evidence
Qwen 3.5 2B · Thinking | Global PIQA | 69.3 | 24 / 26 | Tracked evidence
Qwen3 Max · Qwen3 Max | BrowseComp | 53.9 | 24 / 51 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | IFBench | 51.5 | 24 / 28 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | HMMT Nov 2025 | 81.2 | 25 / 31 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | MMMLU | 78.4 | 25 / 38 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | Global PIQA | 63.1 | 25 / 26 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | IFBench | 50.4 | 25 / 28 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | MATH 500 | 91.2 | 26 / 55 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | AIME 2024 | 73.8 | 26 / 69 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | HMMT Feb 2025 | 73.7 | 26 / 44 | Tracked evidence
Qwen 3 32b · Non-thinking | BFCL v3 | 63 | 26 / 49 | Tracked evidence
Qwen 3.5 0.8B · Thinking | Global PIQA | 59.4 | 26 / 26 | Tracked evidence
Qwen 3.5 2B · Thinking | IFBench | 41.3 | 26 / 28 | Tracked evidence
Qwen 3.5 4B · Thinking | HMMT Nov 2025 | 76.8 | 27 / 31 | Tracked evidence
Qwen3-1.7B (Thinking) · Non-thinking | Multi-IF | 44.7 | 27 / 32 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | IFBench | 26.7 | 27 / 28 | Tracked evidence
Qwen 3.5 4B · Thinking | MMMLU | 76.1 | 28 / 38 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | HMMT Nov 2025 | 73.8 | 28 / 31 | Tracked evidence
Qwen 3.5 9B · Thinking | MMMU PRO | 70.1 | 28 / 52 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | HMMT Feb 2025 | 63.1 | 28 / 44 | Tracked evidence
Qwen 3.5 0.8B · Thinking | IFBench | 21 | 28 / 28 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | MMMLU | 70.8 | 29 / 38 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | HMMT Nov 2025 | 69.6 | 29 / 31 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MMMU PRO | 69.3 | 29 / 52 | Tracked evidence
Qwen 3 235b A22b · Thinking | HMMT Feb 2025 | 62.5 | 29 / 44 | Tracked evidence
Qwen 3 14b · Non-thinking | BFCL v3 | 61.5 | 29 / 49 | Tracked evidence
Qwen3-0.6B (Thinking) · Thinking | Multi-IF | 36.1 | 29 / 32 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | Arena-Hard | 66.2 | 30 / 40 | Tracked evidence
Qwen 3 8b · Non-thinking | BFCL v3 | 60.2 | 30 / 49 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | HMMT Feb 2025 | 57.5 | 30 / 44 | Tracked evidence
Qwen3-0.6B (Thinking) · Non-thinking | Multi-IF | 33.3 | 30 / 32 | Tracked evidence
Qwen 3.5 2B · Thinking | HMMT Nov 2025 | 19.6 | 30 / 31 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | SimpleQA | 13.2 | 31 / 40 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | HMMT Nov 2025 | 8.9 | 31 / 31 | Tracked evidence
Qwen 3 14b · Non-thinking | MATH 500 | 90 | 32 / 55 | Tracked evidence
Qwen 3.5 4B · Thinking | MMMU PRO | 66.3 | 32 / 52 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | MMMLU | 64.9 | 32 / 38 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | Arena-Hard | 43.1 | 32 / 40 | Tracked evidence
Qwen 3 30b A3b · Non-thinking | MATH 500 | 89.8 | 33 / 55 | Tracked evidence
Qwen 3.5 2B · Thinking | MMMLU | 63.1 | 33 / 38 | Tracked evidence
Qwen 3 30b A3b · Non-thinking | BFCL v3 | 58.6 | 33 / 49 | Tracked evidence
Qwen3-1.7B (Thinking) · Non-thinking | Arena-Hard | 36.9 | 33 / 40 | Tracked evidence
Qwen 3 235b A22b · Thinking | SimpleQA | 11 | 33 / 40 | Tracked evidence
Qwen 3 32b · Non-thinking | MATH 500 | 88.6 | 34 / 55 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | MMMLU | 57 | 34 / 38 | Tracked evidence
Qwen 3 8b · Non-thinking | MATH 500 | 87.4 | 35 / 55 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MMMU PRO | 63 | 35 / 52 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | BFCL v3 | 57.6 | 35 / 49 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MMMLU | 56.9 | 35 / 38 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | AIME 2024 | 48.3 | 36 / 69 | Tracked evidence
Qwen3-1.7B (Thinking) · Non-thinking | MMMLU | 46.7 | 36 / 38 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | MATH 500 | 84.8 | 37 / 55 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | BFCL v3 | 56.6 | 37 / 49 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MMMLU | 44.3 | 37 / 38 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MMMLU | 34.1 | 38 / 38 | Tracked evidence
Qwen 3.5 2B · Thinking | HMMT Feb 2025 | 22.9 | 39 / 44 | Tracked evidence
Qwen3-0.6B (Thinking) · Thinking | Arena-Hard | 8.5 | 39 / 40 | Tracked evidence
Qwen3 VL 4B · Thinking | MMMU PRO | 57 | 40 / 52 | Tracked evidence
Qwen3-1.7B (Thinking) · Non-thinking | BFCL v3 | 52.2 | 40 / 49 | Tracked evidence
Qwen3-0.6B (Thinking) · Non-thinking | Arena-Hard | 6.5 | 40 / 40 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | AIME 2024 | 40.1 | 42 / 69 | Tracked evidence
Qwen 3.5 2B · Thinking | MMMU PRO | 50.3 | 43 / 52 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | HMMT Feb 2025 | 11.9 | 43 / 44 | Tracked evidence
Qwen3-1.7B (Thinking) · Thinking | HMMT Feb 2025 | 10.2 | 44 / 44 | Tracked evidence
Qwen3-0.6B (Thinking) · Thinking | BFCL v3 | 46.4 | 45 / 49 | Tracked evidence
Qwen 3 30b A3b · Non-thinking | AIME 2024 | 32.8 | 45 / 69 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MMMU PRO | 47.7 | 46 / 52 | Tracked evidence
Qwen 3 235b A22b · thinking-2507 | BrowseComp | 4.6 | 46 / 51 | Tracked evidence
Qwen3-0.6B (Thinking) · Non-thinking | BFCL v3 | 44.1 | 47 / 49 | Tracked evidence
Qwen 3 14b · Non-thinking | AIME 2024 | 31.7 | 47 / 69 | Tracked evidence
Qwen3-0.6B (Thinking) · Thinking | MATH 500 | 77.6 | 48 / 55 | Tracked evidence
Qwen 3 32b · Non-thinking | AIME 2024 | 31 | 48 / 69 | Tracked evidence
Qwen3 VL 2B · Thinking | MMMU PRO | 42.5 | 49 / 52 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | BrowseComp | 2.3 | 49 / 51 | Tracked evidence
Qwen3-1.7B (Thinking) · Non-thinking | MATH 500 | 73 | 50 / 55 | Tracked evidence
Qwen 3 8b · Non-thinking | AIME 2024 | 29.1 | 50 / 69 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MMMU PRO | 31.4 | 51 / 52 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MMMU PRO | 31.2 | 52 / 52 | Tracked evidence
Qwen3-0.6B (Thinking) · Non-thinking | MATH 500 | 55.2 | 53 / 55 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | AIME 2024 | 25 | 53 / 69 | Tracked evidence
Qwen3-1.7B (Thinking) · Non-thinking | AIME 2024 | 13.4 | 60 / 69 | Tracked evidence
Qwen3-0.6B (Thinking) · Thinking | AIME 2024 | 10.7 | 61 / 69 | Tracked evidence
Qwen3-0.6B (Thinking) · Non-thinking | AIME 2024 | 3.4 | 68 / 69 | Tracked evidence

Coding

Model / Variant | Benchmark | Score | Rank | Scoring
Qwen 3.5 122b A10b · Non-Thinking | LiveCodeBench · v5 | 78.9 | 3 / 5 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | LiveCodeBench · 2024_07_2025_01 | 78.2 | 4 / 8 | In Quality Score
Qwen 3 235b A22b · Thinking | LiveCodeBench · 2024_08_2025_05 | 66.5 | 4 / 17 | In Quality Score
Qwen3 Max · Qwen3 Max | LiveCodeBench · v6 | 85.9 | 5 / 40 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | SWE-bench Verified · single_agentless | 39.4 | 5 / 7 | In Quality Score
Qwen 3.6 27B · Thinking | LiveCodeBench · v6 | 83.9 | 8 / 40 | In Quality Score
Qwen 3.5 397b A17b · Thinking | LiveCodeBench · v6 | 83.6 | 9 / 40 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | SWE-bench Verified · multilingual_single | 20.9 | 10 / 10 | In Quality Score
Qwen 3 235b A22b · Thinking | LiveCodeBench | 70.7 | 11 / 69 | In Quality Score
Qwen 3.5 27b · Thinking | LiveCodeBench · v6 | 80.7 | 13 / 40 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | Aider (Polyglot) | 61.8 | 13 / 45 | In Quality Score
Qwen 3 235b A22b · Thinking | Aider (Polyglot) | 61.8 | 14 / 45 | In Quality Score
Qwen 3.6 35B-A3B · Thinking | LiveCodeBench · v6 | 80.4 | 15 / 40 | In Quality Score
Qwen 3.5 122b A10b · Thinking | LiveCodeBench · v6 | 78.9 | 16 / 40 | In Quality Score
Qwen 3 32b · Thinking | LiveCodeBench | 65.7 | 16 / 69 | In Quality Score
Qwen 3 Coder 480B A35B Instruct · Non-thinking | GSO (Global Software Optimization) · opt_at_1 | 3.9 | 17 / 24 | In Quality Score
Qwen 3 235b A22b · Thinking | LiveCodeBench · v6 | 75.1 | 18 / 40 | In Quality Score
Qwen 3 14b · Thinking | LiveCodeBench | 63.5 | 18 / 69 | In Quality Score
Qwen 3.6 27B · Thinking | SWE-bench Verified | 77.2 | 19 / 68 | In Quality Score
Qwen 3.5 35b A3b · Thinking | LiveCodeBench · v6 | 74.6 | 20 / 40 | In Quality Score
Qwen 3 30b A3b · Thinking | LiveCodeBench | 62.6 | 21 / 69 | In Quality Score
Qwen 3.5 397b A17b · Thinking | SWE-bench Verified | 76.4 | 22 / 68 | In Quality Score
Qwen3 Next 80B A3B · Thinking | LiveCodeBench · v6 | 68.7 | 22 / 40 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | LiveCodeBench · v6 | 66 | 23 / 40 | In Quality Score
Qwen 3 8b · Thinking | LiveCodeBench | 57.5 | 23 / 69 | In Quality Score
Qwen 3.5 9B · Thinking | LiveCodeBench · v6 | 65.6 | 24 / 40 | In Quality Score
Qwen3 Max · Qwen3 Max | SWE-bench Verified | 75.3 | 25 / 68 | In Quality Score
Qwen 3.5 27b · Thinking | SWE-bench Verified | 75 | 26 / 68 | In Quality Score
Qwen 3 32b · Thinking | Aider (Polyglot) | 50.2 | 27 / 45 | In Quality Score
Qwen 3.5 4B · Thinking | LiveCodeBench · v6 | 55.8 | 29 / 40 | In Quality Score
Qwen3-4B (Thinking) · Thinking | LiveCodeBench | 54.2 | 29 / 69 | In Quality Score
Qwen 3.6 35B-A3B · Thinking | SWE-bench Verified | 73.4 | 32 / 68 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | LiveCodeBench · v6 | 37 | 36 / 40 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | LiveCodeBench | 35.3 | 40 / 69 | In Quality Score
Qwen 3.5 122b A10b · Thinking | SWE-bench Verified | 72 | 41 / 68 | In Quality Score
Qwen3-1.7B (Thinking) · Thinking | LiveCodeBench | 33.2 | 42 / 69 | In Quality Score
Qwen 3.5 35b A3b · Thinking | SWE-bench Verified | 69.2 | 44 / 68 | In Quality Score
Qwen 3 32b · Non-thinking | LiveCodeBench | 31.3 | 44 / 69 | In Quality Score
Qwen 3 30b A3b · Non-thinking | LiveCodeBench | 29.8 | 47 / 69 | In Quality Score
Qwen 3 14b · Non-thinking | LiveCodeBench | 29 | 50 / 69 | In Quality Score
Qwen 3 8b · Non-thinking | LiveCodeBench | 22.8 | 56 / 69 | In Quality Score
Qwen3-4B (Thinking) · Non-thinking | LiveCodeBench | 21.3 | 58 / 69 | In Quality Score
Qwen3-0.6B (Thinking) · Thinking | LiveCodeBench | 12.3 | 63 / 69 | In Quality Score
Qwen3-1.7B (Thinking) · Non-thinking | LiveCodeBench | 11.6 | 64 / 69 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | SWE-bench Verified | 34.4 | 65 / 68 | In Quality Score
Qwen 3 235b A22b · Thinking | SWE-bench Verified | 34.4 | 66 / 68 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | SWE-bench Verified | 22 | 68 / 68 | In Quality Score
Qwen3-0.6B (Thinking) · Non-thinking | LiveCodeBench | 3.6 | 68 / 69 | In Quality Score
Qwen 3.5 397b A17b · Thinking | SecCodeBench | 68.3 | 3 / 6 | Tracked evidence
Qwen 3.5 27b · Thinking | OJ-Bench | 40.1 | 4 / 19 | Tracked evidence
Qwen 3.6 Plus · Thinking | NL2Repo | 37.9 | 4 / 9 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | OJ-Bench | 39.5 | 5 / 19 | Tracked evidence
Qwen 3.6 27B · Thinking | NL2Repo | 36.2 | 5 / 9 | Tracked evidence
Qwen 3 235b A22b · Thinking | Codeforces | 2146 | 6 / 47 | Tracked evidence
Qwen3 Max · Qwen3 Max | SecCodeBench | 57.5 | 6 / 6 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | Codeforces | 2100 | 7 / 47 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | OJ-Bench | 36 | 7 / 19 | Tracked evidence
Qwen 3 235b A22b · Thinking | OJ-Bench | 32.7 | 8 / 19 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | OJ-Bench | 29.7 | 9 / 19 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | NL2Repo | 29.4 | 9 / 9 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | Codeforces | 2028 | 10 / 47 | Tracked evidence
Qwen 3.6 27B · Thinking | SWE-bench Multilingual | 71.3 | 10 / 18 | Tracked evidence
Qwen 3.5 9B · Thinking | OJ-Bench | 29.2 | 10 / 19 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | OJ-Bench | 25.1 | 12 / 19 | Tracked evidence
Qwen 3 32b · Thinking | Codeforces | 1977 | 13 / 47 | Tracked evidence
Qwen 3.5 4B · Thinking | OJ-Bench | 24.1 | 13 / 19 | Tracked evidence
Qwen 3 30b A3b · Thinking | Codeforces | 1974 | 14 / 47 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | SWE-bench Multilingual | 69.3 | 14 / 18 | Tracked evidence
Qwen 3.5 27b · Thinking | Codeforces | 1899 | 15 / 47 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | SWE-bench Multilingual | 67.2 | 15 / 18 | Tracked evidence
Qwen3 Max · Qwen3 Max | SWE-bench Multilingual | 66.7 | 16 / 18 | Tracked evidence
Qwen 3 8b · Thinking | Codeforces | 1785 | 18 / 47 | Tracked evidence
Qwen 3 14b · Thinking | Codeforces | 1766 | 19 / 47 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | OJ-Bench | 11.3 | 19 / 19 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | Codeforces | 1671 | 22 / 47 | Tracked evidence
Qwen 3 235b A22b · Non-Thinking | Codeforces | 1387 | 24 / 47 | Tracked evidence
Qwen 3 32b · Non-thinking | Codeforces | 1353 | 25 / 47 | Tracked evidence
Qwen 3 30b A3b · Non-thinking | Codeforces | 1267 | 28 / 47 | Tracked evidence
Qwen 3 14b · Non-thinking | Codeforces | 1200 | 29 / 47 | Tracked evidence
Qwen 3 8b · Non-thinking | Codeforces | 1110 | 32 / 47 | Tracked evidence
Qwen3-4B (Thinking) · Non-thinking | Codeforces | 842 | 40 / 47 | Tracked evidence

Agentic

Model / Variant | Benchmark | Score | Rank | Scoring
Qwen 3.6 Plus · Thinking | MCP Atlas · public_set | 74.1 | 1 / 13 | In Quality Score
Qwen 3.5 397b A17b · Thinking | τ²-bench · average | 86.7 | 7 / 30 | In Quality Score
Qwen3 Max · Qwen3 Max | τ²-bench · average | 84.6 | 10 / 30 | In Quality Score
Qwen 3.5 35b A3b · Thinking | τ²-bench · average | 81.2 | 11 / 30 | In Quality Score
Qwen 3.5 4B · Thinking | τ²-bench · average | 79.9 | 14 / 30 | In Quality Score
Qwen 3.6 35B-A3B · Thinking | MCP Atlas | 62.8 | 14 / 33 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | τ²-bench · airline | 58 | 15 / 29 | In Quality Score
Qwen 3.5 122b A10b · Thinking | τ²-bench · average | 79.5 | 16 / 30 | In Quality Score
Qwen 3.5 9B · Thinking | τ²-bench · average | 79.1 | 17 / 30 | In Quality Score
Qwen 3.5 27b · Thinking | τ²-bench · average | 79 | 18 / 30 | In Quality Score
Qwen 3 235b A22b · thinking-2507 | τ²-bench · retail | 71.9 | 20 / 34 | In Quality Score
Qwen 3 235b A22b · Thinking | τ²-bench · average | 58.5 | 23 / 30 | In Quality Score
Qwen3 Next 80B A3B · Thinking | τ²-bench · average | 57.4 | 24 / 30 | In Quality Score
Qwen 3.5 2B · Thinking | τ²-bench · average | 48.8 | 25 / 30 | In Quality Score
Qwen3-4B (Thinking) · Thinking | τ²-bench · average | 43.2 | 26 / 30 | In Quality Score
Qwen 3 235b A22b · Thinking | τ²-bench · airline | 34.7 | 27 / 29 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | τ²-bench · telecom | 22.1 | 27 / 28 | In Quality Score
Qwen 3 30b A3b · Thinking (2507) | τ²-bench · average | 41.9 | 28 / 30 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | τ²-bench · airline | 26.5 | 29 / 29 | In Quality Score
Qwen 3.5 0.8B · Thinking | τ²-bench · average | 11.6 | 30 / 30 | In Quality Score
Qwen 3 235b A22b · Thinking | τ²-bench · retail | 58.6 | 32 / 34 | In Quality Score
Qwen 3 235b A22b · Non-Thinking | τ²-bench · retail | 57 | 34 / 34 | In Quality Score
Qwen 3.5 397b A17b · Thinking | τ³-Bench · retail | 84.4 | 1 / 6 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | τ³-Bench · telecom | 97.8 | 2 / 6 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | τ³-Bench · airline | 81 | 2 / 6 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | BFCL v4 | 72.9 | 2 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | DeepPlanning | 34.3 | 2 / 16 | Tracked evidence
Qwen 3.6 Plus · Thinking | τ³-Bench | 70.7 | 3 / 10 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | WideSearch | 74 | 4 / 13 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | BFCL v4 | 72.2 | 4 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MCPMark | 46.1 | 4 / 8 | Tracked evidence
Qwen3 Max · Qwen3 Max | DeepPlanning | 28.7 | 4 / 16 | Tracked evidence
Qwen 3.5 27b · Thinking | BFCL v4 | 68.5 | 5 / 18 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | DeepPlanning | 25.9 | 5 / 16 | Tracked evidence
Qwen 3.5 27b · Thinking | WideSearch | 61.1 | 6 / 13 | Tracked evidence
Qwen 3.5 27b · Thinking | Seal-0 | 47.2 | 6 / 16 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | MCPMark | 37 | 6 / 8 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | DeepPlanning | 24.1 | 6 / 16 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | τ³-Bench · banking | 9.8 | 6 / 6 | Tracked evidence
Qwen3 Max · Qwen3 Max | BFCL v4 | 67.7 | 7 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | WideSearch | 60.5 | 7 / 13 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | Seal-0 | 46.9 | 7 / 16 | Tracked evidence
Qwen3 Max · Qwen3 Max | MCPMark | 33.5 | 7 / 8 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | BFCL v4 | 67.3 | 8 / 18 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | τ³-Bench | 67.2 | 8 / 10 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | WideSearch | 60.1 | 8 / 13 | Tracked evidence
Qwen3 Max · Qwen3 Max | Seal-0 | 46.9 | 8 / 16 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | DeepPlanning | 22.8 | 8 / 16 | Tracked evidence
Qwen 3.5 9B · Thinking | BFCL v4 | 66.1 | 9 / 18 | Tracked evidence
Qwen3 Max · Qwen3 Max | WideSearch | 57.9 | 9 / 13 | Tracked evidence
Qwen 3.5 27b · Thinking | DeepPlanning | 22.6 | 9 / 16 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | WideSearch | 57.1 | 10 / 13 | Tracked evidence
Qwen 3.5 9B · Thinking | DeepPlanning | 18 | 10 / 16 | Tracked evidence
Qwen 3 235b A22b · Thinking | BFCL v4 | 54.8 | 12 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | Seal-0 | 44.1 | 12 / 16 | Tracked evidence
Qwen 3.5 4B · Thinking | DeepPlanning | 17.6 | 12 / 16 | Tracked evidence
Qwen 3.5 4B · Thinking | BFCL v4 | 50.3 | 13 / 18 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | Seal-0 | 41.4 | 13 / 16 | Tracked evidence
Qwen 3 235b A22b · Thinking | DeepPlanning | 17.1 | 13 / 16 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | BFCL v4 | 49.7 | 14 / 18 | Tracked evidence
Qwen 3.5 2B · Thinking | BFCL v4 | 43.6 | 15 / 18 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | DeepPlanning | 4.9 | 15 / 16 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | OSWorld · verified | 62.2 | 16 / 27 | Tracked evidence
Qwen 3 30b A3b · Thinking (2507) | BFCL v4 | 42.4 | 16 / 18 | Tracked evidence
Qwen3 Next 80B A3B · Thinking | DeepPlanning | 0.4 | 16 / 16 | Tracked evidence
Qwen3-4B (Thinking) · Thinking | BFCL v4 | 39.9 | 17 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | OSWorld · verified | 58 | 18 / 27 | Tracked evidence
Qwen 3.6 Plus · Thinking | Toolathlon | 39.8 | 18 / 31 | Tracked evidence
Qwen 3.5 0.8B · Thinking | BFCL v4 | 25.3 | 18 / 18 | Tracked evidence
Qwen 3.5 27b · Thinking | OSWorld · verified | 56.2 | 19 / 27 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | OSWorld · verified | 54.5 | 20 / 27 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | Toolathlon | 38.3 | 20 / 31 | Tracked evidence
Qwen 3.5 9B · Thinking | OSWorld · verified | 41.8 | 23 / 27 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | OSWorld · verified | 38.1 | 25 / 27 | Tracked evidence
Qwen 3.5 4B · Thinking | OSWorld · verified | 35.6 | 26 / 27 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | OSWorld · verified | 30.6 | 27 / 27 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | Toolathlon | 26.9 | 27 / 31 | Tracked evidence
Qwen3 Max · Qwen3 Max | Toolathlon | 18.8 | 29 / 31 | Tracked evidence

Multimodal

Model / Variant | Benchmark | Score | Rank | Scoring
Qwen 3.5 27b · Thinking | CountBench | 97.8 | 1 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | VLMs Are Blind | 97 | 1 / 18 | Tracked evidence
Qwen 3.6 27B · Thinking | RefCOCO · avg | 92.5 | 1 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MathVista · mini | 90.3 | 1 / 36 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MathVision | 88.6 | 1 / 17 | Tracked evidence
Qwen 3.5 27b · Thinking | DynaMath | 87.7 | 1 / 23 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MLVU · mavg | 87.3 | 1 / 22 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | RealWorldQA | 85.3 | 1 / 24 | Tracked evidence
Qwen 3.6 27B · Thinking | EmbSpatialBench | 84.6 | 1 / 24 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MMStar | 83.8 | 1 / 33 | Tracked evidence
Qwen 3.5 27b · Thinking | LingoQA | 82 | 1 / 16 | Tracked evidence
Qwen3 VL 32B · Thinking | MathVerse · mini | 78.2 | 1 / 10 | Tracked evidence
Qwen3 VL 32B · Thinking | HallusionBench | 76.6 | 1 / 33 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | RefSpatialBench | 73.6 | 1 / 21 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | MathVision · mini | 60.5 | 1 / 10 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | BabyVision | 52.3 | 1 / 22 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | ODinW · 13 | 50.8 | 1 / 13 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | ZEROBench · sub | 41 | 1 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | CountBench | 97.8 | 2 / 23 | Tracked evidence
Qwen 3.6 27B · Thinking | VLMs Are Blind | 97 | 2 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | V* | 95.8 | 2 / 23 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | AI2D · test | 93.9 | 2 / 33 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | RefCOCO · avg | 92.3 | 2 / 18 | Tracked evidence
Qwen 3.6 27B · Thinking | VideoMME · with_sub | 87.7 | 2 / 22 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MLVU · mavg | 86.7 | 2 / 22 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | RealWorldQA | 85.1 | 2 / 24 | Tracked evidence
Qwen 3.5 27b · Thinking | EmbSpatialBench | 84.5 | 2 / 24 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | ChartQA · test | 84 | 2 / 10 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | SLAKE | 81.6 | 2 / 22 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | LingoQA | 81.6 | 2 / 16 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MVBench | 77.6 | 2 / 18 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | HallusionBench | 74.9 | 2 / 33 | Tracked evidence
Qwen3 VL 8B · Thinking | MathVerse · mini | 73.3 | 2 / 10 | Tracked evidence
Qwen 3.6 27B · Thinking | RefSpatialBench | 70 | 2 / 21 | Tracked evidence
Qwen3 VL 32B · Thinking | MathVision · mini | 58.6 | 2 / 10 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | ODinW · 13 | 47 | 2 / 13 | Tracked evidence
Qwen 3.6 27B · Thinking | CountBench | 97.8 | 3 / 23 | Tracked evidence
Qwen 3.5 27b · Thinking | VLMs Are Blind | 96.9 | 3 / 18 | Tracked evidence
Qwen 3.6 27B · Thinking | V* | 94.7 | 3 / 23 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MMBench · en_dev_v1_1 | 93.7 | 3 / 24 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | AI2D · test | 93.3 | 3 / 33 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | RefCOCO · avg | 92 | 3 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | VideoMME · with_sub | 87.5 | 3 / 22 | Tracked evidence
Qwen 3.6 27B · Thinking | MLVU · mavg | 86.6 | 3 / 22 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | DynaMath | 86.3 | 3 / 23 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | EmbSpatialBench | 84.5 | 3 / 24 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | RealWorldQA | 84.1 | 3 / 24 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | VideoMME · without_sub | 83.9 | 3 / 21 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | ChartQA · test | 83.2 | 3 / 10 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MMStar | 82.9 | 3 / 33 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | LingoQA | 80.8 | 3 / 16 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | VideoMME | 79 | 3 / 4 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MVBench | 76.6 | 3 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | LVBench | 75.5 | 3 / 18 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | HallusionBench | 74.1 | 3 / 33 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | RefSpatialBench | 69.9 | 3 / 21 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | ERQA | 67.5 | 3 / 27 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | MathVerse · mini | 64.2 | 3 / 10 | Tracked evidence
Qwen3 VL 8B · Thinking | MathVision · mini | 50.7 | 3 / 10 | Tracked evidence
Qwen 3.5 27b · Thinking | BabyVision | 44.6 | 3 / 22 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | ZEROBench · sub | 36.2 | 3 / 23 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | ZEROBench | 12 | 3 / 27 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | VLMs Are Blind | 96.7 | 4 / 18 | Tracked evidence
Qwen 3.5 27b · Thinking | V* | 93.7 | 4 / 23 | Tracked evidence
Qwen 3.5 27b · Thinking | AI2D · test | 92.9 | 4 / 33 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MMBench · en_dev_v1_1 | 92.8 | 4 / 24 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | RefCOCO · avg | 91.3 | 4 / 18 | Tracked evidence
Qwen 3.5 27b · Thinking | MathVista · mini | 87.8 | 4 / 36 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MathVision | 86.2 | 4 / 17 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | MLVU · mavg | 86.2 | 4 / 22 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | DynaMath | 85.9 | 4 / 23 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | EmbSpatialBench | 84.3 | 4 / 24 | Tracked evidence
Qwen 3.6 27B · Thinking | RealWorldQA | 84.1 | 4 / 24 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | VideoMME · without_sub | 83.7 | 4 / 21 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MMStar | 81.9 | 4 / 33 | Tracked evidence
Qwen 3.5 9B · Thinking | LingoQA | 80.4 | 4 / 16 | Tracked evidence
Qwen 3.5 27b · Thinking | SLAKE | 80 | 4 / 22 | Tracked evidence
Qwen 3.6 27B · Thinking | MVBench | 75.5 | 4 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | LVBench | 74.4 | 4 / 18 | Tracked evidence
Qwen3 VL 8B · Thinking | HallusionBench | 73 | 4 / 33 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | ScreenSpot-Pro | 70.4 | 4 / 24 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | RefSpatialBench | 69.3 | 4 / 21 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | ODinW · 13 | 44.5 | 4 / 13 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | BabyVision | 40.2 | 4 / 22 | Tracked evidence
Qwen 3.5 27b · Thinking | ZEROBench · sub | 36.2 | 4 / 23 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | CountBench | 97.2 | 5 / 23 | Tracked evidence
Qwen 3.5 9B · Thinking | VLMs Are Blind | 93.7 | 5 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | V* | 93.2 | 5 / 23 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | MMBench · en_dev_v1_1 | 92.8 | 5 / 24 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | AI2D · test | 92.7 | 5 / 33 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | RefCOCO · avg | 91.1 | 5 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MathVista · mini | 87.4 | 5 / 36 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | VideoMME · with_sub | 87.3 | 5 / 22 | Tracked evidence
Qwen 3.5 27b · Thinking | MathVision | 86 | 5 / 17 | Tracked evidence
Qwen 3.5 27b · Thinking | MLVU · mavg | 85.9 | 5 / 22 | Tracked evidence
Qwen 3.6 27B · Thinking | DynaMath | 85.6 | 5 / 23 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | EmbSpatialBench | 84.3 | 5 / 24 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | RealWorldQA | 83.9 | 5 / 24 | Tracked evidence
Qwen 3.6 27B · Thinking | MMStar | 81.4 | 5 / 33 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | SLAKE | 79.9 | 5 / 22 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | LingoQA | 79.2 | 5 / 16 | Tracked evidence
Qwen3 VL 32B · Thinking | ChartQA · test | 79.1 | 5 / 10 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MMVU | 75.4 | 5 / 20 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MVBench | 75.2 | 5 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | HallusionBench | 71.4 | 5 / 33 | Tracked evidence
Qwen 3.5 27b · Thinking | ScreenSpot-Pro | 70.3 | 5 / 24 | Tracked evidence
Qwen 3.5 27b · Thinking | RefSpatialBench | 67.7 | 5 / 21 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | SimpleVQA | 67.1 | 5 / 29 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | ERQA | 64.8 | 5 / 27 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | MathVerse · mini | 57.4 | 5 / 10 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | MathVision · mini | 50 | 5 / 10 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | ODinW · 13 | 43.2 | 5 / 13 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | ZEROBench · sub | 34.4 | 5 / 23 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | WorldVQA | 23.5 | 5 / 5 | Tracked evidence
Qwen 3.5 9B · Thinking | CountBench | 97.2 | 6 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | V* | 92.7 | 6 / 23 | Tracked evidence
Qwen 3.5 4B · Thinking | VLMs Are Blind | 92.6 | 6 / 18 | Tracked evidence
Qwen 3.5 27b · Thinking | MMBench · en_dev_v1_1 | 92.6 | 6 / 24 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | AI2D · test | 92.6 | 6 / 33 | Tracked evidence
Qwen 3.5 27b · Thinking | RefCOCO · avg | 90.9 | 6 / 18 | Tracked evidence
Qwen 3.6 27B · Thinking | MathVista · mini | 87.4 | 6 / 36 | Tracked evidence
Qwen 3.5 27b · Thinking | VideoMME · with_sub | 87 | 6 / 22 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | Video-MMMU | 84.7 | 6 / 28 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | EmbSpatialBench | 83.9 | 6 / 24 | Tracked evidence
Qwen 3.5 27b · Thinking | RealWorldQA | 83.7 | 6 / 24 | Tracked evidence
Qwen 3.5 27b · Thinking | VideoMME · without_sub | 82.8 | 6 / 21 | Tracked evidence
Qwen 3.5 27b · Thinking | MMStar | 81 | 6 / 33 | Tracked evidence
Qwen 3.5 9B · Thinking | SLAKE | 79 | 6 / 22 | Tracked evidence
Qwen3 VL 8B · Thinking | ChartQA · test | 78.6 | 6 / 10 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MVBench | 74.8 | 6 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MMVU | 74.7 | 6 / 20 | Tracked evidence
Qwen 3.5 27b · Thinking | LVBench | 73.6 | 6 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MedXpertQA · mm | 70 | 6 / 31 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | ODinW · 13 | 42.6 | 6 / 13 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | BabyVision | 38.4 | 6 / 22 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | ZEROBench · sub | 34.1 | 6 / 23 | Tracked evidence
Qwen 3.5 27b · Thinking | ZEROBench | 10 | 6 / 27 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | CountBench | 97 | 7 / 23 | Tracked evidence
Qwen 3.6 27B · Thinking | MMBench · en_dev_v1_1 | 92.3 | 7 / 24 | Tracked evidence
Qwen 3.5 122b A10b · Non-Thinking | V* | 90.1 | 7 / 23 | Tracked evidence
Qwen 3.5 9B · Thinking | RefCOCO · avg | 89.7 | 7 / 18 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | VideoMME · with_sub | 86.6 | 7 / 22 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | MathVista · mini | 86.4 | 7 / 36 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MLVU · mavg | 85.6 | 7 / 22 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | DynaMath | 85 | 7 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MathVision | 83.9 | 7 / 17 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | EmbSpatialBench | 83.1 | 7 / 24 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | VideoMME · without_sub | 82.5 | 7 / 21 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | SLAKE | 78.7 | 7 / 22 | Tracked evidence
Qwen 3.5 27b · Thinking | MVBench | 74.6 | 7 / 18 | Tracked evidence
Qwen 3.5 4B · Thinking | LingoQA | 74.4 | 7 / 16 | Tracked evidence
Qwen 3.5 27b · Thinking | MMVU | 73.3 | 7 / 20 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | LVBench | 71.4 | 7 / 18 | Tracked evidence
Qwen 3.5 27b · Thinking | HallusionBench | 70 | 7 / 33 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | ScreenSpot-Pro | 68.6 | 7 / 24 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MedXpertQA · mm | 67.3 | 7 / 31 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | RefSpatialBench | 64.3 | 7 / 21 | Tracked evidence
Qwen 3.6 27B · Thinking | ERQA | 62.5 | 7 / 27 | Tracked evidence
Qwen 3.5 27b · Thinking | ODinW · 13 | 41.1 | 7 / 13 | Tracked evidence
Qwen 3.5 4B · Thinking | CountBench | 96.3 | 8 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MMBench · en_dev_v1_1 | 91.5 | 8 / 24 | Tracked evidence
Qwen 3.5 9B · Thinking | V* | 90.1 | 8 / 23 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | RefCOCO · avg | 89.3 | 8 / 18 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | VideoMME · with_sub | 86.6 | 8 / 22 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MathVista · mini | 86.2 | 8 / 36 | Tracked evidence
Qwen 3.6 27B · Thinking | Video-MMMU | 84.4 | 8 / 28 | Tracked evidence
Qwen 3.5 9B · Thinking | EmbSpatialBench | 83 | 8 / 24 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | VideoMME · without_sub | 82.5 | 8 / 21 | Tracked evidence
Qwen 3.5 9B · Thinking | MMStar | 79.7 | 8 / 33 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | VLMs Are Blind | 79.5 | 8 / 18 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | MVBench | 74.6 | 8 / 18 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MMVU | 72.3 | 8 / 20 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | LVBench | 71.4 | 8 / 18 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | ScreenSpot-Pro | 65.6 | 8 / 24 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | RefSpatialBench | 63.5 | 8 / 21 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | ERQA | 62 | 8 / 27 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | SimpleVQA | 61.7 | 8 / 29 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | ODinW · 13 | 40.5 | 8 / 13 | Tracked evidence
Qwen 3.5 27b · Non-Thinking | BabyVision | 34.8 | 8 / 22 | Tracked evidence
Qwen 3.5 9B · Thinking | AI2D · test | 90.2 | 9 / 33 | Tracked evidence
Qwen 3.5 9B · Thinking | MMBench · en_dev_v1_1 | 90.1 | 9 / 24 | Tracked evidence
Qwen 3.5 35b A3b · Non-Thinking | V* | 89.5 | 9 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | RefCOCO · avg | 89.2 | 9 / 18 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MathVista · mini | 85.8 | 9 / 36 | Tracked evidence
Qwen 3.5 9B · Thinking | MLVU · mavg | 84.4 | 9 / 22 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | Video-MMMU | 83.7 | 9 / 28 | Tracked evidence
Qwen 3.5 9B · Thinking | DynaMath | 83.6 | 9 / 23 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | RealWorldQA | 81.3 | 9 / 24 | Tracked evidence
Qwen 3.5 9B · Thinking | MathVision | 78.9 | 9 / 17 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MMStar | 78.7 | 9 / 33 | Tracked evidence
Qwen 3.5 9B · Thinking | MVBench | 74.4 | 9 / 18 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MMVU | 71.1 | 9 / 20 | Tracked evidence
Qwen 3.5 9B · Thinking | LVBench | 70 | 9 / 18 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | HallusionBench | 69.8 | 9 / 33 | Tracked evidence
Qwen 3.5 9B · Thinking | ScreenSpot-Pro | 65.2 | 9 / 24 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | SimpleVQA | 61.3 | 9 / 29 | Tracked evidence
Qwen 3.5 27b · Thinking | ERQA | 60.5 | 9 / 27 | Tracked evidence
Qwen 3.5 9B · Thinking | RefSpatialBench | 58.5 | 9 / 21 | Tracked evidence
Qwen3 VL 4B · Thinking | ODinW · 13 | 39.4 | 9 / 13 | Tracked evidence
Qwen 3.5 122b A10b · Non-Thinking | BabyVision | 34.5 | 9 / 22 | Tracked evidence
Qwen 3.5 9B · Thinking | ZEROBench · sub | 31.1 | 9 / 23 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | ZEROBench | 9 | 9 / 27 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | CountBench | 93.7 | 10 / 23 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MMBench · en_dev_v1_1 | 89.7 | 10 / 24 | Tracked evidence
Qwen 3.5 4B · Thinking | AI2D · test | 89.6 | 10 / 33 | Tracked evidence
Qwen 3.5 27b · Non-Thinking | V* | 89 | 10 / 23 | Tracked evidence
Qwen3 VL 4B · Thinking | RefCOCO · avg | 88.2 | 10 / 18 | Tracked evidence
Qwen 3.5 9B · Thinking | MathVista · mini | 85.7 | 10 / 36 | Tracked evidence
Qwen 3.5 9B · Thinking | VideoMME · with_sub | 84.5 | 10 / 22 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MLVU · mavg | 83.8 | 10 / 22 | Tracked evidence
Qwen 3.5 4B · Thinking | DynaMath | 83.3 | 10 / 23 | Tracked evidence
Qwen 3.5 4B · Thinking | EmbSpatialBench | 81.3 | 10 / 24 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | CharXiv Reasoning | 80.8 | 10 / 48 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | VideoMME · without_sub | 79 | 10 / 21 | Tracked evidence
Qwen 3.5 4B · Thinking | MMStar | 78.3 | 10 / 33 | Tracked evidence
Qwen 3.5 4B · Thinking | SLAKE | 76.1 | 10 / 22 | Tracked evidence
Qwen 3.5 2B · Thinking | VLMs Are Blind | 75.8 | 10 / 18 | Tracked evidence
Qwen 3.5 4B · Thinking | MathVision | 74.6 | 10 / 17 | Tracked evidence
Qwen 3.5 9B · Thinking | HallusionBench | 69.3 | 10 / 33 | Tracked evidence
Qwen 3.5 4B · Thinking | LVBench | 66.4 | 10 / 18 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | ScreenSpot-Pro | 62 | 10 / 24 | Tracked evidence
Qwen 3.5 4B · Thinking | RefSpatialBench | 54.6 | 10 / 21 | Tracked evidence
Qwen3 VL 2B · Thinking | ODinW · 13 | 36 | 10 / 13 | Tracked evidence
Qwen 3.5 4B · Thinking | MMBench · en_dev_v1_1 | 89.4 | 11 / 24 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | AI2D · test | 89.2 | 11 / 33 | Tracked evidence
Qwen 3.5 9B · Non-Thinking | V* | 88.5 | 11 / 23 | Tracked evidence
Qwen 3.5 4B · Thinking | RefCOCO · avg | 88.1 | 11 / 18 | Tracked evidence
Qwen 3.5 4B · Thinking | MathVista · mini | 85.1 | 11 / 36 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | VideoMME · with_sub | 83.8 | 11 / 22 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | DynaMath | 82.8 | 11 / 23 | Tracked evidence
Qwen 3.5 9B · Thinking | RealWorldQA | 80.3 | 11 / 24 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MathVision | 74.6 | 11 / 17 | Tracked evidence
Qwen 3.5 2B · Thinking | SLAKE | 74.4 | 11 / 22 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | VLMs Are Blind | 74.3 | 11 / 18 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | LingoQA | 66.8 | 11 / 16 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | LVBench | 63.6 | 11 / 18 | Tracked evidence
Qwen 3.6 35B-A3B · Thinking | SimpleVQA | 58.9 | 11 / 29 | Tracked evidence
Qwen 3.5 9B · Thinking | ERQA | 55.5 | 11 / 27 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | RefSpatialBench | 54.2 | 11 / 21 | Tracked evidence
Qwen 3.5 2B · Thinking | ODinW · 13 | 35.9 | 11 / 13 | Tracked evidence
Qwen 3.5 35b A3b · Non-Thinking | BabyVision | 29.6 | 11 / 22 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | ZEROBench · sub | 28.4 | 11 / 23 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | ZEROBench | 8 | 11 / 27 | Tracked evidence
Qwen 3.5 2B · Thinking | CountBench | 91.4 | 12 / 23 | Tracked evidence
Qwen3 VL 32B · Thinking | MathVista · mini | 83.8 | 12 / 36 | Tracked evidence
Qwen 3.5 27b · Thinking | Video-MMMU | 82.3 | 12 / 28 | Tracked evidence
Qwen3 VL 4B · Thinking | EmbSpatialBench | 80.7 | 12 / 24 | Tracked evidence
Qwen 3.5 4B · Thinking | RealWorldQA | 79.5 | 12 / 24 | Tracked evidence
Qwen 3.5 9B · Thinking | VideoMME · without_sub | 78.4 | 12 / 21 | Tracked evidence
Qwen3 VL 32B · Thinking | MMStar | 75.7 | 12 / 33 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | VLMs Are Blind | 72.5 | 12 / 18 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MVBench | 72 | 12 / 18 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | HallusionBench | 67.9 | 12 / 33 | Tracked evidence
Qwen 3.5 9B · Thinking | MMVU | 67.8 | 12 / 20 | Tracked evidence
Qwen 3.5 27b · Thinking | MedXpertQA · mm | 62.4 | 12 / 31 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | ScreenSpot-Pro | 60.5 | 12 / 24 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | SimpleVQA | 58.3 | 12 / 29 | Tracked evidence
Qwen3 VL 4B · Thinking | RefSpatialBench | 45.3 | 12 / 21 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | ODinW · 13 | 33.2 | 12 / 13 | Tracked evidence
Qwen 3.5 9B · Thinking | BabyVision | 28.6 | 12 / 22 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MMBench · en_dev_v1_1 | 88.9 | 13 / 24 | Tracked evidence
Qwen 3.5 4B · Non-Thinking | V* | 86.4 | 13 / 23 | Tracked evidence
Qwen 3.5 2B · Thinking | RefCOCO · avg | 84.8 | 13 / 18 | Tracked evidence
Qwen 3.5 4B · Thinking | VideoMME · with_sub | 83.5 | 13 / 22 | Tracked evidence
Qwen 3.5 4B · Thinking | MLVU · mavg | 82.8 | 13 / 22 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | Video-MMMU | 82 | 13 / 28 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | EmbSpatialBench | 80.6 | 13 / 24 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | DynaMath | 80.1 | 13 / 23 | Tracked evidence
Qwen 3.5 27b · Thinking | CharXiv Reasoning | 79.5 | 13 / 48 | Tracked evidence
Qwen 3.5 4B · Thinking | VideoMME · without_sub | 76.9 | 13 / 21 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MMStar | 75.5 | 13 / 33 | Tracked evidence
Qwen 3.5 4B · Thinking | MVBench | 71.2 | 13 / 18 | Tracked evidence
Qwen3 VL 4B · Thinking | VLMs Are Blind | 68.6 | 13 / 18 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | HallusionBench | 67.6 | 13 / 33 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MMVU | 66.1 | 13 / 20 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | LingoQA | 62 | 13 / 16 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MedXpertQA · mm | 61.4 | 13 / 31 | Tracked evidence
Qwen 3.5 4B · Thinking | ScreenSpot-Pro | 60.3 | 13 / 24 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | LVBench | 59.2 | 13 / 18 | Tracked evidence
Qwen 3.5 2B · Thinking | RefSpatialBench | 32.9 | 13 / 21 | Tracked evidence
Qwen 3.5 0.8B · Thinking | ODinW · 13 | 31.6 | 13 / 13 | Tracked evidence
Qwen 3.5 9B · Non-Thinking | BabyVision | 25.8 | 13 / 22 | Tracked evidence
Qwen3 VL 32B · ThinkingAI2D · test87.214 / 33Tracked evidence
Qwen3 VL 235B A22B · ThinkingV*85.914 / 23Tracked evidence
Qwen3 VL 2B · ThinkingRefCOCO · avg84.814 / 18Tracked evidence
Qwen 3.5 2B · ThinkingEmbSpatialBench77.914 / 24Tracked evidence
Qwen3 VL 30B A3B · ThinkingRealWorldQA77.414 / 24Tracked evidence
Qwen3 VL 32B · Non-ThinkingMMStar74.314 / 33Tracked evidence
Qwen3 VL 4B · ThinkingMVBench69.314 / 18Tracked evidence
Qwen3 VL 30B A3B · ThinkingSLAKE68.814 / 22Tracked evidence
Qwen3 VL 235B A22B · ThinkingHallusionBench66.714 / 33Tracked evidence
Qwen3 VL 4B · ThinkingScreenSpot-Pro59.514 / 24Tracked evidence
Qwen 3.5 4B · ThinkingERQA5414 / 27Tracked evidence
Qwen 3.5 2B · Non-ThinkingRefSpatialBench3014 / 21Tracked evidence
Qwen 3.5 4B · ThinkingZEROBench · sub26.314 / 23Tracked evidence
Qwen3 VL 235B A22B · ThinkingBabyVision22.214 / 22Tracked evidence
Qwen3 VL 235B A22B · ThinkingZEROBench414 / 27Tracked evidence
Qwen 3.5 2B · Non-ThinkingRefCOCO · avg84.315 / 18Tracked evidence
Qwen 3.5 4B · ThinkingV*84.315 / 23Tracked evidence
Qwen3 VL 30B A3B · ThinkingMathVista · mini81.915 / 36Tracked evidence
Qwen 3.5 35b A3b · ThinkingVideo-MMMU80.415 / 28Tracked evidence
Qwen3 VL 30B A3B · ThinkingVideoMME · with_sub79.915 / 22Tracked evidence
Qwen3 VL 30B A3B · ThinkingMLVU · mavg78.915 / 22Tracked evidence
Qwen3 VL 30B A3B · ThinkingVideoMME · without_sub73.315 / 21Tracked evidence
Qwen 3.5 2B · Non-ThinkingSLAKE67.515 / 22Tracked evidence
Qwen3 VL 30B A3B · ThinkingHallusionBench6615 / 33Tracked evidence
Qwen3 VL 30B A3B · ThinkingMathVision65.715 / 17Tracked evidence
Qwen 3.5 4B · ThinkingMMVU64.915 / 20Tracked evidence
Qwen 3.5 2B · ThinkingLVBench57.115 / 18Tracked evidence
Qwen3 VL 235B A22B · ThinkingERQA52.515 / 27Tracked evidence
Qwen3 VL 2B · ThinkingRefSpatialBench28.915 / 21Tracked evidence
Qwen3 VL 30B A3B · ThinkingZEROBench · sub23.715 / 23Tracked evidence
Qwen3 VL 30B A3B · ThinkingCountBench9016 / 23Tracked evidence
Qwen3 VL 30B A3B · ThinkingAI2D · test86.916 / 33Tracked evidence
Qwen3 VL 30B A3B · ThinkingV*83.216 / 23Tracked evidence
Qwen3 VL 32B · Non-ThinkingMathVista · mini81.816 / 36Tracked evidence
Qwen3 VL 235B A22B · ThinkingVideo-MMMU8016 / 28Tracked evidence
Qwen 3.6 27B · ThinkingCharXiv Reasoning78.416 / 48Tracked evidence
Qwen3 VL 2B · ThinkingEmbSpatialBench75.916 / 24Tracked evidence
Qwen 3.5 2B · ThinkingRealWorldQA74.516 / 24Tracked evidence
Qwen3 VL 4B · ThinkingSLAKE65.916 / 22Tracked evidence
Qwen 3.5 2B · ThinkingMVBench64.916 / 18Tracked evidence
Qwen 3.5 0.8B · ThinkingVLMs Are Blind59.416 / 18Tracked evidence
Qwen 3.6 27B · ThinkingSimpleVQA56.116 / 29Tracked evidence
Qwen 3.5 2B · Non-ThinkingScreenSpot-Pro54.516 / 24Tracked evidence
Qwen3 VL 4B · ThinkingLVBench53.516 / 18Tracked evidence
Qwen 3.5 0.8B · ThinkingRefSpatialBench23.516 / 21Tracked evidence
Qwen 3.5 4B · Non-ThinkingBabyVision19.116 / 22Tracked evidence
Qwen3 VL 4B · ThinkingCountBench89.417 / 23Tracked evidence
Qwen3 VL 4B · ThinkingMMBench · en_dev_v1_186.717 / 24Tracked evidence
Qwen 3.5 0.8B · ThinkingRefCOCO · avg79.317 / 18Tracked evidence
Qwen 3.6 35B-A3B · ThinkingCharXiv Reasoning7817 / 48Tracked evidence
Qwen 3.5 2B · ThinkingMLVU · mavg76.217 / 22Tracked evidence
Qwen3 VL 4B · ThinkingVideoMME · with_sub7617 / 22Tracked evidence
Qwen3 VL 4B · ThinkingDynaMath74.417 / 23Tracked evidence
Qwen3 VL 4B · ThinkingRealWorldQA73.217 / 24Tracked evidence
Qwen 3.5 2B · ThinkingVideoMME · without_sub6917 / 21Tracked evidence
Qwen3 VL 2B · ThinkingMVBench64.517 / 18Tracked evidence
Qwen3 VL 4B · ThinkingMMVU58.617 / 20Tracked evidence
Qwen 3.5 0.8B · Non-ThinkingVLMs Are Blind57.317 / 18Tracked evidence
Qwen 3.5 27b · ThinkingSimpleVQA5617 / 29Tracked evidence
Qwen 3.5 9B · ThinkingMedXpertQA · mm49.917 / 31Tracked evidence
Qwen3 VL 2B · ThinkingScreenSpot-Pro48.517 / 24Tracked evidence
Qwen3 VL 2B · ThinkingLVBench47.617 / 18Tracked evidence
Qwen3 VL 4B · ThinkingERQA47.317 / 27Tracked evidence
Qwen 3.5 0.8B · Non-ThinkingRefSpatialBench21.717 / 21Tracked evidence
Qwen 3.5 4B · ThinkingZEROBench317 / 27Tracked evidence
Qwen 3.5 2B · Non-Thinking | CountBench | 86.8 | 18 / 23 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | AI2D · test | 85 | 18 / 33 | Tracked evidence
Qwen 3.5 2B · Thinking | MMBench · en_dev_v1_1 | 83.3 | 18 / 24 | Tracked evidence
Qwen 3.5 9B · Thinking | Video-MMMU | 78.9 | 18 / 28 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | RefCOCO · avg | 77.8 | 18 / 18 | Tracked evidence
Qwen3 VL 4B · Thinking | MLVU · mavg | 75.7 | 18 / 22 | Tracked evidence
Qwen 3.5 2B · Thinking | VideoMME · with_sub | 75.6 | 18 / 22 | Tracked evidence
Qwen 3.5 2B · Thinking | DynaMath | 73.6 | 18 / 23 | Tracked evidence
Qwen3 VL 4B · Thinking | MMStar | 73.2 | 18 / 33 | Tracked evidence
Qwen3 VL 4B · Thinking | VideoMME · without_sub | 68.9 | 18 / 21 | Tracked evidence
Qwen 3.5 0.8B · Thinking | SLAKE | 62.6 | 18 / 22 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MVBench | 55.8 | 18 / 18 | Tracked evidence
Qwen3 VL 2B · Thinking | VLMs Are Blind | 50 | 18 / 18 | Tracked evidence
Qwen3 VL 2B · Thinking | MMVU | 48.9 | 18 / 20 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MedXpertQA · mm | 47.6 | 18 / 31 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | ScreenSpot-Pro | 46.5 | 18 / 24 | Tracked evidence
Qwen 3.5 0.8B · Thinking | LVBench | 45.1 | 18 / 18 | Tracked evidence
Qwen3 VL 4B · Thinking | ZEROBench · sub | 18.9 | 18 / 23 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | BabyVision | 18.6 | 18 / 22 | Tracked evidence
Qwen 3.5 9B · Thinking | ZEROBench | 3 | 18 / 27 | Tracked evidence
Qwen3 VL 4B · Thinking | AI2D · test | 84.9 | 19 / 33 | Tracked evidence
Qwen3 VL 2B · Thinking | CountBench | 84.1 | 19 / 23 | Tracked evidence
Qwen3 VL 4B · Thinking | MathVista · mini | 79.5 | 19 / 36 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | CharXiv Reasoning | 77.5 | 19 / 48 | Tracked evidence
Qwen3 VL 8B · Thinking | MMStar | 72.3 | 19 / 33 | Tracked evidence
Qwen 3.5 4B · Thinking | HallusionBench | 65 | 19 / 33 | Tracked evidence
Qwen3 VL 2B · Thinking | SLAKE | 61.1 | 19 / 22 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | SimpleVQA | 54.3 | 19 / 29 | Tracked evidence
Qwen 3.5 2B · Thinking | MMVU | 48.6 | 19 / 20 | Tracked evidence
Qwen 3.5 4B · Thinking | MedXpertQA · mm | 42.9 | 19 / 31 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | ZEROBench · sub | 18.6 | 19 / 23 | Tracked evidence
Qwen3 VL 2B · Thinking | MMBench · en_dev_v1_1 | 81.9 | 20 / 24 | Tracked evidence
Qwen3 VL 8B · Thinking | MathVista · mini | 79.5 | 20 / 36 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | CharXiv Reasoning | 77.2 | 20 / 48 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | Video-MMMU | 75 | 20 / 28 | Tracked evidence
Qwen 3.5 2B · Thinking | MMStar | 71.7 | 20 / 33 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | RealWorldQA | 71.2 | 20 / 24 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | DynaMath | 69.6 | 20 / 23 | Tracked evidence
Qwen 3.5 0.8B · Thinking | EmbSpatialBench | 68.6 | 20 / 24 | Tracked evidence
Qwen3 VL 2B · Thinking | VideoMME · without_sub | 62.1 | 20 / 21 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | SLAKE | 59.5 | 20 / 22 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | ERQA | 45.3 | 20 / 27 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MedXpertQA · mm | 35.5 | 20 / 31 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MMVU | 34.3 | 20 / 20 | Tracked evidence
Qwen 3.5 2B · Thinking | ZEROBench · sub | 17.1 | 20 / 23 | Tracked evidence
Qwen 3.5 4B · Thinking | BabyVision | 16 | 20 / 22 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MMBench · en_dev_v1_1 | 81.3 | 21 / 24 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | MMStar | 69.9 | 21 / 33 | Tracked evidence
Qwen3 VL 2B · Thinking | MLVU · mavg | 69.2 | 21 / 22 | Tracked evidence
Qwen3 VL 2B · Thinking | VideoMME · with_sub | 67.9 | 21 / 22 | Tracked evidence
Qwen3 VL 2B · Thinking | DynaMath | 66.7 | 21 / 23 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | EmbSpatialBench | 66.4 | 21 / 24 | Tracked evidence
Qwen 3.5 0.8B · Thinking | VideoMME · without_sub | 57.7 | 21 / 21 | Tracked evidence
Qwen 3.5 9B · Thinking | SimpleVQA | 51.2 | 21 / 29 | Tracked evidence
Qwen3 VL 2B · Thinking | ZEROBench · sub | 13.2 | 21 / 23 | Tracked evidence
Qwen 3.5 2B · Thinking | ZEROBench | 1 | 21 / 27 | Tracked evidence
Qwen3 VL 8B · Thinking | AI2D · test | 83.9 | 22 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | CountBench | 77 | 22 / 23 | Tracked evidence
Qwen 3.5 4B · Thinking | Video-MMMU | 74.1 | 22 / 28 | Tracked evidence
Qwen3 VL 2B · Thinking | RealWorldQA | 69.5 | 22 / 24 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MLVU · mavg | 65.6 | 22 / 22 | Tracked evidence
Qwen3 VL 4B · Thinking | HallusionBench | 64.1 | 22 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | VideoMME · with_sub | 63.8 | 22 / 22 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | SLAKE | 54.7 | 22 / 22 | Tracked evidence
Qwen 3.5 0.8B · Thinking | DynaMath | 49.9 | 22 / 23 | Tracked evidence
Qwen3 VL 4B · Thinking | SimpleVQA | 48.8 | 22 / 29 | Tracked evidence
Qwen 3.5 0.8B · Thinking | ZEROBench · sub | 12.9 | 22 / 23 | Tracked evidence
Qwen 3.5 0.8B · Thinking | ZEROBench | 0 | 22 / 27 | Tracked evidence
Qwen 3.5 2B · Thinking | AI2D · test | 83.3 | 23 / 33 | Tracked evidence
Qwen 3.5 2B · Thinking | MathVista · mini | 76.7 | 23 / 36 | Tracked evidence
Qwen 3.5 9B · Thinking | CharXiv Reasoning | 73 | 23 / 48 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MMBench · en_dev_v1_1 | 69.9 | 23 / 24 | Tracked evidence
Qwen3 VL 4B · Thinking | Video-MMMU | 69.4 | 23 / 28 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | CountBench | 68.6 | 23 / 23 | Tracked evidence
Qwen 3.5 0.8B · Thinking | RealWorldQA | 63.4 | 23 / 24 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | DynaMath | 46.5 | 23 / 23 | Tracked evidence
Qwen 3.5 2B · Thinking | ERQA | 43.8 | 23 / 27 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | ZEROBench · sub | 11.4 | 23 / 23 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | ZEROBench | 0 | 23 / 27 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | AI2D · test | 83 | 24 / 33 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | MathVista · mini | 76.4 | 24 / 36 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MMBench · en_dev_v1_1 | 68 | 24 / 24 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | RealWorldQA | 61.6 | 24 / 24 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | EmbSpatialBench | 54.6 | 24 / 24 | Tracked evidence
Qwen3 VL 2B · Thinking | SimpleVQA | 43.6 | 24 / 29 | Tracked evidence
Qwen3 VL 2B · Thinking | ERQA | 41.8 | 24 / 27 | Tracked evidence
Qwen 3.5 2B · Thinking | MedXpertQA · mm | 26.9 | 24 / 31 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | ZEROBench | 0 | 24 / 27 | Tracked evidence
Qwen3 VL 2B · Thinking | MMStar | 68.1 | 25 / 33 | Tracked evidence
Qwen 3.5 2B · Thinking | Video-MMMU | 62.1 | 25 / 28 | Tracked evidence
Qwen 3.5 4B · Thinking | SimpleVQA | 43.4 | 25 / 29 | Tracked evidence
Qwen 3.5 0.8B · Thinking | ERQA | 34.5 | 25 / 27 | Tracked evidence
Qwen3 VL 2B · Thinking | ZEROBench | 0 | 25 / 27 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | AI2D · test | 81.5 | 26 / 33 | Tracked evidence
Qwen 3.5 4B · Thinking | CharXiv Reasoning | 70.8 | 26 / 48 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MMStar | 68 | 26 / 33 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | SimpleVQA | 39.5 | 26 / 29 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | ERQA | 33 | 26 / 27 | Tracked evidence
Qwen3 VL 4B · Thinking | MedXpertQA · mm | 26.3 | 26 / 31 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | ZEROBench | 0 | 26 / 27 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MathVista · mini | 73.9 | 27 / 36 | Tracked evidence
Qwen3 VL 2B · Thinking | Video-MMMU | 54.1 | 27 / 28 | Tracked evidence
Qwen 3.5 2B · Thinking | SimpleVQA | 38.5 | 27 / 29 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MedXpertQA · mm | 25.3 | 27 / 31 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | ERQA | 23.8 | 27 / 27 | Tracked evidence
Qwen3 VL 4B · Thinking | ZEROBench | 0 | 27 / 27 | Tracked evidence
Qwen3 VL 2B · Thinking | MathVista · mini | 73.6 | 28 / 36 | Tracked evidence
Qwen 3.5 2B · Thinking | HallusionBench | 58 | 28 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | Video-MMMU | 44.3 | 28 / 28 | Tracked evidence
Qwen 3.5 0.8B · Thinking | SimpleVQA | 31.3 | 28 / 29 | Tracked evidence
Qwen3 VL 2B · Thinking | AI2D · test | 80.4 | 29 / 33 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | SimpleVQA | 30.4 | 29 / 29 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MedXpertQA · mm | 19.1 | 29 / 31 | Tracked evidence
Qwen3 VL 2B · Thinking | HallusionBench | 54.9 | 30 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MedXpertQA · mm | 17.1 | 30 / 31 | Tracked evidence
Qwen 3.5 0.8B · Thinking | AI2D · test | 69.9 | 31 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MMStar | 58.3 | 31 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | HallusionBench | 53.1 | 31 / 33 | Tracked evidence
Qwen3 VL 2B · Thinking | MedXpertQA · mm | 13 | 31 / 31 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | AI2D · test | 68.7 | 32 / 33 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | CharXiv Reasoning | 66.1 | 32 / 48 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MMStar | 55.9 | 32 / 33 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | HallusionBench | 51.3 | 32 / 33 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MathVista · mini | 62.2 | 33 / 36 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | HallusionBench | 46.7 | 33 / 33 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MathVista · mini | 58.6 | 34 / 36 | Tracked evidence
Qwen 3.5 2B · Thinking | CharXiv Reasoning | 58.8 | 37 / 48 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | CharXiv Reasoning | 56.6 | 38 / 48 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | CharXiv Reasoning | 52.6 | 41 / 48 | Tracked evidence
Qwen3 VL 4B · Thinking | CharXiv Reasoning | 50.3 | 42 / 48 | Tracked evidence
Qwen 3.5 0.8B · Thinking | CharXiv Reasoning | 41.3 | 45 / 48 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | CharXiv Reasoning | 38.2 | 46 / 48 | Tracked evidence
Qwen3 VL 2B · Thinking | CharXiv Reasoning | 37.1 | 47 / 48 | Tracked evidence

Document/OCR

Model / Variant | Benchmark | Score | Rank | Scoring
Qwen 3.5 397b A17b · Thinking | OCRBench | 93.1 | 1 / 35 | Tracked evidence
Qwen 3.5 397b A17b · Thinking | MMLongBench-Doc | 61.5 | 2 / 22 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | OCRBench | 92.1 | 3 / 35 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | OCRBench | 91 | 4 / 35 | Tracked evidence
Qwen 3.5 27b · Thinking | MMLongBench-Doc | 60.2 | 4 / 22 | Tracked evidence
Qwen 3.5 35b A3b · Thinking | MMLongBench-Doc | 59.5 | 5 / 22 | Tracked evidence
Qwen3 VL 8B · Non-Thinking | OCRBench | 90 | 6 / 35 | Tracked evidence
Qwen 3.5 122b A10b · Thinking | MMLongBench-Doc | 59 | 6 / 22 | Tracked evidence
Qwen 3.5 27b · Thinking | OCRBench | 89.4 | 7 / 35 | Tracked evidence
Qwen 3.6 27B · Thinking | OCRBench | 89.4 | 8 / 35 | Tracked evidence
Qwen 3.5 9B · Thinking | MMLongBench-Doc | 57.7 | 8 / 22 | Tracked evidence
Qwen 3.5 9B · Thinking | OCRBench | 89.2 | 9 / 35 | Tracked evidence
Qwen3 VL 32B · Non-Thinking | OCRBench | 88.5 | 10 / 35 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | MMLongBench-Doc | 56.2 | 10 / 22 | Tracked evidence
Qwen3 VL 235B A22B · Thinking | OCRBench | 87.5 | 11 / 35 | Tracked evidence
Qwen 3.5 4B · Thinking | MMLongBench-Doc | 54.2 | 11 / 22 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | MMLongBench-Doc | 47.4 | 13 / 22 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | OCRBench | 85.4 | 14 / 35 | Tracked evidence
Qwen 3.5 4B · Thinking | OCRBench | 85 | 15 / 35 | Tracked evidence
Qwen 3.5 2B · Thinking | MMLongBench-Doc | 45.4 | 15 / 22 | Tracked evidence
Qwen3 VL 32B · Thinking | OCRBench | 85 | 16 / 35 | Tracked evidence
Qwen3 VL 4B · Thinking | MMLongBench-Doc | 44.4 | 16 / 22 | Tracked evidence
Qwen 3.5 2B · Thinking | OCRBench | 84.5 | 17 / 35 | Tracked evidence
Qwen 3.5 2B · Non-Thinking | MMLongBench-Doc | 38.8 | 17 / 22 | Tracked evidence
Qwen3 VL 30B A3B · Thinking | OCRBench | 83.9 | 18 / 35 | Tracked evidence
Qwen3 VL 2B · Thinking | MMLongBench-Doc | 33.8 | 19 / 22 | Tracked evidence
Qwen 3.5 0.8B · Thinking | MMLongBench-Doc | 33.6 | 20 / 22 | Tracked evidence
Qwen3 VL 8B · Thinking | OCRBench | 82 | 21 / 35 | Tracked evidence
Qwen3 VL 4B · Thinking | OCRBench | 80.8 | 22 / 35 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | MMLongBench-Doc | 28.1 | 22 / 22 | Tracked evidence
Qwen3 VL 2B · Thinking | OCRBench | 79.2 | 25 / 35 | Tracked evidence
Qwen 3.5 0.8B · Non-Thinking | OCRBench | 79.1 | 26 / 35 | Tracked evidence
Qwen 3.5 0.8B · Thinking | OCRBench | 74.5 | 31 / 35 | Tracked evidence

Where this family sits in the market

Several Qwen3 variants sit on the open-weights Pareto frontier, with competitive quality at sub-$0.50/M input pricing.

Chart legend (providers): Anthropic, Cohere, DeepSeek, Google, Meta, Microsoft, MiniMax, Mistral, Moonshot, NVIDIA, OpenAI, Qwen, xAI, Zhipu

Dashed line = Pareto frontier (no model both cheaper and better). Thinking/non-thinking pairs of the same model are connected; line length = cost of reasoning.
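
The frontier definition above (no model both cheaper and better) can be sketched in a few lines. This is a minimal illustration, not the code behind the chart: the two Qwen rows use the input price and quality score quoted on this page, the two `hypothetical-*` rows are made up for the example, and using input price as the cost axis is our assumption.

```python
from typing import NamedTuple


class Point(NamedTuple):
    name: str
    price: float    # USD per 1M input tokens (assumed cost axis)
    quality: float  # quality score, higher is better


def pareto_frontier(points):
    """Keep the points that no other point beats on both price and quality."""
    frontier = []
    best_quality = float("-inf")
    # Walk from cheapest to most expensive; on a price tie, higher quality first.
    for p in sorted(points, key=lambda p: (p.price, -p.quality)):
        # A point survives only if it is better than everything cheaper than it.
        if p.quality > best_quality:
            frontier.append(p)
            best_quality = p.quality
    return frontier


models = [
    Point("Qwen 3.6 35B-A3B (Thinking)", 0.14, 82.0),  # figures from this page
    Point("Qwen 3.7 Max Preview", 0.78, 100.9),        # figures from this page
    Point("hypothetical-model-a", 0.50, 75.0),         # made up: dominated
    Point("hypothetical-model-b", 0.05, 60.0),         # made up: cheapest
]

frontier_names = [p.name for p in pareto_frontier(models)]
print(frontier_names)
```

Here `hypothetical-model-a` drops out because Qwen 3.6 35B-A3B is both cheaper and higher-scoring; the other three stay on the frontier.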

Self-hosting

These variants ship with open weights, so you can run them on your own hardware or via a hosting provider you control. Pick a variant that fits your GPU memory budget; mixture-of-experts variants are cheaper to serve than their total parameter count suggests, but the full weights still need to fit in memory.

  • Qwen3-0.6B (Thinking): Non-thinking · open weights
  • Qwen3-1.7B (Thinking): Non-thinking · open weights
  • Qwen3-4B (Thinking): Non-thinking · open weights
  • Qwen 3 8b: Non-thinking · open weights
  • Qwen 3 14b: Non-thinking · open weights
  • Qwen 3 30b A3b: Non-thinking · open weights
  • Qwen 3 32b: Non-thinking · open weights
  • Qwen 3 235b A22b: Non-Thinking · open weights
  • Qwen 3.5 0.8B: Thinking · open weights
  • Qwen 3.5 2B: Thinking · open weights
  • Qwen 3.5 4B: Thinking · open weights
  • Qwen 3.5 9B: Thinking · open weights
  • Qwen 3.5 27b: Thinking · open weights
  • Qwen 3.5 35b A3b: Thinking · open weights
  • Qwen 3.5 122b A10b: Thinking · open weights
  • Qwen 3.5 397b A17b: Thinking · open weights
  • Qwen 3.5 Flash: Thinking · open weights
  • Qwen 3.6 27B: Thinking · open weights
  • Qwen 3.6 35B-A3B: Thinking · open weights
  • Qwen 3.6 Flash: Thinking · open weights
  • Qwen3 Next 80B A3B: Thinking · open weights
  • Qwen 3 Coder 480B A35B Instruct: Non-thinking · open weights
  • Qwen3 VL 2B: Thinking · open weights
  • Qwen3 VL 4B: Thinking · open weights
  • Qwen3 VL 8B: Thinking · open weights
  • Qwen3 VL 30B A3B: Thinking · open weights
  • Qwen3 VL 32B: Thinking · open weights
  • Qwen3 VL 235B A22B: Thinking · open weights
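
To make the "full weights still need to fit in memory" point concrete, here is a back-of-the-envelope sketch. It estimates weight memory only (total parameters times bytes per parameter) and ignores KV cache and activations; the function names and the 0.8 headroom factor are our assumptions for illustration, not measured figures.

```python
def weight_memory_gb(total_params_billion: float, bits_per_param: int) -> float:
    """Weights only: parameters x bytes per parameter, in GB (1 GB = 1e9 bytes)."""
    total_bytes = total_params_billion * 1e9 * bits_per_param / 8
    return total_bytes / 1e9


def fits_on_gpu(total_params_billion: float, bits_per_param: int,
                gpu_memory_gb: float, headroom: float = 0.8) -> bool:
    """True if the weights fit in a fraction of GPU memory.

    headroom is a hypothetical safety factor that leaves room for the
    KV cache and activations, which this sketch does not model.
    """
    return weight_memory_gb(total_params_billion, bits_per_param) <= gpu_memory_gb * headroom


# A 35B-total / ~3B-active MoE: the active count lowers compute per token,
# but all 35B parameters must still be resident in memory.
bf16_gb = weight_memory_gb(35, 16)  # 16-bit weights
int4_gb = weight_memory_gb(35, 4)   # 4-bit quantised weights

print(f"16-bit: {bf16_gb:.1f} GB, 4-bit: {int4_gb:.1f} GB")
print("Fits on an 80 GB GPU at 16-bit:", fits_on_gpu(35, 16, 80))
print("Fits on a 24 GB GPU at 4-bit:", fits_on_gpu(35, 4, 24))
```

Under these assumptions a 35B-total model needs about 70 GB at 16-bit and about 17.5 GB at 4-bit, regardless of how few parameters are active per token.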

The Qwen3 family

Every variant we track in this family, grouped by license. Use this to orient before drilling into the variant table.

Open weights (28)

  • Qwen3-0.6B (Thinking): 2 variants
  • Qwen3-1.7B (Thinking): 2 variants
  • Qwen3-4B (Thinking): 2 variants
  • Qwen 3 8b: 2 variants
  • Qwen 3 14b: 2 variants
  • Qwen 3 30b A3b: 4 variants
  • Qwen 3 32b: 2 variants
  • Qwen 3 235b A22b: 6 variants
  • Qwen 3.5 0.8B: 2 variants
  • Qwen 3.5 2B: 2 variants
  • Qwen 3.5 4B: 2 variants
  • Qwen 3.5 9B: 2 variants
  • Qwen 3.5 27b: 2 variants
  • Qwen 3.5 35b A3b: 2 variants
  • Qwen 3.5 122b A10b: 2 variants
  • Qwen 3.5 397b A17b: 1 variant
  • Qwen 3.5 Flash: 1 variant
  • Qwen 3.6 27B: 1 variant
  • Qwen 3.6 35B-A3B: 1 variant
  • Qwen 3.6 Flash: 1 variant
  • Qwen3 Next 80B A3B: 1 variant
  • Qwen 3 Coder 480B A35B Instruct: 1 variant
  • Qwen3 VL 2B: 1 variant
  • Qwen3 VL 4B: 1 variant
  • Qwen3 VL 8B: 2 variants
  • Qwen3 VL 30B A3B: 1 variant
  • Qwen3 VL 32B: 2 variants
  • Qwen3 VL 235B A22B: 1 variant

Closed · API only (2)

  • Qwen 3.6 Plus: 1 variant
  • Qwen3 Max: 4 variants

Alternatives to consider

Peer families that solve overlapping problems. Pick by your binding constraint (cost, latency, open weights, vendor lock-in), not by leaderboard order.

Caveats

What this page does not tell you, listed honestly.

  • Quality score not yet computed for: Qwen 3.5 0.8B, Qwen 3.5 2B, Qwen 3.5 Flash, Qwen 3.6 Flash, Qwen3 VL 2B, Qwen3 VL 4B, Qwen3 VL 8B, Qwen3 VL 30B A3B, Qwen3 VL 32B, Qwen3 VL 235B A22B. We require a minimum benchmark coverage before scoring; until the gap is filled the row shows a dash.
  • No tracked API pricing for: Qwen3-0.6B (Thinking), Qwen3-1.7B (Thinking), Qwen3-4B (Thinking), Qwen 3.5 0.8B, Qwen 3.5 2B, Qwen 3.5 4B, Qwen3 VL 2B, Qwen3 VL 4B. Variants without hosted-provider pricing are listed for completeness; cost columns show a dash.
  • Context window not declared for: Qwen3-0.6B (Thinking), Qwen3-1.7B (Thinking), Qwen3-4B (Thinking), Qwen 3.5 0.8B, Qwen 3.5 2B, Qwen 3.5 4B, Qwen3 VL 2B, Qwen3 VL 4B.

Editor's notes

By Boris · Last verified · AI-assisted, human-reviewed

Why this family matters

Qwen3 is the broadest open-weights family currently in our index: dense models from 0.6B to 32B, mixture-of-experts builds (235B-A22B and 30B-A3B at launch, with further MoE refreshes in Qwen3.5 and Qwen3.6), and a long-context coding-tuned variant, all under permissive licensing. Whether any single Qwen3 variant tops the open-weights leaderboard at the moment you read this is a question for the variant table on this page. The family's structural value is the spread, and that is what makes Qwen3 a default candidate for any team that does not want to be locked to a single API.

When the binding constraint is peak score on a single reasoning benchmark and budget is unbounded, closed flagships are typically the safer pick. When deployment factors carry weight in the decision (latency floor, data sovereignty, predictable per-token cost, the option to fine-tune or self-host), an open-weights variant belongs on the shortlist. Qwen3 makes that shortlist conversation easier than most, because of the variant spread.

How the family is structured

Qwen3 ships in three lines that sit on the same page because they share generation, brand, and licence terms, not because they are interchangeable. Pick the line first, then the variant within it.

  • Qwen3 text line. Dense models from 0.6B to 32B for self-hosting, plus mixture-of-experts builds (30B-A3B, 235B-A22B, Qwen3.5 397B-A17B, Qwen3.6 35B-A3B) for production-scale serving. This is the default chat-and-tools workhorse line; if you are not sure which line applies, you want this one.
  • Qwen3 Coder. A single variant (qwen-3-coder-480b-a35b) purpose-built for agentic coding workloads. Its pricing differs from the text line; pick this when SWE-bench-class throughput is the binding constraint, not when a chat model would also handle the occasional code question.
  • Qwen3-VL. Vision-language variants (2B, 4B, 8B, 30B-A3B, 32B, 235B-A22B) for image-grounded workloads. Use this line when the workload is layout-aware document extraction, image reasoning, or any task where running OCR-to-text and then a chat-tier model loses information. Caveat: our benchmark coverage on the VL line is thin compared with the text line; treat the listed variants as a shortlist to evaluate against your own data.

A second axis cuts across all three lines: most variants ship with a thinking mode and a non-thinking mode under the same model name. Thinking modes use explicit chain-of-thought before answering and typically cost more tokens to produce a response; non-thinking modes answer directly. The variant table on this page surfaces both modes where they exist; pick by the workload's tolerance for latency and cost-per-call, not by assuming thinking is always better. Some variants additionally show a base or instruct label: base is the foundation checkpoint without instruction-tuning (rarely the right pick for product features); instruct is the tuned version meant for direct deployment.
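
The cost side of the thinking vs non-thinking trade-off is simple arithmetic. The sketch below uses the $0.14 / $1.00 per 1M token prices listed on this page for Qwen 3.6 35B-A3B; the token counts are hypothetical, and it assumes reasoning tokens are billed at the output rate and that both modes share one price, which you should confirm with your provider.

```python
def call_cost_usd(input_tokens: int, output_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Cost of one call given per-1M-token prices."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


INPUT_PRICE = 0.14   # USD per 1M input tokens (from this page)
OUTPUT_PRICE = 1.00  # USD per 1M output tokens (from this page)

prompt_tokens = 2_000     # hypothetical prompt size
answer_tokens = 300       # hypothetical visible answer
reasoning_tokens = 1_500  # hypothetical chain-of-thought, assumed billed as output

non_thinking = call_cost_usd(prompt_tokens, answer_tokens, INPUT_PRICE, OUTPUT_PRICE)
thinking = call_cost_usd(prompt_tokens, answer_tokens + reasoning_tokens,
                         INPUT_PRICE, OUTPUT_PRICE)

print(f"non-thinking: ${non_thinking:.5f} per call")
print(f"thinking:     ${thinking:.5f} per call")
print(f"ratio:        {thinking / non_thinking:.1f}x")
```

With these made-up token counts the thinking call costs several times more per request, which is why the mode should follow the workload's tolerance for latency and cost rather than a blanket preference.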

Which variant to start with

If you are picking up Qwen3 for the first time, default to qwen-qwen3-6-35b-a3b. It is a mixture-of-experts model: 35B total parameters, only ~3B active per token, so it serves and costs like a small model on capable hardware while remaining comparable on quality to dense models 3 to 5 times larger (see scores in the variant table below).

When to deviate:

  • Coding agents: use qwen-3-coder-480b-a35b. Purpose-built for agentic coding workloads. Pricing runs materially above the value pick (see variant table); worth it only if your workload is dominated by agentic coding loops where the SWE-bench-class score gap pays back the per-token cost.
  • Self-host on a single GPU: the 8B or 14B dense variants. Hosted routes typically expose ~40K context, but the official Qwen3 model cards describe larger YaRN-extended windows on these base models. Verify the context limit on the deployment surface you actually use. The MoE versions either need tensor-parallel inference or leave you paying for unused experts in VRAM.
  • Long-document work: watch the provider limit, not just the model card. Our current hosted rows show many Qwen3 dense variants and some 30B-A3B routes at ~40K tokens, even though Qwen's model cards describe larger YaRN-extended windows for several base models. 235B-A22B is commonly exposed at 128K. The Qwen3.5 / Qwen3.6 refreshes (397B-A17B, 35B-A3B) and Qwen3-Coder reach 256K-class contexts. Qwen3.6 Plus is the 1M-context option in our current data.
  • You already use a closed flagship and want a fallback: start with the 235B-A22B MoE. It is the variant most likely to be a drop-in for a GPT-class workload at a fraction of the per-token cost.
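
The long-document guidance in the list above amounts to a lookup: find the smallest hosted context tier that covers your prompt plus the output you expect. The sketch below encodes the tiers mentioned in the list; the exact token counts are rounded assumptions, the labels are ours, and it assumes the context window has to hold both prompt and output, so check your provider's actual limits.

```python
# Approximate hosted context tiers, taken from the notes in the list above.
# Token counts are rounded assumptions, not provider guarantees.
CONTEXT_TIERS = [
    (40_000, "Qwen3 dense variants (typical hosted route)"),
    (128_000, "Qwen3 235B-A22B"),
    (256_000, "Qwen3.5 / Qwen3.6 refreshes or Qwen3-Coder"),
    (1_000_000, "Qwen3.6 Plus"),
]


def smallest_tier_that_fits(prompt_tokens: int, max_output_tokens: int):
    """Return the label of the smallest tier whose window covers prompt + output."""
    needed = prompt_tokens + max_output_tokens
    for limit, label in CONTEXT_TIERS:
        if needed <= limit:
            return label
    return None  # nothing listed here is large enough


pick = smallest_tier_that_fits(prompt_tokens=100_000, max_output_tokens=8_000)
print(pick)
```

A 100K-token prompt with 8K tokens of output lands in the 128K tier here; a request larger than every listed tier returns `None`, which is the signal to chunk the input or rethink the retrieval step.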

Where the data is weak

We aggregate benchmark scores from multiple sources but coverage is not uniform. Specifically:

  • The 4B and 1.7B size tiers have thinner benchmark coverage than their bigger siblings. Treat their listed scores as directional, not comparative.
  • Several variants are missing release dates upstream. We are working on backfilling these from the registry.
  • Display-name conventions across the family are not fully normalised yet ("Qwen 3 32b" for the original, "Qwen 3.5", "Qwen 3.6 35B-A3B"). When in doubt, the slug (qwen-qwen3-32b vs qwen-qwen3-6-35b-a3b) is the unambiguous identifier.
  • Hosted-context vs model-card-context: many tables on this site show the context window the API actually exposes today, which is often smaller than the model card's YaRN-extended ceiling. We are planning to surface both numbers explicitly; in the meantime, treat the listed value as "what the provider serves now."
  • Series-level Pareto positioning is not yet in our pipeline; per-variant benchmarks in the table are the load-bearing data.

If you are making a procurement decision, the variant table on this page is the load-bearing artifact. Cross-check pricing against the provider's own docs before you commit. Pricing changes faster than our scrape cadence.

When to reach for which alternative

  • Long-form reasoning chains as the dominant workload: before committing to a Qwen3 variant, check DeepSeek-R1's score on the same benchmark in our index. Long chain-of-thought is the workload where the ranking is most likely to flip family.
  • Enterprise procurement where licence terms and US-jurisdiction hosting matter: Llama variants tend to clear those gates with fewer questions than Qwen3 does. Qwen3's licence is permissive, but the provenance conversation is structurally different and worth surfacing with your procurement team early.
  • Cost ceiling is high and the only axis is peak quality on a single benchmark: that conversation lives with the closed flagships (GPT-5, Claude Opus 4). Compare scores on the specific benchmark that matters for your workload; the cross-family comparison views in our index are designed for exactly this question.

Sources worth reading

Recent voices

External pointers worth reading on this family. Curated, dated, attributed; we link to sources rather than reproducing them.

  • Blog · Qwen Team
    Qwen3 release announcement

    Official launch post covering the dense 0.6B to 32B range and the 30B-A3B / 235B-A22B mixture-of-experts variants.

  • HF · Qwen Team
    Qwen organisation on Hugging Face

    Canonical source for model cards, weights and per-variant licence information across the Qwen3 family.

  • Blog · Qwen Team
    Qwen3-Coder release post

    Qwen3-Coder launch covering the 480B-A35B agentic-coding variant and its long-context capabilities.

  • HF · Qwen Team
    Qwen3.6-35B-A3B model card

    Model card for the Qwen3.6 mixture-of-experts refresh; primary source for the 35B/A3B context window and licensing claims.

  • GitHub · ggml-org/llama.cpp
    llama.cpp PR optimizes Qwen 3.5 inference on Apple Silicon

    Merged llama.cpp PR specifically tunes Qwen 3.5 kernels for M-series unified memory. Mac users running Qwen 3.5 locally should rebuild; reported decode speedups are large enough to change which variant is practical on a given Mac.

Changelog

  • Data

    Folded qwen3-max, qwen3-next-80b-a3b, and qwen3.6-flash into the qwen3 surface. Product decision: Qwen3-family by branding, no standalone page. Surface now owns 32 registry slugs.

How we score

Quality scores combine multiple public benchmarks (LMArena, LiveBench, SWE-bench, Aider and others) into a single comparable number. Pricing is the published API list price; self-hosted cost depends on your own hardware. We do not accept paid placements.

Author: Boris. Read the full methodology.
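
For readers who want a mental model of "combine multiple benchmarks into a single number" with a minimum-coverage rule, here is a deliberately simple sketch. It is not the scoring pipeline used on this page (see the full methodology for that): the plain average, the `min_benchmarks=3` threshold, and the example scores are all assumptions made for illustration.

```python
from statistics import mean


def quality_score(scores: dict, min_benchmarks: int = 3):
    """Average the available benchmark scores, or return None below the coverage bar.

    scores maps benchmark name -> score, with None for a missing result.
    None as the return value corresponds to the dash shown for unscored rows.
    """
    available = [value for value in scores.values() if value is not None]
    if len(available) < min_benchmarks:
        return None
    return round(mean(available), 1)


# Made-up scores for two hypothetical variants.
well_covered = {"LiveBench": 80.0, "SWE-bench": 90.0, "Aider": 70.0}
thin_coverage = {"LiveBench": 80.0, "SWE-bench": 90.0, "Aider": None}

print(quality_score(well_covered))   # three results: a score is produced
print(quality_score(thin_coverage))  # two results: below the bar, no score
```

The design point is the early return: a variant with too few tracked benchmarks gets no score at all rather than a number built on thin evidence.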
