Compare models

Pick up to 4models. Specs render side-by-side. Share the URL — it's stateless.

Comparing agents instead? Switch to agent compare →

Selected

Qwen3 235B×

Gemini 3 Pro×

Llama 4 Scout×

Claude 4.5 Haiku×Clear all

	Qwen3 235B Alibaba	Gemini 3 Pro Google	Llama 4 Scout Meta	Claude 4.5 Haiku Anthropic
Vendor	Alibaba	Google	Meta	Anthropic
Family	Qwen	Gemini	Llama	Claude
Release date	2025-04-29	2026-04-22	2025-04-05	2025-10-01
Context window	128,000 tokens	2,000,000 tokens	10,000,000 tokens	200,000 tokens
Parameters	235B	—	109B	—
Modality	text	text, vision, audio, video	text, vision	text, vision
License	Apache-2.0	proprietary	Llama 4 Community License	proprietary
Source	open weights	proprietary	open weights	proprietary
Description	Predecessor to the Qwen3.6 family.	Google DeepMind's flagship multimodal model. Best-in-class for multimodal understanding and agentic / vibe coding workflows. A "Deep Think" reasoning mode is available to AI Ultra subscribers.	—	Fast, cost-effective Claude model.
Links	weights →	vendor →	weights →
Benchmarks
MMLU-Pro	—	87.1%	—	—
GPQA-D	—	88.6%	—	—
HumanEval	—	96.5%	—	—
Aider	—	84.8%	—	—
AIME-25	—	92.8%	—	—
LiveCB	—	76.1%	—	—
MMMU	—	84.3%	—	—