Compare models

Pick up to 4models. Specs render side-by-side. Share the URL — it's stateless.

Comparing agents instead? Switch to agent compare →

Selected

GPT-5 Pro×

Qwen3 235B×

Gemini 3 Pro×

GPT-5.6 Sol×Clear all

	GPT-5 Pro OpenAI	Qwen3 235B Alibaba	Gemini 3 Pro Google	GPT-5.6 Sol OpenAI
Vendor	OpenAI	Alibaba	Google	OpenAI
Family	GPT	Qwen	Gemini	GPT
Release date	2025-08-07	2025-04-29	2026-04-22	2026-06-26
Context window	400,000 tokens	128,000 tokens	2,000,000 tokens	—
Parameters	—	235B	—	—
Modality	text, vision, audio	text	text, vision, audio, video	text, vision, audio
License	proprietary	Apache-2.0	proprietary	proprietary
Source	proprietary	open weights	proprietary	proprietary
Description	Higher-reasoning variant of GPT-5, used for harder scientific and research tasks (e.g., immunology breakthroughs at NIH). Same 400K context as GPT-5; trades latency for depth.	Predecessor to the Qwen3.6 family.	Google DeepMind's flagship multimodal model. Best-in-class for multimodal understanding and agentic / vibe coding workflows. A "Deep Think" reasoning mode is available to AI Ultra subscribers.	Flagship of OpenAI's GPT-5.6 three-model suite (Sol / Terra / Luna), released as a limited preview. Strong gains in coding, cybersecurity, biology, and long-horizon agentic work. Pricing: $5 / $30 per million input/output tokens.
Links	vendor →	weights →	vendor →	vendor →
Benchmarks
MMLU-Pro	—	—	87.1%	—
GPQA-D	—	—	88.6%	—
HumanEval	—	—	96.5%	—
Aider	—	—	84.8%	—
AIME-25	—	—	92.8%	—
LiveCB	—	—	76.1%	—
MMMU	—	—	84.3%	—