Compare models

Pick up to 4models. Specs render side-by-side. Share the URL — it's stateless.

Comparing agents instead? Switch to agent compare →

Selected

GPT-5.5×

Claude 4.8 Opus×

Gemini 3 Pro×

Gemini Omni×Clear all

	GPT-5.5 OpenAI	Claude 4.8 Opus Anthropic	Gemini 3 Pro Google	Gemini Omni Google
Vendor	OpenAI	Anthropic	Google	Google
Family	GPT	Claude	Gemini	Gemini
Release date	2026-04-23	2026-05-28	2026-04-22	2026-05-20
Context window	400,000 tokens	1,000,000 tokens	2,000,000 tokens	1,000,000 tokens
Parameters	—	—	—	—
Modality	text, vision, audio	text, vision	text, vision, audio, video	text, vision, audio, video
License	proprietary	proprietary	proprietary	proprietary
Source	proprietary	proprietary	proprietary	proprietary
Description	OpenAI's smartest and most intuitive model — successor to GPT-5, with major coding, research, and document workflow gains. GPT-5.5 Pro variant also available for heavier reasoning.	Anthropic's flagship hybrid-reasoning model — Opus 4.8 pushes coding and AI-agent workflows further (agentic coding 69.2% vs 64.3% on 4.7; multidisciplinary reasoning 57.9% vs 54.7%). 1M-token context, $5/$25 per MTok pricing held flat from Opus 4.7. Introduces Dynamic Workflows for Claude Code (research preview). GA on Anthropic API (`claude-opus-4-8`), AWS Bedrock, Google Vertex, Microsoft Foundry, and GitHub Copilot.	Google DeepMind's flagship multimodal model. Best-in-class for multimodal understanding and agentic / vibe coding workflows. A "Deep Think" reasoning mode is available to AI Ultra subscribers.	Multimodal Gemini variant introduced at Google I/O 2026 — unified text, image, audio, and video processing in a single model.
Links	vendor →	vendor →	vendor →	vendor →
Benchmarks
MMLU-Pro	87.6%	—	87.1%	—
GPQA-D	89.4%	—	88.6%	—
HumanEval	96.8%	—	96.5%	—
Aider	89.7%	—	84.8%	—
AIME-25	94.2%	—	92.8%	—
LiveCB	78.9%	—	76.1%	—
MMMU	78.4%	—	84.3%	82.5%