Compare models

Pick up to 4models. Specs render side-by-side. Share the URL — it's stateless.

Comparing agents instead? Switch to agent compare →

Selected

Gemini 3 Pro×

Gemini Omni×

Claude 4.6 Sonnet×

Mistral Large 2×Clear all

	Gemini 3 Pro Google	Gemini Omni Google	Claude 4.6 Sonnet Anthropic	Mistral Large 2 Mistral
Vendor	Google	Google	Anthropic	Mistral
Family	Gemini	Gemini	Claude	Mistral
Release date	2026-04-22	2026-05-20	2026-02-17	2024-07-24
Context window	2,000,000 tokens	1,000,000 tokens	1,000,000 tokens	128,000 tokens
Parameters	—	—	—	123B
Modality	text, vision, audio, video	text, vision, audio, video	text, vision	text
License	proprietary	proprietary	proprietary	Mistral Research License
Source	proprietary	proprietary	proprietary	open weights
Description	Google DeepMind's flagship multimodal model. Best-in-class for multimodal understanding and agentic / vibe coding workflows. A "Deep Think" reasoning mode is available to AI Ultra subscribers.	Multimodal Gemini variant introduced at Google I/O 2026 — unified text, image, audio, and video processing in a single model.	Anthropic's mid-tier flagship — released 12 days after Opus 4.6 and matching most of its capabilities at ~40% the API cost ($3/$15 per 1M tokens). 1M-token context.	Large text-only Mistral model with a 128K context window and 123B parameters, tuned for strong instruction following and long-context reasoning. Mistral's flagship open-weights release in the Large 2 line.
Links	vendor →	vendor →		vendor →weights →
Benchmarks
MMLU-Pro	87.1%	—	85.4%	—
GPQA-D	88.6%	—	85.8%	—
HumanEval	96.5%	—	95.9%	89.0%
Aider	84.8%	—	83.6%	—
AIME-25	92.8%	—	87.4%	—
LiveCB	76.1%	—	73.8%	—
MMMU	84.3%	82.5%	—	—