gpt.buzz
Sign in

Compare models

Pick up to 4models. Specs render side-by-side. Share the URL — it's stateless.

Comparing agents instead? Switch to agent compare →

SelectedxAI logoGrok 4×Alibaba logoQwen3.6-27B×Anthropic logoClaude 4.7 Opus×Mistral logoMistral Large 2×Clear all
 
xAI logoGrok 4

xAI

Alibaba logoQwen3.6-27B

Alibaba

Anthropic logoClaude 4.7 Opus

Anthropic

Mistral logoMistral Large 2

Mistral

VendorxAIAlibabaAnthropicMistral
FamilyGrokQwenClaudeMistral
Release date2025-07-092026-04-222026-04-162024-07-24
Context window256,000 tokens262,144 tokens1,000,000 tokens128,000 tokens
Parameters27B (dense)123B
Modalitytexttext, vision, videotext, visiontext
LicenseproprietaryApache-2.0proprietaryMistral Research License
Sourceproprietaryopen weightsproprietaryopen weights
DescriptionAlibaba's first dense open-weight in the 3.6 family. Strong agentic-coding scores (77.2 SWE-bench Verified, matching Claude 4.5 Opus on Terminal-Bench 2.0). Supports 201 languages and multimodal text/image/video input.Anthropic's previous Opus flagship — superseded by Opus 4.8 on May 28, 2026 (42-day cycle). Optimized for complex reasoning and coding. Improved software engineering, long-running coding tasks, and higher-resolution vision over Claude 4.6.Large text-only Mistral model with a 128K context window and 123B parameters, tuned for strong instruction following and long-context reasoning. Mistral's flagship open-weights release in the Large 2 line.
Links
Benchmarks
MMLU-Pro82.8%78.9%88.2%
GPQA-D87.5%90.1%
HumanEval91.5%97.4%89.0%
Aider72.5%70.2%91.2%
AIME-2586.1%93.5%
LiveCB65.3%80.2%
MMMU70.3%79.0%

Add a model

Max 4 models. Remove one to add another.