gpt.buzz

Compare models

Pick up to 4 models. Specs render side-by-side. Share the URL; it's stateless.
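A stateless comparison URL means the whole selection lives in the link itself, so anyone opening it sees the same models without a stored session. The following is a minimal sketch of that idea, not the site's actual scheme: the base URL, the `models` query parameter, and the model slugs are all hypothetical, chosen only for illustration.

```python
from urllib.parse import urlencode, urlparse, parse_qs

MAX_MODELS = 4  # the page allows at most four models per comparison

# Hypothetical base URL and parameter name; the real URL scheme is an assumption.
BASE_URL = "https://example.com/compare"
PARAM = "models"


def build_compare_url(slugs):
    """Encode the selected model slugs into a shareable URL."""
    if len(slugs) > MAX_MODELS:
        raise ValueError(f"at most {MAX_MODELS} models can be compared")
    return f"{BASE_URL}?{urlencode({PARAM: ','.join(slugs)})}"


def parse_compare_url(url):
    """Recover the selection from the URL alone; no server-side state is needed."""
    query = parse_qs(urlparse(url).query)
    raw = query.get(PARAM, [""])[0]
    return [slug for slug in raw.split(",") if slug][:MAX_MODELS]


# Hypothetical slugs for the four models compared on this page.
selection = ["deepseek-r1", "qwen3-235b", "deepseek-v4-flash", "mistral-large-2"]
url = build_compare_url(selection)
assert parse_compare_url(url) == selection
```

Because the parser depends only on the query string, the same link reproduces the same comparison for every reader, which is what makes it safe to share.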

| | DeepSeek-R1 | Qwen3 235B | DeepSeek-V4-Flash | Mistral Large 2 |
|---|---|---|---|---|
| Vendor | DeepSeek | Alibaba | DeepSeek | Mistral |
| Family | DeepSeek | Qwen | DeepSeek | Mistral |
| Release date | 2025-01-20 | 2025-04-29 | 2026-04-22 | 2024-07-24 |
| Context window | 128,000 tokens | 128,000 tokens | 1,000,000 tokens | 128,000 tokens |
| Parameters | 671B | 235B | 284B (13B active) | 123B |
| Modality | text | text | text | text |
| License | MIT | Apache-2.0 | MIT | Mistral Research License |
| Source | open weights | open weights | open weights | open weights |
| Description | Reasoning-focused open-weight model. | Predecessor to the Qwen3.6 family. | Smaller, faster sibling to DeepSeek-V4-Pro. Same 1M context window with a much lighter 284B / 13B-active MoE. | |