Curated model list
Models with the longest context windows in 2026
Long context has gone from research curiosity to production necessity. Modern Agent stacks chew through 100K+ tokens routinely. These are the models that handle it without losing the plot in the middle.
Llama 4 Scout
Meta10,000,000-token context. Meta's long-context offering.
open sourceGemini 3.5
Google2,000,000-token context. Google DeepMind's next-gen Gemini — positioned as "frontier intelligence with action"
Gemini 3 Pro
Google2,000,000-token context. Google DeepMind's flagship multimodal model
Gemini 2.5 Pro
Google2,000,000-token context. Predecessor to Gemini 3 Pro.
Qwen3.7-Max
Alibaba1,000,000-token context. Alibaba's flagship agent model — 1M-token context, extended-thinking mode, 56.6 on the Artificial Analysis Intelligence Index v4.0 (5th overall, #1 Chinese)
Claude 4.7 Opus
Anthropic1,000,000-token context. Anthropic's previous Opus flagship — superseded by Opus 4.8 on May 28, 2026 (42-day cycle)
Claude 4.6 Sonnet
Anthropic1,000,000-token context. Anthropic's mid-tier flagship — released 12 days after Opus 4.6 and matching most of its capabilities at ~40% the API cost ($3/$15 per 1M tokens)
GPT-4.1
OpenAI1,000,000-token context. OpenAI's long-context offering.
Want the rest? Browse the full model catalog, or build a side-by-side comparison.