gpt.buzz
Curated model list

Models with the longest context windows in 2026

Long context has gone from research curiosity to production necessity. Modern agent stacks routinely chew through 100K+ tokens. These are the models that handle it without losing the plot in the middle.

01

Llama 4 Scout

Meta

10,000,000-token context. Meta's long-context offering.

open source

02

Gemini 3.5

Google

2,000,000-token context. Google DeepMind's next-gen Gemini, positioned as "frontier intelligence with action."

03

Gemini 3 Pro

Google

2,000,000-token context. Google DeepMind's flagship multimodal model.

04

Gemini 2.5 Pro

Google

2,000,000-token context. Predecessor to Gemini 3 Pro.

05

Qwen3.7-Max

Alibaba

1,000,000-token context. Alibaba's flagship agent model, with an extended-thinking mode and a score of 56.6 on the Artificial Analysis Intelligence Index v4.0 (5th overall, #1 among Chinese models).

06

Claude 4.7 Opus

Anthropic

1,000,000-token context. Anthropic's previous Opus flagship, superseded by Opus 4.8 on May 28, 2026 (42-day cycle).

07

Claude 4.6 Sonnet

Anthropic

1,000,000-token context. Anthropic's mid-tier flagship, released 12 days after Opus 4.6 and matching most of its capabilities at ~40% of the API cost ($3/$15 per 1M input/output tokens).

08

GPT-4.1

OpenAI

1,000,000-token context. OpenAI's long-context offering.
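
To put the numbers above to work, here is a minimal sketch of shortlisting models for a given workload. The context windows are copied from this list; the function name `models_that_fit` and the 8,000-token output reserve are illustrative assumptions, not any vendor's API or recommendation.

```python
# Context windows in tokens, copied from the list above.
CONTEXT_WINDOWS = {
    "Llama 4 Scout": 10_000_000,
    "Gemini 3.5": 2_000_000,
    "Gemini 3 Pro": 2_000_000,
    "Gemini 2.5 Pro": 2_000_000,
    "Qwen3.7-Max": 1_000_000,
    "Claude 4.7 Opus": 1_000_000,
    "Claude 4.6 Sonnet": 1_000_000,
    "GPT-4.1": 1_000_000,
}


def models_that_fit(prompt_tokens: int, reserved_output_tokens: int = 8_000) -> list[str]:
    """Return the listed models whose window covers prompt plus reply.

    reserved_output_tokens is a hypothetical headroom for the model's
    answer; tune it to your own workload.
    """
    needed = prompt_tokens + reserved_output_tokens
    return [name for name, window in CONTEXT_WINDOWS.items() if window >= needed]


if __name__ == "__main__":
    # A 1.5M-token prompt rules out every 1M-token model.
    print(models_that_fit(1_500_000))
```

Note that a prompt sitting just under a window's limit (say, 999,000 tokens against a 1M window) is excluded once the output reserve is added, which is usually the safer default.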
