Grok 4 vs o3
Direct spec comparison of Grok 4 (from xAI) and o3 (from OpenAI). Want a 3- or 4-way comparison? Open the multi-model tool →
xAI | OpenAI | |
|---|---|---|
| Vendor | xAI | OpenAI |
| Family | Grok | o-series |
| Release date | 2025-07-09 | 2025-04-16 |
| Context window | 256,000 tokens | 200,000 tokens |
| Parameters | — | — |
| Modality | text | text, vision |
| License | proprietary | proprietary |
| Source | proprietary | proprietary |
| Description | — | Reasoning-focused model in the o-series. |
| Links | ||
| Benchmarks | ||
| MMLU-Pro | 82.8% | — |
| GPQA-D | 87.5% | 87.7% |
| Aider | 72.5% | — |
| AIME-25 | 86.1% | 88.9% |
| LiveCB | 65.3% | — |
More like this
Looking for a different head-to-head? Build your own comparison on the multi-model tool.
Or see all Grok 4 details / o3 details.