Infrastructure intelligence

The hardware story behind frontier AI

Compute clusters, data centers, GPU supply, power deals, and capex — the physical layer that decides which labs can train tomorrow's frontier and at what unit cost. Editorial briefs synthesize the data; signals below are the atomic facts they're built on.

Editorial briefs

2 published

Brief · May 20, 2026

The CoWoS bottleneck: why GPU supply still gates 2026 model roadmaps

Every Blackwell, Ironwood, and Trainium2 chip flows through one TSMC packaging line. Capacity doubled in 2026 — and every frontier lab is still constrained.

supply-chaintsmcpackagingnvidia

Brief · May 15, 2026

Anthropic's two-cloud bet: why Trainium plus TPU changes the math

Custom silicon on two hyperscalers gives Anthropic the lowest unit cost in the frontier — and the biggest single-vendor risk.

anthropicsilicon-economicsawsgoogle

Signals

1 event

All Compute cluster Data center Silicon GPU supply Power Capex Partnership Training run

Google xAI DeepSeek OpenAI Anthropic Mistral Meta Alibaba

SiliconverifiedApr 9, 2026
Google TPU v7 "Ironwood": 9,216-chip pods, 42.5 exaflops, dedicated to inference
Announced at Google Cloud Next April 2026, TPU v7 (Ironwood) is the first Google TPU generation purpose-built for inference rather than training. Each pod scales to 9,216 chips delivering 42.5 exaflops of FP8 compute, with 192GB HBM3e per chip. Powers Gemini 3 Pro / 3.5 inference at Google scale. Annual TPU spend ramped to over $40B for FY26.
Accelerators
TPU v7 Ironwood
Google Gemini 3.5 Google Cloud Next 2026 keynote ↗

Editorial briefs

The CoWoS bottleneck: why GPU supply still gates 2026 model roadmaps

Anthropic's two-cloud bet: why Trainium plus TPU changes the math

Signals

Google TPU v7 "Ironwood": 9,216-chip pods, 42.5 exaflops, dedicated to inference