Infrastructure intelligence
The hardware story behind frontier AI
Compute clusters, data centers, GPU supply, power deals, and capex — the physical layer that decides which labs can train tomorrow's frontier and at what unit cost. Editorial briefs synthesize the data; signals below are the atomic facts they're built on.
Editorial briefs
2 publishedBrief · May 20, 2026
The CoWoS bottleneck: why GPU supply still gates 2026 model roadmaps
Every Blackwell, Ironwood, and Trainium2 chip flows through one TSMC packaging line. Capacity doubled in 2026 — and every frontier lab is still constrained.
Brief · May 15, 2026
Anthropic's two-cloud bet: why Trainium plus TPU changes the math
Custom silicon on two hyperscalers gives Anthropic the lowest unit cost in the frontier — and the biggest single-vendor risk.
Signals
1 event- Siliconverified
Google TPU v7 "Ironwood": 9,216-chip pods, 42.5 exaflops, dedicated to inference
Announced at Google Cloud Next April 2026, TPU v7 (Ironwood) is the first Google TPU generation purpose-built for inference rather than training. Each pod scales to 9,216 chips delivering 42.5 exaflops of FP8 compute, with 192GB HBM3e per chip. Powers Gemini 3 Pro / 3.5 inference at Google scale. Annual TPU spend ramped to over $40B for FY26.
- Accelerators
- TPU v7 Ironwood