Models/Vendor
Google
Subscription plans
updated 2026-05-11Free
individual$0/ month
Gemini 2.5 Flash with daily limits. Free access to Gemini app + AI Studio.
- ✓Gemini 2.5 Flash
- ✓AI Studio access
- ✓Limited Gemini 3 Pro queries
AI Pro
individual$20/ month
Gemini 3 Pro access in the Gemini app + Workspace integrations. 2TB Drive included.
- ✓Gemini 3 Pro
- ✓Deep Research
- ✓Workspace integration
- ✓2TB Google Drive
- +1 more
AI Ultra
individual$250/ month
Gemini 3.5 + Deep Think reasoning mode + highest Gemini limits. Premium tier.
- ✓Gemini 3.5
- ✓Deep Think reasoning
- ✓Higher quotas across all Gemini products
- ✓Veo video model access
- +1 more
Workspace Business
enterprise$30/ seat / mo
Gemini bundled into Workspace Business plans. Per-seat pricing.
- ✓Gemini in Workspace apps
- ✓NotebookLM Plus
- ✓Admin controls
- ✓Data residency options
Infrastructure intelligence
Full feed →Compute deals, data centers, silicon, and capex that shape Google's training and inference economics.
- Siliconverified
Google TPU v7 "Ironwood": 9,216-chip pods, 42.5 exaflops, dedicated to inference
Announced at Google Cloud Next April 2026, TPU v7 (Ironwood) is the first Google TPU generation purpose-built for inference rather than training. Each pod scales to 9,216 chips delivering 42.5 exaflops of FP8 compute, with 192GB HBM3e per chip. Powers Gemini 3 Pro / 3.5 inference at Google scale. Annual TPU spend ramped to over $40B for FY26.
Accelerators: TPU v7 Ironwood - Powerreported
Google Stone Mountain GA campus expanding to 2GW with on-site solar+gas hybrid
Google's Stone Mountain, Georgia data center cluster — built around Gemini training — disclosed a 2GW expansion plan in early 2026 backed by a 1.6GW solar+gas hybrid generation deal with Southern Company. Site will host the second-largest TPU v7 deployment after Council Bluffs, IA.
Power: 2 GWLocation: Stone Mountain, GA - Partnershipverified
Anthropic adds up to 1M TPU v7 Ironwood chips via Google Cloud
Anthropic's November 2025 deal with Google Cloud expands its TPU footprint to up to 1M Ironwood (TPU v7) chips, supplementing the Trainium2 fleet on AWS. This makes Anthropic the rare frontier lab running heterogeneous custom silicon across two clouds. Deal value reportedly tens of billions over multiple years.
Accelerators: 1M · TPU v7 Ironwood
Models
Filter on /models →Gemini 3.5
released 2026-05-20
Google DeepMind's next-gen Gemini — positioned as "frontier intelligence with action". Built for complex agentic workflows. Announced at Google I/O 2026.
- Context
- 2,000,000
- License
- proprietary
Gemini Omni
released 2026-05-20
Multimodal Gemini variant introduced at Google I/O 2026 — unified text, image, audio, and video processing in a single model.
- Context
- 1,000,000
- License
- proprietary
Gemini 3 Pro
released 2026-04-22
Google DeepMind's flagship multimodal model. Best-in-class for multimodal understanding and agentic / vibe coding workflows. A "Deep Think" reasoning mode is available to AI Ultra subscribers.
- Context
- 2,000,000
- License
- proprietary
Gemini 2.5 Flash
released 2025-06-17
Google's cost-and-latency-optimized Gemini 2.5 variant. Generally available June 2025 — designed for high-throughput agentic workflows with sub-second first-token latency.
- Context
- 1,000,000
- License
- proprietary
Gemini 2.5 Pro
released 2025-03-25
Predecessor to Gemini 3 Pro.
- Context
- 2,000,000
- License
- proprietary
Recent news
Introducing computer use in Gemini 3.5 Flash
Google introduced computer-use capabilities in Gemini 3.5 Flash, enabling the model to interact with computer interfaces as part of its workflow. It matters because this moves Gemini from text and image generation toward agentic task execution, a step toward automating multi-step actions in software.
Fluid, natural voice translation with Gemini 3.5 Live Translate
Gemini 3.5 Live Translate adds near real-time, natural speech translation to Google AI Studio, Google Translate, and Google Meet. It matters because it extends Gemini’s voice translation into products used for development, consumer translation, and video meetings, making live cross-language conversation more fluid.
11 demos of Gemini Omni and Gemini 3.5 in action
Google I/O 2026 included 11 demo videos showing Gemini Omni and Gemini 3.5 in action. The demos highlight Google’s latest multimodal models in practical use, giving a concrete look at the capabilities behind the announcement.
Catch up on 12 major I/O 2026 moments
Google I/O 2026 highlighted 12 major keynote moments, including updates on Gemini Omni and Gemini 3.5 Flash. The lineup matters because it signals Google's latest push across its Gemini model family, with specific new model names suggesting continued expansion in capability and product coverage.