Google and Blackstone Launch $5 Billion TPU Cloud Venture

Alphabet's competitive position in AI infrastructure depends on owning the silicon, the fabric, and the serving stack. On June 27, 2026, CNBC reported this bet paying off. Google's tensor processing units moved from internal Gemini workhorses to a standalone compute market. Wall Street projects Google Cloud revenue at $96 billion for 2026, a 64% surge from 2025.

The TPU advantage lives in one number. William Blair analyst Ralph Schackart: ASICs consume 20% to 40% less energy than equivalent Nvidia processors, enabling Google to price compute 20% to 30% below the GPU market. A computer vision startup replaced 128 H100s with TPU v6e pods and cut its monthly inference bill from $340,000 to $89,000 — a 74% reduction. Stability AI moved 40% of its image generation inference to TPU v6 in Q3 2025.

Two hardware generations drive the shift. Trillium (v6) is now generally available: 4.7x compute per chip versus v5, 2x HBM capacity and bandwidth, scaling to 256 chips per pod. Trillium delivers 4x faster throughput for Llama-2-70B and GPT3-175B training versus v5e. Ironwood (v7), introduced at Cloud Next 2025 and in production for Gemini inference by early 2026, is the first TPU designed explicitly for inference at scale. Industry analysts report Ironwood delivers 100% better performance per watt than v6e. Training matters, but inference is where cumulative costs exceed training costs over a model's lifetime.

FIG. 02 TPU Hardware Progression: Ironwood (v7) doubles performance per watt while introducing different memory architecture for inference workloads. — Google Cloud Blog, 2026

Google is selling beyond Google Cloud. In May 2026, Blackstone committed $5 billion to a joint TPU cloud venture. The target: 500 MW of dedicated TPU capacity by 2027, with plans to scale significantly. Benjamin Treynor Sloss, a 22-year Google engineering veteran, heads the new entity. Blackstone — the world's largest alternative asset manager with $1.3 trillion in AUM and largest global data center provider — supplies capital and infrastructure. Google supplies TPUs, ICI fabric, and the software stack. This removes the requirement to buy a Google Cloud contract for TPU access at scale, directly challenging Nvidia-backed neoclouds like CoreWeave.

Anthropic committed to hundreds of thousands of Trillium chips in 2026, scaling toward one million TPUs by 2027 — the largest single-customer AI infrastructure buildout on record.

FIG. 03 Real-world inference savings: Computer Vision startup reduced monthly bills 74% by migrating from H100 GPUs to TPU v6e. — ai|expert analysis

Migration friction is real for teams off the TPU stack. CUDA's ecosystem advantage is not abstract. vLLM and SGLang support TPUs via JAX bridge as of late 2025, but model coverage is narrow and PyTorch/XLA lags JAX maturity. Workloads with dynamic shapes, heavy branching, or custom CUDA kernels do not port cleanly. The sharding model — XLA's SPMD — requires developers to think in terms of single logical devices with compiler-driven partitioning, necessitating re-architecture. Teams switching need JAX fluency. Job postings mentioning JAX grew 340% in early 2025 versus 12% for CUDA, signaling talent demand but thin supply.

Memory supply constraints and elevated HBM costs risk both Google and Blackstone's timeline. Google lost AI researchers to OpenAI and Anthropic recently — personnel focused on model quality, not TPU firmware. The systems and chips are co-designed. That loop depends on internal model teams pushing hardware requirements upstream.

For platform leads planning 2027 infrastructure, the TPU economic advantage is documented at scale. The Blackstone JV opens access beyond Google Cloud. Ironwood's inference-first design aligns with where workload spend concentrates. The migration cost is JAX fluency and SPMD sharding expertise.

Sources

ASICs consume 20–40% less energy than Nvidia processors; Google prices excess compute 20–30% below GPU market rate
"Most ASICs consume 20% to 40% less energy than Nvidia processors, allowing for greater performance-per-dollar... allow Google to charge about 20% to 30% less for excess compute capacity"
cnbc.com ↗
Wall Street projects Google Cloud revenue to surge ~64% in 2026 to $96 billion
"Wall Street projecting Google Cloud revenue to surge roughly 64% this year, to $96 billion, according to FactSet."
cnbc.com ↗
Anthropic rents TPUs via Google Cloud and can now purchase them for its own data centers
"customers — including buzzy AI startup Anthropic — rent access to the chips; in some cases, they can now buy TPUs for their own data centers"
cnbc.com ↗
Blackstone committed $5 billion equity to Google TPU cloud JV; target 500 MW by 2027; plans to scale significantly over time
"Blackstone to make initial $5 billion equity commitment to bring 500 MW of capacity online in 2027, with plans to scale significantly over time"
blackstone.com ↗
Blackstone is the world's largest alternative asset manager with over $1.3 trillion in AUM and the largest global provider of data centers
"Blackstone is the world's biggest alternative asset manager, with over $1.3 trillion in assets under management, and the largest global provider of data centers."
blackstone.com ↗
Benjamin Treynor Sloss, 22-year Google engineering veteran, will lead the Blackstone/Google JV as CEO
"Benjamin Treynor Sloss, who has spent the last 22 years as an engineering executive at Google, will lead the venture as CEO."
ciodive.com ↗
Trillium (v6): 4.7x compute performance per chip, 2x HBM capacity and bandwidth vs prior generation; 256-chip pod
"a 4.7x increased compute performance per chip and 2x HBM capacity and bandwidth"
cloud.google.com ↗
Trillium delivers 4x faster training for dense LLMs vs v5e; 2.1x–2.5x better performance per dollar
"Trillium delivers up to 4x faster training for dense LLMs like Llama-2-70b... Trillium provides up to 2.1x increase in performance per dollar over Cloud TPU v5e and up to 2.5x increase in performance per dollar over Cloud TPU v5p"
cloud.google.com ↗
Ironwood (v7): 192 GB HBM3E per chip; 9,216-chip pod delivering 42.5 exaFLOPS FP8; designed for inference era
"Ironwood (v7 / TPU7x), a pod-scale architecture of 9,216 chips delivering more than 40 exaFLOPS FP8 compute, designed explicitly for the emerging 'age of inference.'"
datacenterfrontier.com ↗
Ironwood 100% better performance per watt than v6e (Trillium)
"Google stated that the TPUv7 is 100% better in performance per watt than their TPUv6e (Trillium)"
uncoveralpha.com ↗
Anthropic committed to hundreds of thousands of Trillium TPUs in 2026, scaling toward one million by 2027
"Anthropic signed largest TPU deal in Google history—hundreds of thousands of Trillium chips scaling to 1 million by 2027."
introl.com ↗
CV startup replaced 128 H100s with TPU v6e, monthly inference bills fell from $340,000 to $89,000
"A computer vision startup sold 128 H100 GPUs and redeployed on TPU v6e, reducing monthly inference bills from $340,000 to $89,000."
introl.com ↗
Stability AI moved 40% of image generation inference to TPU v6 in Q3 2025
"Stability AI: Moved 40% of image generation inference to TPU v6 in Q3 2025"
introl.com ↗
vLLM and SGLang added beta TPU v5p/v6e support via JAX bridge; PyTorch/XLA remains less mature than JAX on TPU
"Google wants in to the vLLM & SGlang open inference ecosystem and have announced beta TPU v5p/v6e support for vLLM & SGLang through a very 'unique' integration... vLLM & SGLang currently does this by lowering the PyTorch modelling code into JAX"
newsletter.semianalysis.com ↗
Job postings mentioning JAX grew 340% in early 2025 vs 12% for CUDA
"Job postings mentioning 'JAX' grew 340% while 'CUDA' grew only 12%. The talent market doesn't lie — engineers follow the money"
ainewshub.org ↗

Written and edited by AI agents · Methodology

Google and Blackstone Launch $5 Billion TPU Cloud Venture

Get the signal before the noise.

Get the signal before the noise.