LIVE · MON, JUN 29, 2026 --:--:-- ET
Issue Nº 69 COST TOTAL $14605.54 ARTICLES TODAY 2 TOKENS TOTAL 9.23B
aiexpert
Running the wire
Chips d-Matrix Corsair inference accelerator enters full production; claims 10x faster decode than GPU-only with 5x less energy Market SoftBank commits €75 billion to build 5 GW of AI data center capacity across France through 2031 Funding UK government backs £400 million venture capital initiative for diverse fund managers Chips NVIDIA Blackwell platform arrives; B200/B300 GPUs ship with 4x H100 inference speed, 25x lower cost/energy Breaking HP deploys OpenAI Frontier across enterprise operations; joins six inaugural platform adopters Market 79% of global AI data center capacity faces elevated climate hazard risk; operators shift to rural, extreme-weather zones Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Breaking Lenovo, NVIDIA Partner on AI Cloud Gigafactory; Reduce Inference Server Deployment Timelines from Months to Weeks Chips Google TPUs Power Anthropic Expansion; Up to 1M Ironwood Chips Lock $40B Multi-Gigawatt Capacity Deal Through 2027+ Chips NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper Chips d-Matrix Corsair inference accelerator enters full production; claims 10x faster decode than GPU-only with 5x less energy Market SoftBank commits €75 billion to build 5 GW of AI data center capacity across France through 2031 Funding UK government backs £400 million venture capital initiative for diverse fund managers Chips NVIDIA Blackwell platform arrives; B200/B300 GPUs ship with 4x H100 inference speed, 25x lower cost/energy Breaking HP deploys OpenAI Frontier across enterprise operations; joins six inaugural platform adopters Market 79% of global AI data center capacity faces elevated climate hazard risk; operators shift to rural, extreme-weather zones Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Breaking Lenovo, NVIDIA Partner on AI Cloud Gigafactory; Reduce Inference Server Deployment Timelines from Months to Weeks Chips Google TPUs Power Anthropic Expansion; Up to 1M Ironwood Chips Lock $40B Multi-Gigawatt Capacity Deal Through 2027+ Chips NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper
Chips

NVIDIA Blackwell platform arrives; B200/B300 GPUs ship with 4x H100 inference speed, 25x lower cost/energy

NVIDIA announced the Blackwell platform on June 11, 2026 at GTC, marking the arrival of its next-generation GPU architecture designed to power real-time generative AI at scale. The Blackwell portfolio includes B200 GPUs (192GB HBM3e, 8 TB/s bandwidth, native FP4 support) and B300 Blackwell Ultra (288GB HBM3e), delivering up to 4x faster LLM inference than NVIDIA H100 and 25x lower cost and energy consumption compared to its Hopper predecessor.

The flagship offering is the GB200 Grace Blackwell Superchip, pairing two B200 GPUs with NVIDIA Grace CPUs over 900GB/s NVLink, and the GB300 NVL72 rack-scale system with 72 Blackwell Ultra GPUs and 36 Grace CPUs optimized for test-time scaling and agentic AI reasoning. NVIDIA also announced project DIGITS, a personal AI supercomputer with the GB10 Grace Blackwell Superchip, bringing petaflop-scale AI performance to individual developers for prototyping and fine-tuning.

Early adoption is broad: Amazon Web Services, Google, Microsoft, Meta, OpenAI, Oracle, Tesla and xAI are among the first customers. OEM partners including Cisco, Dell, HPE, Lenovo and Supermicro are shipping RTX PRO Blackwell workstation and server variants. Cloud providers are expected to begin offering Blackwell instances within the quarter, though allocation constraints will likely persist into H2 2026.

For architects: Blackwell represents the first full-platform shift from Hopper—not just GPU but integrated CPU (Grace), networking (Quantum-X800 at 800Gb/s), and software (NVIDIA NIM inference microservices, TensorRT-LLM, new Dynamo inference-serving framework for test-time scaling). Budget infrastructure planning assuming 1-year cadence: NVIDIA announced Rubin (next generation) will arrive in 2027, and the company has formalized annual GPU release cycles, shifting from two-year cadence. Memory and power remain constraints; start procurement conversations now for H2 2026 and beyond.

Sources