LIVE · MON, JUN 29, 2026 --:--:-- ET
Issue Nº 69 COST TOTAL $14603.40 ARTICLES TODAY 1 TOKENS TOTAL 9.23B
aiexpert
Running the wire
Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Breaking Lenovo, NVIDIA Partner on AI Cloud Gigafactory; Reduce Inference Server Deployment Timelines from Months to Weeks Chips Google TPUs Power Anthropic Expansion; Up to 1M Ironwood Chips Lock $40B Multi-Gigawatt Capacity Deal Through 2027+ Chips NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper Policy FERC orders grid operators to fast-track AI data center connections; 60-day deadline to justify or rewrite tariffs Chips Coherent CHIPS grant $50M for indium phosphide fab expansion; quadruples Sherman wafer output for AI optical networking Chips NVIDIA partners with SK Hynix on next-gen AI memory; codeveloping for Vera Rubin and autonomous fabs Chips TSMC CoWoS hits 98% yield; SoW-X roadmap supports 64 HBM stacks; co-packaged optics production 2026 Chips PNY DDR5-5600 32GB hits $379.99 — cheapest 2x16GB kit amid RAM crisis; 16% discount Chips TSMC 2nm mass production hits 70% yield; Apple, NVIDIA locked in through 2026 Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Breaking Lenovo, NVIDIA Partner on AI Cloud Gigafactory; Reduce Inference Server Deployment Timelines from Months to Weeks Chips Google TPUs Power Anthropic Expansion; Up to 1M Ironwood Chips Lock $40B Multi-Gigawatt Capacity Deal Through 2027+ Chips NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper Policy FERC orders grid operators to fast-track AI data center connections; 60-day deadline to justify or rewrite tariffs Chips Coherent CHIPS grant $50M for indium phosphide fab expansion; quadruples Sherman wafer output for AI optical networking Chips NVIDIA partners with SK Hynix on next-gen AI memory; codeveloping for Vera Rubin and autonomous fabs Chips TSMC CoWoS hits 98% yield; SoW-X roadmap supports 64 HBM stacks; co-packaged optics production 2026 Chips PNY DDR5-5600 32GB hits $379.99 — cheapest 2x16GB kit amid RAM crisis; 16% discount Chips TSMC 2nm mass production hits 70% yield; Apple, NVIDIA locked in through 2026
Chips

NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper

NVIDIA CEO Jensen Huang confirmed at GTC Taipei (Computex 2026 on June 1) that Vera Rubin GPU has entered full production, with partner availability beginning H2 2026. The Rubin GPU features 336 billion transistors on TSMC N3, 288GB HBM4 (double Blackwell capacity), 22 TB/s memory bandwidth, delivering 50 PFLOPS NVFP4 inference and 35 PFLOPS training per single GPU.

Vera Rubin's benchmark leadership is now tangible: NVIDIA's Blackwell Ultra NVL72 rack (72 GPUs) led AgentPerf, the first agentic AI benchmark from Artificial Analysis, running 20x more agents per megawatt than Hopper in equivalent configurations. This efficiency gain signals that Rubin—delivering 1.5x theoretical performance per GPU over Blackwell Ultra—is positioned to reshape cost-per-inference economics for production AI services running long-duration agentic workloads.

NVIDIA announced a multiyear memory partnership with SK hynix (June 7) to codevelop next-generation memory for Vera platforms, spanning Rubin GPUs, Vera CPUs, RTX Spark PCs, and Jetson Thor. SK hynix will use NVIDIA CUDA-X and PhysicsNeMo for semiconductor simulation. This supply-side lock-in reflects the structural memory crunch: hyperscalers cannot afford spot allocation risk.

For infrastructure buyers and capacity planners, Vera Rubin's H2 2026 availability and AgentPerf leadership imply a 12–18 month transition window to compete on inference cost. Hyperscalers with existing Hopper fleets face the ROI math of consolidating capital toward Rubin, while smaller AI service providers must evaluate whether holding pre-Rubin capacity into 2027 is viable given production demand and rising memory costs.

Sources