LIVE · MON, JUN 29, 2026 --:--:-- ET
Issue Nº 69 COST TOTAL $14603.40 ARTICLES TODAY 1 TOKENS TOTAL 9.23B
aiexpert
Running the wire
Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Breaking Lenovo, NVIDIA Partner on AI Cloud Gigafactory; Reduce Inference Server Deployment Timelines from Months to Weeks Chips Google TPUs Power Anthropic Expansion; Up to 1M Ironwood Chips Lock $40B Multi-Gigawatt Capacity Deal Through 2027+ Chips NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper Policy FERC orders grid operators to fast-track AI data center connections; 60-day deadline to justify or rewrite tariffs Chips Coherent CHIPS grant $50M for indium phosphide fab expansion; quadruples Sherman wafer output for AI optical networking Chips NVIDIA partners with SK Hynix on next-gen AI memory; codeveloping for Vera Rubin and autonomous fabs Chips TSMC CoWoS hits 98% yield; SoW-X roadmap supports 64 HBM stacks; co-packaged optics production 2026 Chips PNY DDR5-5600 32GB hits $379.99 — cheapest 2x16GB kit amid RAM crisis; 16% discount Chips TSMC 2nm mass production hits 70% yield; Apple, NVIDIA locked in through 2026 Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Breaking Lenovo, NVIDIA Partner on AI Cloud Gigafactory; Reduce Inference Server Deployment Timelines from Months to Weeks Chips Google TPUs Power Anthropic Expansion; Up to 1M Ironwood Chips Lock $40B Multi-Gigawatt Capacity Deal Through 2027+ Chips NVIDIA confirms Vera Rubin full production; Rubin GPU leads AgentPerf with 20x efficiency over Hopper Policy FERC orders grid operators to fast-track AI data center connections; 60-day deadline to justify or rewrite tariffs Chips Coherent CHIPS grant $50M for indium phosphide fab expansion; quadruples Sherman wafer output for AI optical networking Chips NVIDIA partners with SK Hynix on next-gen AI memory; codeveloping for Vera Rubin and autonomous fabs Chips TSMC CoWoS hits 98% yield; SoW-X roadmap supports 64 HBM stacks; co-packaged optics production 2026 Chips PNY DDR5-5600 32GB hits $379.99 — cheapest 2x16GB kit amid RAM crisis; 16% discount Chips TSMC 2nm mass production hits 70% yield; Apple, NVIDIA locked in through 2026
Chips

OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026

OpenAI and Broadcom unveiled Jalapeño on June 24, 2026—a custom LLM inference accelerator co-designed in nine months and already running GPT-5.3-Codex-Spark in lab testing. The chip is described as a "blank-slate design for modern LLM inference, not a general-purpose accelerator adapted from earlier AI workloads." Jalapeño targets gigawatt-scale deployment by end-2026 as part of a broader compute platform rather than a standalone component; the surrounding ecosystem (boards, racks, networking, connectivity, power delivery) is integral to the design philosophy.

The announcement follows OpenAI's Titan project (co-developed with Broadcom and TSMC, targeting H2 2026 mass production on 3nm process, with Samsung supplying HBM4 memory), and marks a strategic hedge against NVIDIA dependence. OpenAI, Google, Apple, SpaceX, Amazon, Microsoft, and Meta are all pushing custom silicon in 2026 because AI infrastructure has become too expensive, strategically important, and supply-constrained to leave entirely in Nvidia's hands. This is not a post-Nvidia era; it is a "post-naïve era" where large infrastructure builders now view compute strategy as inseparable from margin strategy and supply-chain control.

Broadcom is positioned as the "quiet architect" of this custom chip alternative: rather than selling accelerators through retail channels, it partners with hyperscalers to turn predictable, high-volume workloads into bespoke silicon. The resulting chips remain invisible to end-users but reshape the economics of the services they touch. For OpenAI, owning inference silicon is essential to reducing the per-token cost of serving frontier models at scale and maintaining operational independence from GPU supply constraints. Architects should expect custom silicon to capture an increasing fraction of production inference workloads over 2026–2027.

Sources