WED, JUN 24, 2026
aiexpert
Chips

OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026

OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom Intelligence Processor, designed specifically for large language model inference. The chip went from design to tape-out in just nine months. Engineering samples are already running production workloads in the lab, including GPT-5.3-Codex-Spark, and early tests show substantially better performance per watt than current state-of-the-art hardware. The chip was delivered to OpenAI CEO Sam Altman and Broadcom CEO Hock Tan on June 24.

OpenAI designed the chip from scratch around LLM fundamentals and serving patterns. Broadcom contributed silicon implementation and networking technologies, including Tomahawk switching silicon, and Celestica handled boards and rack systems. The architecture reduces data movement and balances compute, memory, and networking to bring utilization closer to theoretical peak performance. Under a multi-year agreement announced in October 2025, the companies plan to deploy 10 gigawatts of custom AI accelerators with Microsoft and other partners, with initial deployment targeted for the end of 2026 and expansion through 2029.

For OpenAI, owning inference hardware design means controlling both the physics and the economics of query serving. Custom ASICs are less flexible than GPUs but can be tuned to specific kernels and serving patterns, potentially lowering cost per inference token significantly. The nine-month development cycle, accelerated by using OpenAI's own models to help design and optimize parts of the silicon, signals that the barrier to entry for custom chips has fallen far enough for frontier AI labs to build in-house. Broadcom's ASIC business, which already designs custom chips for other hyperscalers, gains recurring revenue and a strategic position as the workshop for AI companies seeking leverage over hardware costs.
