LIVE · WED, JUN 24, 2026 --:--:-- ET
Issue Nº 64 COST TOTAL $14510.26 ARTICLES TODAY 14 TOKENS TOTAL 9.10B
aiexpert
Running the wire
Market Micron posts $41.5B Q3 revenue, guides $50B for Q4 on AI memory supercycle Funding Qualcomm acquires Modular for ~$4B to build hardware-agnostic AI stack against NVIDIA CUDA Market AWS launches EC2 G7 instances with NVIDIA RTX PRO 4500 Blackwell; 4.6x inference gains Chips Qualcomm unveils Dragonfly C1000 data-center CPU; Meta commits to 2028 production volumes Chips OpenAI unveils Jalapeño inference chip with Broadcom, targets late-2026 deployment Breaking Huang tells shareholders black-market data centers from smuggled chips are a "dead end" Research Google integrates computer use natively into Gemini 3.5 Flash for agentic automation Research Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure Market Micron Q3 earnings beat on record DRAM margins; HBM supply fully allocated through 2026 Policy US secures Netherlands for Pax Silica chip alliance; ASML tensions persist over MATCH Act export restrictions Chips OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026 Breaking Gemini 3.5 Flash adds native computer use; agent framework now default across Search Research AI rapidly designs novel radio-frequency chips beyond human intuition, reducing years of work to hours Chips China's LineShine supercomputer tops TOP500 with 2.198 exaflops CPU-only, ending US El Capitan's reign Market Cerebras stock plummets 17% after margin-guidance miss as CEO says warning was 'misunderstood' Market Sunrun, Tesla, Renew Home form 16GW virtual power plant for AI data centers; RUN +31% Breaking Amazon Zoox unveils redesigned robotaxi, planning paid service launch in late 2026 Funding XCures closes $46M Series B round at $127M post-money valuation Funding Qualcomm acquires Modular for ~$4B to bolster AI software stack and data center play Chips OpenAI & Broadcom unveil Jalapeño, custom LLM inference chip with 9-month design cycle Market Micron posts $41.5B Q3 revenue, guides $50B for Q4 on AI memory supercycle Funding Qualcomm acquires Modular for ~$4B to build hardware-agnostic AI stack against NVIDIA CUDA Market AWS launches EC2 G7 instances with NVIDIA RTX PRO 4500 Blackwell; 4.6x inference gains Chips Qualcomm unveils Dragonfly C1000 data-center CPU; Meta commits to 2028 production volumes Chips OpenAI unveils Jalapeño inference chip with Broadcom, targets late-2026 deployment Breaking Huang tells shareholders black-market data centers from smuggled chips are a "dead end" Research Google integrates computer use natively into Gemini 3.5 Flash for agentic automation Research Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure Market Micron Q3 earnings beat on record DRAM margins; HBM supply fully allocated through 2026 Policy US secures Netherlands for Pax Silica chip alliance; ASML tensions persist over MATCH Act export restrictions Chips OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026 Breaking Gemini 3.5 Flash adds native computer use; agent framework now default across Search Research AI rapidly designs novel radio-frequency chips beyond human intuition, reducing years of work to hours Chips China's LineShine supercomputer tops TOP500 with 2.198 exaflops CPU-only, ending US El Capitan's reign Market Cerebras stock plummets 17% after margin-guidance miss as CEO says warning was 'misunderstood' Market Sunrun, Tesla, Renew Home form 16GW virtual power plant for AI data centers; RUN +31% Breaking Amazon Zoox unveils redesigned robotaxi, planning paid service launch in late 2026 Funding XCures closes $46M Series B round at $127M post-money valuation Funding Qualcomm acquires Modular for ~$4B to bolster AI software stack and data center play Chips OpenAI & Broadcom unveil Jalapeño, custom LLM inference chip with 9-month design cycle
Chips

OpenAI unveils Jalapeño inference chip with Broadcom, targets late-2026 deployment

OpenAI and Broadcom on Wednesday unveiled Jalapeño, OpenAI's first custom AI accelerator chip designed specifically for large language model inference. The companies claim early internal testing shows substantially better performance per watt than current state-of-the-art systems, though final benchmarks haven't been released. The chip was developed from design to tape-out in nine months, an unusually fast turnaround that OpenAI attributes to using its own models to accelerate parts of the hardware design process.

Jalapeño is a purpose-built ASIC with a massive compute chiplet (~840mm² reticle-sized die) surrounded by six HBM memory modules and optimized for low latency and high throughput inference. Unlike general-purpose GPUs, the architecture is tuned around LLM serving patterns, memory movement, and networking efficiency—balancing compute, memory, and I/O to operate closer to theoretical peak utilization. Broadcom handles silicon manufacturing and contributes its Tomahawk networking silicon; Celestica provides board and rack integration.

Deployment begins at gigawatt scale in late 2026 through Microsoft and other partners, with initial prototype production in late 2026 scaling through the years ahead. OpenAI President Greg Brockman told CNBC that OpenAI cannot get compute fast enough, underscoring the infrastructure pressure driving the partnership. Broadcom CEO Hock Tan noted the compute demand from the company's six hyperscaler customers is insatiable and expected to remain elevated through 2028.

For AI architects, Jalapeño signals OpenAI's move to own the full stack—from models to inference hardware—to reduce costs and latency on serving. This matters because OpenAI controls both the workload and silicon, enabling tighter hardware-software co-optimization than off-the-shelf GPUs can deliver. The nine-month design cycle and gigawatt-scale plans suggest a credible alternative to NVIDIA's dominance in inference infrastructure, though hard performance numbers are still pending.

Sources