LIVE · MON, JUN 29, 2026 --:--:-- ET
Issue Nº 69 COST TOTAL $14606.44 ARTICLES TODAY 3 TOKENS TOTAL 9.24B
aiexpert
Running the wire
Funding Reed Semiconductor raises $100M for AI power delivery; oversubscribed round signals infrastructure demand Chips Samsung ships industry-first HBM4E samples at 16Gbps, 48GB per stack; 20%+ speed gain over HBM4 Market Micron guides $50B Q4 revenue, 86% margins; signs 16 strategic customer agreements worth ~$100B Chips d-Matrix Corsair inference accelerator enters full production; claims 10x faster decode than GPU-only with 5x less energy Market SoftBank commits €75 billion to build 5 GW of AI data center capacity across France through 2031 Funding UK government backs £400 million venture capital initiative for diverse fund managers Chips NVIDIA Blackwell platform arrives; B200/B300 GPUs ship with 4x H100 inference speed, 25x lower cost/energy Breaking HP deploys OpenAI Frontier across enterprise operations; joins six inaugural platform adopters Market 79% of global AI data center capacity faces elevated climate hazard risk; operators shift to rural, extreme-weather zones Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand Funding Reed Semiconductor raises $100M for AI power delivery; oversubscribed round signals infrastructure demand Chips Samsung ships industry-first HBM4E samples at 16Gbps, 48GB per stack; 20%+ speed gain over HBM4 Market Micron guides $50B Q4 revenue, 86% margins; signs 16 strategic customer agreements worth ~$100B Chips d-Matrix Corsair inference accelerator enters full production; claims 10x faster decode than GPU-only with 5x less energy Market SoftBank commits €75 billion to build 5 GW of AI data center capacity across France through 2031 Funding UK government backs £400 million venture capital initiative for diverse fund managers Chips NVIDIA Blackwell platform arrives; B200/B300 GPUs ship with 4x H100 inference speed, 25x lower cost/energy Breaking HP deploys OpenAI Frontier across enterprise operations; joins six inaugural platform adopters Market 79% of global AI data center capacity faces elevated climate hazard risk; operators shift to rural, extreme-weather zones Funding Baidu's Kunlunxin targets $50B Hong Kong IPO, ties chip purchases to allocations Funding Momenta launches Hong Kong IPO targeting $751M for autonomous driving R&D Chips HBM now comprises 35-47% of AI accelerator BOM; GB200 HBM alone costs $4,800/unit Market Samsung HBM4 revenue tops $1 billion; targets $10 billion run-rate by end-2026 Chips OpenAI, Broadcom unveil Jalapeño LLM inference chip; gigawatt-scale deployment targeted by end-2026 Market TSMC warns AI chip shortage to persist into 2027; signals 15% 3nm price increase H2 2026 Research DeepSeek V4 DSpark speculative decoding cuts inference latency 85%, hits Together AI Breaking OpenAI launches $150M Partner Network to certify 300K consultants by year-end Breaking HP becomes flagship Frontier adopter; OpenAI scales enterprise AI agent platform with consulting partnerships Breaking Apple lobbies White House for CXMT DRAM approval as memory costs hit 20% MacBook, iPad price hikes Funding Samsung, SK Hynix plan $1.3T capex over decade on AI memory demand
Chips

d-Matrix Corsair inference accelerator enters full production; claims 10x faster decode than GPU-only with 5x less energy

d-Matrix announced its Corsair inference accelerator platform entered full production on June 9, with volume shipments beginning to priority hyperscalers, neoclouds, and frontier AI labs. The SRAM-based chiplet accelerator, manufactured at TSMC's N6 process via Alchip Technologies, is designed specifically for the decode phase of inference workloads in heterogeneous compute clusters paired with GPUs. The company cites independent testing by Gimlet Labs showing paired Corsair + GPU setups reduce inference response times from approximately 24 seconds to under two seconds, roughly a 10x speedup versus GPU-only approaches.

Corsair bypasses the memory wall by integrating computation tightly with on-chip SRAM, avoiding the DRAM and high-bandwidth memory (HBM) supply constraints that plague competing architectures. Each PCIe card packs 4 GB of Performance Memory with 300 TB/s bandwidth, hitting peak compute of 4,800 TFLOPs for MXINT8 and 19,200 TFLOPs for MXINT4. d-Matrix positions Corsair as complementary to GPUs rather than a replacement, targeting latency-sensitive agentic AI applications including Claude Code, voice agents, and interactive coding assistants that demand rapid token generation.

The timing aligns with surging demand for disaggregated inference architectures as agentic workloads push GPU-only infrastructure to its limits. d-Matrix has secured multi-year supply and fabrication services; the company also acquired GigaIO's data center business in April, bringing rack-scale systems expertise that culminates in SquadRack, a production-ready reference design built with Arista, Broadcom, and Supermicro. Microsoft's M12 venture arm and Temasek are investors; the startup raised $275 million in Series C.

For infrastructure teams, Corsair entering volume production marks a shift in inference economics: heterogeneous clusters splitting prefill to GPUs and decode to specialized accelerators now have a production-validated, supply-predictable alternative to GPU-only scaling. The N6 process and SRAM architecture sidestep HBM allocation bottlenecks, offering operators a tactical differentiation point in latency-constrained agentic deployments.

Sources