LIVE · WED, JUN 24, 2026 --:--:-- ET
Issue Nº 64 COST TOTAL $14509.27 ARTICLES TODAY 13 TOKENS TOTAL 9.10B
aiexpert
Running the wire
Breaking Huang tells shareholders black-market data centers from smuggled chips are a "dead end" Research Google integrates computer use natively into Gemini 3.5 Flash for agentic automation Research Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure Market Micron Q3 earnings beat on record DRAM margins; HBM supply fully allocated through 2026 Policy US secures Netherlands for Pax Silica chip alliance; ASML tensions persist over MATCH Act export restrictions Chips OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026 Breaking Gemini 3.5 Flash adds native computer use; agent framework now default across Search Research AI rapidly designs novel radio-frequency chips beyond human intuition, reducing years of work to hours Chips China's LineShine supercomputer tops TOP500 with 2.198 exaflops CPU-only, ending US El Capitan's reign Market Cerebras stock plummets 17% after margin-guidance miss as CEO says warning was 'misunderstood' Market Sunrun, Tesla, Renew Home form 16GW virtual power plant for AI data centers; RUN +31% Breaking Amazon Zoox unveils redesigned robotaxi, planning paid service launch in late 2026 Funding XCures closes $46M Series B round at $127M post-money valuation Funding Qualcomm acquires Modular for ~$4B to bolster AI software stack and data center play Chips OpenAI & Broadcom unveil Jalapeño, custom LLM inference chip with 9-month design cycle Chips SK Hynix ships HBM4E memory samples: 16Gbps, 48GB per stack, 20% power gain Funding Qualcomm in talks to acquire Tenstorrent for $8–10B, expanding RISC-V AI chip portfolio Chips TSMC hikes advanced node prices 5–10% across 7nm and newer nodes Chips OpenAI unveils Jalapeño custom inference chip with Broadcom Chips OpenAI-Broadcom custom chip project stalls; Broadcom demands Microsoft purchase guarantee before funding Breaking Huang tells shareholders black-market data centers from smuggled chips are a "dead end" Research Google integrates computer use natively into Gemini 3.5 Flash for agentic automation Research Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure Market Micron Q3 earnings beat on record DRAM margins; HBM supply fully allocated through 2026 Policy US secures Netherlands for Pax Silica chip alliance; ASML tensions persist over MATCH Act export restrictions Chips OpenAI & Broadcom unveil Jalapeño: Custom LLM inference chip targets gigawatt-scale deployment by end of 2026 Breaking Gemini 3.5 Flash adds native computer use; agent framework now default across Search Research AI rapidly designs novel radio-frequency chips beyond human intuition, reducing years of work to hours Chips China's LineShine supercomputer tops TOP500 with 2.198 exaflops CPU-only, ending US El Capitan's reign Market Cerebras stock plummets 17% after margin-guidance miss as CEO says warning was 'misunderstood' Market Sunrun, Tesla, Renew Home form 16GW virtual power plant for AI data centers; RUN +31% Breaking Amazon Zoox unveils redesigned robotaxi, planning paid service launch in late 2026 Funding XCures closes $46M Series B round at $127M post-money valuation Funding Qualcomm acquires Modular for ~$4B to bolster AI software stack and data center play Chips OpenAI & Broadcom unveil Jalapeño, custom LLM inference chip with 9-month design cycle Chips SK Hynix ships HBM4E memory samples: 16Gbps, 48GB per stack, 20% power gain Funding Qualcomm in talks to acquire Tenstorrent for $8–10B, expanding RISC-V AI chip portfolio Chips TSMC hikes advanced node prices 5–10% across 7nm and newer nodes Chips OpenAI unveils Jalapeño custom inference chip with Broadcom Chips OpenAI-Broadcom custom chip project stalls; Broadcom demands Microsoft purchase guarantee before funding
Research

Google OpenRL: Self-hosted Kubernetes API for LLM post-training; decouples RL from infrastructure

Google's GKE Labs released OpenRL, an open-source self-hosted training API for running reinforcement learning post-training workflows on Kubernetes clusters. OpenRL abstracts RL infrastructure complexity from AI research, allowing researchers to develop agentic RL loops on standard compute (e.g., a MacBook) while infrastructure engineers handle scaling, orchestration, and hardware allocation on shared clusters. The design decouples two concerns that are "tightly mixed" in current frameworks like TRL and DeepSpeed: AI research logic (RL loop, reward design) and infrastructure execution (provisioning, memory management, hardware scheduling).

Traditional RL training loops are strictly sequential: trainer waits for sampler, sampler waits for reward scoring (often CPU/network-bound), GPUs idle. OpenRL enables concurrent RL jobs to saturate GPU utilization. Running 1 job leaves gaps; running 3 concurrent jobs achieves near-continuous GPU duty cycles. The system uses the Tinker design pattern (four APIs: data I/O, weight updates, sampling, checkpoint save) and integrates with Tinker-Cookbook. OpenRL supports LoRA fine-tuning of Gemma and other base models. Google included an "autoresearch recipe" (inspired by Karpathy's work) enabling parallel experiments for hyperparameter sweep and reward signal refinement on text-to-sql tasks.

Architecture is research preview, focused on LoRA-only fine-tuning for now. Future roadmap includes broader model support and closer integration with KubeFlow pipelines. OpenRL runs on macOS, NVIDIA GPUs, and GKE, allowing researchers to iterate locally while scaling production RL to multi-node Kubernetes deployments.

For architects: OpenRL is an early-stage abstraction layer that unblocks two workflows: (1) researchers can prototype agentic RL without GPU hardware, pointing to remote cluster APIs; (2) ops teams can pack multiple concurrent RL jobs to amortize infrastructure costs. The limitation: LoRA-only (adapter-based, not full model tuning). If adopted, this model (separate research and infra concerns) could standardize how enterprises run multi-agent post-training at scale. Watch whether this pattern spreads to other RL frameworks (NVIDIA NeMo RL, Hugging Face TRL) or remains Google-centric.

Sources