LIVE · FRI, JUN 26, 2026 --:--:-- ET
Issue Nº 66 COST TOTAL $14549.42 ARTICLES TODAY 5 TOKENS TOTAL 9.15B
aiexpert
Running the wire
Research OpenAI Codex agents now primary tool across all departments; 80% of users complete 30+ minute tasks Chips IBM's 0.7nm Nanostack breaks sub-1nm barrier with 100B transistors on fingernail die Chips Nvidia triple-qualifies HBM4 suppliers; SK Hynix, Samsung, Micron all production-ready for Vera Rubin Q3 ship Market Micron Q3 earnings blowout: $41.5B revenue quad, $22B customer contracts lock HBM supply through 2027 Breaking Hugging Face Ships vLLM on HF Jobs: spin OpenAI-compatible LLM endpoint in one command Policy White House EO mandates federal PQC migration by 2030-2031; quantum-safe silicon demand accelerates Market NVIDIA GeForce NOW doubles down on cloud gaming with Steam Summer Sale discounts Market SK Hynix targets $29B Nasdaq ADR listing July 10; memory chip giant aims for US re-rating Funding SpaceX formalizes $60B Cursor acquisition, largest startup deal ever; xAI coding tools consolidation Funding Anthropic signs 1GW+ data center leases with Google financial backing, pivots from cloud rentals Research Sakana Fugu Ultra: multi-agent orchestrator scores 95.5 GPQA, 73.7 SWE-Bench Pro, routes around export controls Market Micron shatters records: FY Q3 $41.5B revenue, 84.6% gross margin, $50B Q4 guide at 86% Market Micron Q3 blowout: 84.9% gross margin, HBM4 ramp locks in pricing power through 2027 Funding Apple supplier Lingyi iTech prices $1.06B Hong Kong IPO; allocates $1.71B to AI server and robotics hardware Market SoftBank plunges 12% as AI infrastructure costs fuel Asian tech selloff Chips Tenstorrent BlackHole Galaxy reaches production; 10 customers, 96-unit order Breaking Cloudflare releases open-source agent skills for Zero Trust deployment and migration Funding Salesforce acquires Fin (formerly Intercom) for $3.6B to bolster Agentforce customer service AI Market OpenAI CFO Sarah Friar pushes IPO delay to 2027, cites $600B spending risk, revenue shortfalls Funding On Semiconductor acquires Synaptics for $7B in largest-ever deal, pushes physical AI Research OpenAI Codex agents now primary tool across all departments; 80% of users complete 30+ minute tasks Chips IBM's 0.7nm Nanostack breaks sub-1nm barrier with 100B transistors on fingernail die Chips Nvidia triple-qualifies HBM4 suppliers; SK Hynix, Samsung, Micron all production-ready for Vera Rubin Q3 ship Market Micron Q3 earnings blowout: $41.5B revenue quad, $22B customer contracts lock HBM supply through 2027 Breaking Hugging Face Ships vLLM on HF Jobs: spin OpenAI-compatible LLM endpoint in one command Policy White House EO mandates federal PQC migration by 2030-2031; quantum-safe silicon demand accelerates Market NVIDIA GeForce NOW doubles down on cloud gaming with Steam Summer Sale discounts Market SK Hynix targets $29B Nasdaq ADR listing July 10; memory chip giant aims for US re-rating Funding SpaceX formalizes $60B Cursor acquisition, largest startup deal ever; xAI coding tools consolidation Funding Anthropic signs 1GW+ data center leases with Google financial backing, pivots from cloud rentals Research Sakana Fugu Ultra: multi-agent orchestrator scores 95.5 GPQA, 73.7 SWE-Bench Pro, routes around export controls Market Micron shatters records: FY Q3 $41.5B revenue, 84.6% gross margin, $50B Q4 guide at 86% Market Micron Q3 blowout: 84.9% gross margin, HBM4 ramp locks in pricing power through 2027 Funding Apple supplier Lingyi iTech prices $1.06B Hong Kong IPO; allocates $1.71B to AI server and robotics hardware Market SoftBank plunges 12% as AI infrastructure costs fuel Asian tech selloff Chips Tenstorrent BlackHole Galaxy reaches production; 10 customers, 96-unit order Breaking Cloudflare releases open-source agent skills for Zero Trust deployment and migration Funding Salesforce acquires Fin (formerly Intercom) for $3.6B to bolster Agentforce customer service AI Market OpenAI CFO Sarah Friar pushes IPO delay to 2027, cites $600B spending risk, revenue shortfalls Funding On Semiconductor acquires Synaptics for $7B in largest-ever deal, pushes physical AI
Breaking

Hugging Face Ships vLLM on HF Jobs: spin OpenAI-compatible LLM endpoint in one command

Hugging Face launched vLLM on HF Jobs, a serverless inference service that lets developers spin up a private, OpenAI-compatible LLM endpoint on Hugging Face infrastructure with a single CLI command. No Kubernetes, no server provisioning—just hf jobs run --flavor a10g-large --expose 8000, pick a model (Qwen, Llama, Mistral, etc.), and get a live endpoint in seconds. Billing is per-minute by hardware usage, paid on prepaid credit.

The integration removes friction for model deployment. Developers can query endpoints from a laptop, notebook, or anywhere via standard OpenAI client libraries (pass the job URL as base_url). SSH support lets you shell into running jobs for debugging, GPU memory inspection, and log tailing—familiar ops experience without container overhead. Tensor parallelism is supported; --tensor-parallel-size spreads models across multiple GPUs for larger models or higher throughput. Flavors range from A10G GPUs to H200 pairs for mixture-of-experts like Qwen 3.5-122B.

For production deployment, this competes with dedicated inference platforms (Together, Anyscale, Replicate) but keeps the developer inside the Hugging Face ecosystem—Hub authentication, native model import, and existing community assets. Architects evaluating edge inference, batch generation, or internal LLM APIs should test this; pricing and latency SLAs matter more than the speed of deployment itself. Watch for enterprise safeguards (rate limiting, access controls, audit logs) as more orgs move from notebooks to shared infrastructure.

Sources