FRI, JUN 26, 2026
Issue Nº 66
aiexpert
Breaking

Google launches Gemini 3.5 Flash: outperforms Pro tier on coding, 40% cheaper, 4x faster

Google released Gemini 3.5 Flash on May 19, 2026, at Google I/O and made it the default model across the Gemini app (900M MAU), Google Search AI Mode (1B+ MAU), Antigravity 2.0, and the Gemini API. The Flash-tier release inverts Google's historical hierarchy: 3.5 Flash outperforms the flagship Gemini 3.1 Pro on coding and agentic benchmarks (Terminal-Bench 2.1: 76.2% vs. 70.3%; MCP Atlas: 83.6%; GDPval-AA: 1656 Elo) while generating output tokens 4x faster and costing 40% less, at $1.50/$9.00 per million input/output tokens (vs. 3.1 Pro's $2.50/$15.00). The model supports a 1M-token context window and is the strongest agentic model Google has shipped to date.

The release signals a shift in frontier AI strategy: rather than leading with Pro capability and letting Flash trail, Google optimized the Flash family for speed and cost while maintaining frontier-grade reasoning. Gemini 3.5 Flash beats GPT-5.5 on MCP Atlas (tool-use reliability) and matches it on coding speed. It regresses slightly on pure-reasoning benchmarks (Humanity's Last Exam, ARC-AGI-2) compared to 3.1 Pro, reflecting a design choice to prioritize real-world agentic tasks over abstract reasoning. Gemini 3.5 Pro is still in internal testing and is slated to roll out "next month" (targeting June 2026). Google disclosed that 3.2 quadrillion tokens per month flow through its systems, up 7x year-over-year, and that Antigravity 2.0 runs 3.5 Flash at 12x the speed of the public API through local optimization.

Pricing is aggressive: the $1.50 input rate is the lowest for any frontier model, which makes high-volume agentic pipelines materially cheaper. Cached input tokens cost $0.15 per million (a 90% discount). For teams running document extraction, code generation, or agent-based workflows, the unit economics vs. Claude Opus 4.7 ($5/$25) or GPT-5.5 ($4-8/$12-24) are now decisively in Google's favor at scale. Google also introduced Gemini Spark (a persistent personal agent in the Gemini app, available to AI Ultra subscribers only at $100/month) and announced Gemini Omni, a video-generation model starting with image and audio inputs.
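To make the unit economics concrete, here is a minimal cost sketch in Python using only the list prices quoted above. The `Pricing` class, the `monthly_cost` helper, and the token volumes are illustrative assumptions, not part of any vendor SDK. Cached pricing is modeled only for 3.5 Flash because that is the only cached rate quoted here.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pricing:
    """List prices in USD per million tokens (hypothetical helper type)."""
    input_per_m: float
    output_per_m: float
    cached_input_per_m: Optional[float] = None  # None: no cached rate quoted


# Rates as quoted in the article text.
GEMINI_35_FLASH = Pricing(1.50, 9.00, cached_input_per_m=0.15)
GEMINI_31_PRO = Pricing(2.50, 15.00)
CLAUDE_OPUS_47 = Pricing(5.00, 25.00)


def monthly_cost(p: Pricing, input_tokens: int, output_tokens: int,
                 cached_fraction: float = 0.0) -> float:
    """Estimate spend in USD; cached_fraction is the share of input served from cache."""
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    # Fall back to the normal input rate when no cached rate is known.
    cached_rate = p.cached_input_per_m if p.cached_input_per_m is not None else p.input_per_m
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    total = fresh * p.input_per_m + cached * cached_rate + output_tokens * p.output_per_m
    return total / 1_000_000


# Assumed workload: 2B input and 200M output tokens per month.
IN_TOK, OUT_TOK = 2_000_000_000, 200_000_000
for label, pricing in [("3.5 Flash", GEMINI_35_FLASH),
                       ("3.1 Pro", GEMINI_31_PRO),
                       ("Opus 4.7", CLAUDE_OPUS_47)]:
    print(f"{label}: ${monthly_cost(pricing, IN_TOK, OUT_TOK):,.2f}")
```

For that assumed workload the quoted rates work out to $4,800 on 3.5 Flash, $8,000 on 3.1 Pro, and $15,000 on Claude Opus 4.7. If half of the Flash input is served from cache, the Flash figure drops to $3,450.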

For practitioners, the 3.5 Flash launch reshuffles the cost-per-inference calculus: teams running agentic or coding workloads that previously accepted a "better costs money" trade-off can now evaluate Google first without sacrificing quality. The default-model position (500M+ daily Search users) means developers building against the Gemini API will benchmark against a model that already reaches billions via Search, and that distribution advantage compounds adoption. Watch for model-selection logic in LLM routers and orchestration layers to shift toward low-latency, cost-efficient models. Enterprises should audit which workloads are currently over-provisioned on flagship models and can drop to 3.5 Flash without quality loss.
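The routing shift described above can be sketched as a cheapest-eligible-model rule. This is a toy illustration, not how any particular router works: the `ModelOption` type, the `pick_model` function, the model-name strings, and the capability tags are all hypothetical. The only inputs taken from the article are the list prices and the note that 3.5 Flash trails 3.1 Pro on pure reasoning.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class ModelOption:
    name: str                 # label only; not a verified API model ID
    input_per_m: float        # USD per million input tokens
    output_per_m: float       # USD per million output tokens
    suited_for: FrozenSet[str]  # assumed capability tags


CANDIDATES: List[ModelOption] = [
    ModelOption("gemini-3.5-flash", 1.50, 9.00,
                frozenset({"coding", "agentic", "extraction"})),
    ModelOption("gemini-3.1-pro", 2.50, 15.00,
                frozenset({"coding", "agentic", "extraction", "abstract_reasoning"})),
]


def estimated_cost(m: ModelOption, input_tokens: int, output_tokens: int) -> float:
    """Per-request cost in USD at list price."""
    return (input_tokens * m.input_per_m + output_tokens * m.output_per_m) / 1_000_000


def pick_model(task_kind: str, input_tokens: int, output_tokens: int,
               candidates: List[ModelOption] = CANDIDATES) -> ModelOption:
    """Return the cheapest candidate tagged as suitable for task_kind."""
    eligible = [m for m in candidates if task_kind in m.suited_for]
    if not eligible:
        raise ValueError(f"no candidate is tagged for task kind {task_kind!r}")
    return min(eligible, key=lambda m: estimated_cost(m, input_tokens, output_tokens))


print(pick_model("coding", 20_000, 2_000).name)
print(pick_model("abstract_reasoning", 20_000, 2_000).name)
```

Under these assumed tags, coding requests route to the Flash entry and abstract-reasoning requests fall through to the Pro entry. An audit for over-provisioning amounts to checking, per workload, whether the cheaper tag assignment holds up on the team's own evaluations.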

Sources