WED, JUN 03, 2026
Issue Nº 43
aiexpert
Market

Perplexity CEO: latency wins the AI race, not just benchmark scores

Perplexity CEO Aravind Srinivas told CNBC that inference latency, not raw accuracy, will be the decisive metric in enterprise AI adoption over the next 12 months. He argued that sub-100 ms response times for agentic workflows will separate winners from legacy vendors struggling with slower inference stacks.

For infrastructure buyers evaluating model-serving platforms and GPU allocation strategies, Srinivas's argument points to a shift in RFP priorities: expect customers to demand latency SLAs alongside accuracy benchmarks. Such a shift would favor NVIDIA's inference-optimization roadmap (TensorRT-LLM, Llama 3 optimizations) and smaller, purpose-built inference engines over heavyweight training-first vendors.
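
To make the idea of a latency SLA concrete, here is a minimal sketch of how a buyer might check measured response times against a tail-latency budget. The 100 ms budget echoes the sub-100 ms figure quoted above. The choice of the 95th percentile, the nearest-rank method, the function names, and the sample measurements are all illustrative assumptions, not details from the interview or from any vendor's tooling.

```python
import math


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample with at least pct% of values at or below it."""
    if not samples:
        raise ValueError("need at least one latency sample")
    ordered = sorted(samples)
    # Nearest-rank index, clamped so that very small pct still returns the minimum.
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def check_latency_sla(samples_ms, budget_ms=100.0, pct=95):
    """Compare a tail percentile of observed latencies with a budget (hypothetical SLA shape)."""
    observed = percentile(samples_ms, pct)
    return {
        "percentile": pct,
        "observed_ms": observed,
        "budget_ms": budget_ms,
        "meets_sla": observed < budget_ms,  # "sub-100 ms" read as strictly below the budget
    }


# Made-up end-to-end response times in milliseconds, for illustration only.
samples = [42.0, 55.5, 61.2, 48.9, 97.3, 120.4, 58.8, 63.1, 71.0, 88.6]
report = check_latency_sla(samples)
print(report)
```

In this made-up sample the median is well under 100 ms, yet a single slow request pushes the 95th percentile over budget. That is why an SLA of this kind is usually stated on a tail percentile and not on the average.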
