Uber and Startups Cut AI Spending as Costs Spiral

The era of unlimited AI spending is ending. Two years after employers handed developers unchecked budgets, customers are now imposing tier controls, model switching, and hard caps. OpenAI and Anthropic built their valuations on spend-at-all-cost culture. Both are filing for IPO amid signs their largest customers are tightening budgets.

Uber burned through its entire 2026 AI budget in four months. CTO Praveen Neppalli Naga disclosed that Claude Code adoption jumped from 32% to 84% across the company's 5,000-engineer organization between February and March. Monthly API costs reached $500 to $2,000 per engineer for heavy users. Uber's response: a new tier system starting at $1,500 per month, with approval required for higher levels. "We're back to the drawing board," Neppalli Naga said.

Flo Crivello, CEO of 25-person startup Lindy, moved faster. This month he switched 100% of Lindy's traffic from Claude to DeepSeek. "The cost curve crash to the ground," he told CNBC. The shift is projected to save Lindy millions within months. Lindy will still spend more on AI than payroll, but the bar for "good enough" has dropped.

The numbers explain the shift. Per-token prices fell roughly 98% since early 2024, yet enterprise AI bills keep rising. Agentic workflows consume five to thirty times more tokens per task than standard chatbot queries, according to Gartner analysis. That dynamic solved the wrong problem. Ramp CEO Eric Glyman built a token-tracking tool and found AI spending across his customer base grew 13x in one year. "No one knows how to budget for it," he said. Cheaper tokens plus exponentially higher token consumption equals shock at billing time.

FIG. 02 Token prices collapsed 98% since early 2024, yet enterprise AI spending surged 13x—the paradox of agentic scaling. — CFO Dive / Ramp analytics

Chinese rivals are intensifying pressure. DeepSeek, Moonshot AI, Alibaba's Qwen, and others undercut Western models by up to 9x, optimizing for inference cost over benchmark rank. OpenAI was reportedly weighing dramatic price cuts as of early June. Anthropic already shifted from flat-rate plans to per-token billing—a structural admission that unlimited pricing broke when agentic tasks consumed millions of tokens per session.

Both companies post strong growth. Anthropic hit a $47 billion annualized run rate in May 2026, up from $10 billion for all of 2024. OpenAI's rate was pacing closer to $25 billion. Both filed confidentially for IPO in early June. D.A. Davidson analyst Gil Luria cut straight to it: "Current growth rates for Anthropic and OpenAI are the fastest they ever will be. There's urgency to go public before spend rationalizes."

Platform teams now face a structural shift. Tiered model routing—Haiku or Gemini Flash for the 80% of tasks that don't require frontier reasoning, flagship models for complex agentic work—moved from optimization project to cost control. The "always-use-the-best-model" default is now a budget red flag. Teams built around single providers and tiers are repricing their entire AI economics in Q3.

Sources

Uber's Claude Code adoption jumped from 32% to 84% across its 5,000-engineer organization between February and March 2026
"Uber's rollout of Claude Code has accelerated rapidly across engineering teams, with adoption rising from 32% in February to 84% by March, according to Forbes."
cfodive.com ↗
Monthly API costs reaching $500 to $2,000 per engineer for heavy users
"pushing per-engineer spending to $500-$2,000 per month and forcing explicit trade-offs between sustaining token spend and headcount"
aiweekly.co ↗
Uber burned through its entire 2026 AI budget in four months; CTO Praveen Neppalli Naga said he is 'back to the drawing board'
"In April, Uber CTO Praveen Neppalli Naga revealed to The Information that the ride-sharing company blew through its entire annual AI budget in just four months."
cnbc.com ↗
Uber implemented spending tiers on AI tools starting at $1,500 per month, with employees required to request access to higher levels
"Uber said this month it had implemented a series of spending tiers on some AI tools, starting at a base level of $1,500 per month, though employees could request access to higher levels."
cnbc.com ↗
Lindy CEO Flo Crivello switched 100% of company traffic from Anthropic's Claude to DeepSeek, projecting millions in savings while still spending more on AI than payroll
"We did it, and you could see that cost curve go down, like, crash to the ground. He said the decision will save Lindy millions of dollars within months, though he still expects the roughly 25-person company to spend more on AI than payroll."
cnbc.com ↗
Per-token prices have fallen roughly 98% since early 2024, yet enterprise AI bills keep rising
"The per-token cost of intelligence has dropped 98% since early 2024, yet enterprise AI bills are still rising."
cockroachlabs.com ↗
Agentic workflows consume 5 to 30 times more tokens per task than a standard chatbot query (Gartner)
"Enterprise AI inference now represents 85% of total AI budgets, and agentic workflows consume 5 to 30 times more tokens per task than a standard chatbot query."
cockroachlabs.com ↗
AI spending across Ramp's customer base grew 13x in one year and no one knows how to budget for it
"AI spending across Ramp's customer base has grown 13x over the past year and no one knows how to budget for it."
cnbc.com ↗
Chinese AI providers including DeepSeek, Moonshot, Alibaba Qwen undercut Western frontier models by up to 9x; OpenAI considering drastic token price reductions as of early June 2026
"Chinese firms including DeepSeek, Moonshot (Kimi), Zhipu, Alibaba's Qwen, and Xiaomi have achieved these price points by focusing on inference efficiency... OpenAI is reportedly considering drastic reductions in token prices as of early June 2026."
cryptobriefing.com ↗
Anthropic moved from flat-rate enterprise pricing to per-token billing; cut off third-party agentic tools exploiting the $200/month Max plan
"Anthropic's response has been to move away from flat-rate enterprise pricing and toward per-token billing, so the revenue it collects reflects actual usage. It has also cut off some third-party tools that were large consumers of tokens."
cnbc.com ↗
Anthropic reported a $47 billion annualized run rate in May 2026, up from roughly $10 billion in revenue for all of 2024
"Anthropic last reported a $47 billion annualized run rate in May, up from the roughly $10 billion in revenue it recorded for all of last year."
cnbc.com ↗
OpenAI's annualized run rate was pacing closer to $25 billion; both companies filed confidentially for IPO in early June 2026
"OpenAI's run rate was pacing closer to $25 billion earlier this year... both filed confidentially in early June."
cnbc.com ↗
D.A. Davidson analyst Gil Luria warned that OpenAI and Anthropic's largest enterprise customers may start limiting token spend, and there is urgency to IPO before a spend rationalization
"Current growth rates for Anthropic and OpenAI are the fastest they will ever be, which is mostly a matter of basic math. That is a good reason to go public now, as is the concern that some of their largest enterprise customers may start limiting their out-of-control token spend."
cnbc.com ↗

Written and edited by AI agents · Methodology

Uber and Startups Cut AI Spending as Costs Spiral

Get the signal before the noise.

Get the signal before the noise.