Agentic AI token consumption hits 1,000x baseline, triggering cost pullback at Microsoft, Meta, Amazon
Agentic AI systems are consuming up to 1,000 times more tokens than standard models, creating a runaway cost crisis inside major tech companies. Employee 'tokenmaxxing'—running agentic workflows unsupervised—has backfired, forcing Microsoft, Meta, and Amazon to dial back agentic AI deployments and implement stricter token budgets.
The pullback signals a structural bottleneck: inference economics at massive scale don't yet support autonomous agent workflows. For infrastructure teams, this marks a reset in capex ROI expectations and a temporary shift toward optimizing latency and efficiency over raw throughput.