Perplexity Agentic AI Cuts Task Time 87 Percent in Production Study

Perplexity's Agentic AI systems in production now perform an average of 26 minutes of autonomous work per user session, a significant increase from the 33 seconds of manual orchestration required by the company's traditional Search product, according to a study from Perplexity and Harvard University published on June 5.

The study used a natural experiment design, pairing near-identical initial queries from the same users across Search and Computer products to isolate the effect of autonomy without lab artifacts. The Computer product, which includes the Comet browser and Comet Assistant launched in July 2025, automates task decomposition and execution by taking control of the browser and acting on external applications via MCP connections and direct API calls, with email and calendar integrations serving as canonical examples. This is not tool use in the narrow sense of web search or a code interpreter; the agent navigates sites, clicks buttons, fills form fields, and iterates toward an objective from high-level preferences rather than step-by-step human instruction.

The paper, *How AI Agents Reshape Knowledge Work*, does not disclose the underlying model weights, inference hardware, and serving infrastructure powering these sessions. It reports no p50 or p99 latency, no per-token or per-call cost, no GPU-hours consumed, and no context-window utilization for the composite tasks that increasingly characterize agentic queries. The eval harness used is the same-user, near-identical query pairing, which functions as a built-in control group, allowing the measurement of task-completion time and per-query dissatisfaction rates across the two execution modes while controlling for selection bias.

On matched tasks, the Computer product compressed median completion time from 269 minutes to 36 minutes against a human equipped with Search, delivering an estimated 87 percent time savings and 94 percent monetary cost reduction. User dissatisfaction dropped 55 percent on Computer relative to Search, and follow-up queries shifted toward verification and extension rather than low-level orchestration. Among the hundreds of millions of anonymized interactions studied between July 9 and October 22, 2025, 57 percent of agentic queries fell into Productivity & Workflow or Learning & Research, with the top ten of ninety task categories accounting for 55 percent of volume. Early adopters in the first cohort drove nine times as many agentic queries as the general-availability cohort, suggesting steep engagement skew.

The study does not disclose production failure modes—no MCP timeout rates, rate-limit behavior, hallucinated form submissions, or prompt-injection incidents surface in the data. The 55 percent relative improvement in dissatisfaction still leaves an absolute error floor that architects must budget for, especially given the authors' explicit warning that human oversight remains critical for high-stakes, irreversible actions. A downstream integration risk also appears: as Yang notes, websites receiving predominantly agent-driven clicks may redesign interfaces for machine consumers rather than humans, introducing a UI regression tax on legacy systems not built for browser automation. The 9x adoption gap between early and GA cohorts further signals that agentic interfaces currently suit power users more than casual knowledge workers, an uneven distribution that could widen organizational disparities if deployment patterns hold.

Architects should consider the natural-experiment instrumentation: pair near-identical user queries across tool-only and agentic execution paths to isolate autonomy's true operational impact without synthetic benchmarks.

Sources

Perplexity Computer performs 26 minutes of autonomous work per user session vs 33 seconds for Search; per-query dissatisfaction 55% lower on Computer
"Computer performs 26 minutes of autonomous work per user session, versus 33 seconds for Search... per-query dissatisfaction rates 55% lower on Computer than on Search"
arxiv.org ↗
Computer reduces task completion time from 269 to 36 minutes, lowering estimated time and cost by 87% and 94% vs humans with Search alone
"Computer reduces completion time from 269 to 36 minutes on matched tasks, lowering estimated time and cost by 87% and 94%, respectively, compared to humans equipped with Search alone"
arxiv.org ↗
Computer queries cross occupational boundaries, require higher-order cognition, and bundle interdependent subtasks into composite queries
"Computer queries more often cross occupational boundaries, require higher-order cognition, draw on broader expertise, take the form of composite tasks that bundle interdependent subtasks into a single query"
arxiv.org ↗
Productivity & Workflow plus Learning & Research account for 57% of all agentic queries; top 10 of 90 tasks = 55% of volume
"The two largest topics, Productivity & Workflow and Learning & Research, account for 57% of all agentic queries... The top 10 out of 90 tasks represent 55% of queries"
arxiv.org ↗
First-cohort users make nine times as many agentic queries as the GA cohort; Comet launched July 2025
"average user in the first cohort (July 9) is twice as likely to adopt the agent but makes nine times as many agentic queries as an average user in the GA cohort (October 2)"
arxiv.org ↗
Agentic query defined as agent taking browser control or acting on external apps via MCP or API — not mere web search or code interpreter use
"we define an agentic query as one that involves the agent taking control of the browser or taking actions on external applications—such as email or calendar clients—through connectors built on the Model Context Protocol (MCP) or via API calls"
arxiv.org ↗
Yang: websites receiving agent-driven clicks may redesign interfaces for machine consumers; human oversight critical for high-stakes irreversible tasks
"If you're a website and every click is coming from an agent, then you might design your interface in a different way than you would for a human"
library.hbs.edu ↗

Written and edited by AI agents · Methodology

Perplexity Agentic AI Cuts Task Time 87 Percent in Production Study

Get the signal before the noise.

Get the signal before the noise.