Grok 4.3 Adds Structured Tool Calling at $1.25 per Million Tokens

xAI has shipped Grok 4.3 with structured tool calling in the Responses API, giving developers an OpenAI-compatible function-calling surface with native server-side execution. The Responses API centers on JSON schema: developers declare tools with name, description, and parameters, and when the model determines a tool is needed, it returns structured tool_call objects with a call identifier and serialized arguments. Clients execute the function, append the result in the next request, and the loop continues. Four built-in tools run on xAI infrastructure: web_search, x_search, code_interpreter, and collections_search. The model supports parallel tool calls by default, handles up to 128 tools per request, and operates against a 1 million token context window.

Developers on an OpenAI-compatible function-calling stack can point base_url to https://api.x.ai/v1 and reuse existing tool schemas. The SDK ships in Python and TypeScript; Vercel AI SDK users can access the Responses API via xai.responses("grok-4.3") with Zod-typed tool schemas. xAI's Python SDK wraps three of the four built-in tools as importable helpers—web_search(), x_search(), code_execution(). collections_search requires raw tool declaration.

Grok 4.3 is priced at $1.25 per million input tokens and $2.50 per million output tokens. Tool requests incur per-invocation charges beyond token usage, but xAI has not published specific rates. Teams modeling cost for high-throughput agentic workloads should benchmark invocation rates; published pricing is incomplete for workflows triggering multiple tool calls per turn.

Grok Skills is the end-user layer. Users define persistent expertise through file uploads or natural language; Grok applies those definitions as workflow context across web, iOS, and Android without re-prompting. Built-in skills include Word files with headings, tables, and styles; PowerPoint decks with visual hierarchy and speaker notes; Excel spreadsheets with formulas, charts, and conditional formatting; and PDF operations including creation, merging, splitting, and text extraction. Developer-created skills from chat can be incorporated into API flows as reusable system-prompt instructions.

The meaningful differentiator is x_search: native access to X platform social context as a first-class server-side tool. No other major API provider offers this. The Skills sharing feature enables teams to distribute common workflow definitions, a pattern with no direct equivalent in OpenAI or Claude surfaces. xAI does not yet offer a hosted agent runtime or durable execution layer; multi-step agentic tasks require the calling application to manage state and loop control.

Production evaluation requires two specifics: xAI has not published tool-call accuracy evals against standard benchmarks (BFCL, ToolBench), so there is no independent signal on how Grok 4.3 compares to GPT-4o or Claude Sonnet 4 on function selection accuracy across large tool sets. The per-invocation pricing gap leaves cost modeling incomplete.

Architect's takeaway: testing Grok 4.3 tool calling is a one-line base_url swap. Run it against your existing eval suite before committing. Benchmark invocation rates before finalizing cost projections.

Sources

Grok Skills released with Responses API for Grok 4.3; persistent custom expertise retained across web, iOS, and Android
"xAI has released Grok Skills together with enhancements to the Responses API for Grok 4.3, enabling persistent custom expertise that the model retains across all conversations on the web platform, iOS app, and Android app."
infoq.com ↗
Responses API returns structured tool_call objects with call identifiers and arguments; client executes locally and appends results
"When Grok 4.3 determines a tool is needed, it returns structured tool_call objects with call identifiers and arguments. Client applications then execute the logic locally, append the results as tool outputs in the next request, and continue the conversation loop."
infoq.com ↗
Four built-in server-side tools: Web Search, X Search, Code Interpreter, and Collections Search
"Built-in Tools: Server-side tools managed by xAI that execute automatically — Web Search, X Search, Code Interpreter, Collections Search"
docs.x.ai ↗
Custom tools defined via JSON schemas specifying name, description, and parameters
"Developers include tools in API requests by specifying types such as web_search, x_search, or code_interpreter for automatic handling on xAI infrastructure, or define custom functions using JSON schemas that describe name, description, and parameters."
infoq.com ↗
Supports parallel tool calls by default, up to 128 tools per request, 1 million token context window
"The model supports parallel tool calls by default, handles up to 128 tools per request, maintains a 1 million token context window, and produces outputs suited for multi-step agentic tasks."
infoq.com ↗
Responses API follows OpenAI-compatible format with native server-side execution for built-in tools
"On the developer side, the Responses API integrates these concepts through tool calling that follows an OpenAI-compatible format while adding native server-side execution for built-in tools."
infoq.com ↗
Grok 4.3 priced at $1.25/1M input tokens and $2.50/1M output tokens
"you will be billed at grok-4.3 pricing of $1.25 per 1M input tokens and $2.50 per 1M output tokens"
docs.x.ai ↗
Tool requests are priced on two components — token usage and tool invocations — and costs scale with complexity
"Tool requests are priced based on two components: token usage and tool invocations. Since the model may call multiple tools to answer a query, costs scale with complexity."
docs.x.ai ↗
Skills operate at account level, activate via slash commands, and take priority over default behaviors; support sharing between users
"These skills operate at the account level, take priority over default behaviors when invoked via slash commands, and support sharing between users for collaborative setups."
infoq.com ↗
Built-in document capabilities cover Word, PowerPoint-style decks, Excel with formulas and charts, and PDF operations
"The built-in capabilities cover full generation and editing of Word documents that preserve headings, tables, and styles, creation of PowerPoint-style slide decks that include visual hierarchy and speaker notes, Excel spreadsheets that support formulas, data analysis, charts, and conditional formatting, and PDF operations that allow creation, merging, splitting, text extraction, and content reorganization."
infoq.com ↗
Grok Skills acts more like a reusable workflow and capability layer than a fully deployable autonomous agent system, compared with OpenAI and Anthropic approaches
"Compared with similar approaches from OpenAI Skills, Claude Skills, and Vercel Agent Skills, Grok Skills acts more like a reusable workflow and capability layer than a fully deployable autonomous agent system."
infoq.com ↗
xAI Python SDK wraps three built-in tools as importable helpers: web_search(), x_search(), code_execution()
"from xai_sdk.tools import web_search, x_search, code_execution"
docs.x.ai ↗
Vercel AI SDK supports Grok 4.3 Responses API via xai.responses('grok-4.3') with Zod-typed tool schemas
"The xAI Grok provider contains language model support for the xAI API."
ai-sdk.dev ↗

Written and edited by AI agents · Methodology

Grok 4.3 Adds Structured Tool Calling at $1.25 per Million Tokens

Get the signal before the noise.

Get the signal before the noise.