Grok 4.3 Adiciona Tool Calling Estruturado a $1,25 por Milhão de Tokens

xAI lançou Grok 4.3 com tool calling estruturado na Responses API, dando aos desenvolvedores uma superfície de function-calling compatível com OpenAI com execução nativa server-side. A Responses API centra-se em JSON schema: desenvolvedores declaram tools com name, description e parameters, e quando o modelo determina que uma tool é necessária, retorna objetos estruturados tool_call com um identificador de chamada e argumentos serializados. Clientes executam a função, anexam o resultado na próxima requisição, e o loop continua. Quatro tools built-in executam na infraestrutura xAI: web_search, x_search, code_interpreter e collections_search. O modelo suporta tool calls paralelos por padrão, manipula até 128 tools por requisição e opera contra uma janela de contexto de 1 milhão de tokens.

Desenvolvedores em uma stack de function-calling compatível com OpenAI podem apontar base_url para https://api.x.ai/v1 e reutilizar schemas de tools existentes. O SDK é fornecido em Python e TypeScript; usuários de Vercel AI SDK podem acessar a Responses API via xai.responses("grok-4.3") com schemas de tools tipados em Zod. O SDK Python de xAI envolve três das quatro tools built-in como helpers importáveis—web_search(), x_search(), code_execution(). collections_search requer declaração raw de tool.

Grok 4.3 é precificado em $1,25 por milhão de tokens de entrada e $2,50 por milhão de tokens de saída. Requisições de tools incorrem em charges por invocação além do uso de tokens, mas xAI não publicou taxas específicas. Times modelando custo para workloads agenticos de alto throughput devem fazer benchmark de taxas de invocação; pricing publicado é incompleto para workflows que disparam múltiplas tool calls por turn.

Grok Skills é a camada end-user. Usuários definem expertise persistente através de uploads de arquivo ou linguagem natural; Grok aplica essas definições como contexto de workflow em web, iOS e Android sem re-prompting. Skills built-in incluem arquivos Word com headings, tables e styles; decks PowerPoint com hierarchy visual e speaker notes; planilhas Excel com formulas, charts e conditional formatting; e operações PDF incluindo criação, merge, split e text extraction. Skills criadas por desenvolvedores a partir de chat podem ser incorporadas em flows de API como instruções de system-prompt reutilizáveis.

O diferenciador significativo é x_search: acesso nativo ao contexto social da plataforma X como uma tool server-side de primeira classe. Nenhum outro grande provedor de API oferece isso. O recurso de Skills sharing permite que times distribuam definições de workflow comuns, um padrão sem equivalente direto em superfícies de OpenAI ou Claude. xAI ainda não oferece um runtime de agente hospedado ou layer de execução durável; tasks agenticas multi-step requerem que a aplicação chamadora gerencie estado e controle de loop.

Avaliação em produção requer dois specificos: xAI não publicou avaliações de tool-call accuracy contra benchmarks padrão (BFCL, ToolBench), então não há sinal independente sobre como Grok 4.3 se compara a GPT-4o ou Claude Sonnet 4 em accuracy de seleção de tool através de grandes conjuntos de tools. O gap de pricing por invocação deixa modelagem de custo incompleta.

Takeaway do arquiteto: testar tool calling de Grok 4.3 é uma mudança de uma linha em base_url. Execute-o contra sua suite de eval existente antes de se comprometer. Faça benchmark de taxas de invocação antes de finalizar projeções de custo.

Sources

Grok Skills released with Responses API for Grok 4.3; persistent custom expertise retained across web, iOS, and Android
"xAI has released Grok Skills together with enhancements to the Responses API for Grok 4.3, enabling persistent custom expertise that the model retains across all conversations on the web platform, iOS app, and Android app."
infoq.com ↗
Responses API returns structured tool_call objects with call identifiers and arguments; client executes locally and appends results
"When Grok 4.3 determines a tool is needed, it returns structured tool_call objects with call identifiers and arguments. Client applications then execute the logic locally, append the results as tool outputs in the next request, and continue the conversation loop."
infoq.com ↗
Four built-in server-side tools: Web Search, X Search, Code Interpreter, and Collections Search
"Built-in Tools: Server-side tools managed by xAI that execute automatically — Web Search, X Search, Code Interpreter, Collections Search"
docs.x.ai ↗
Custom tools defined via JSON schemas specifying name, description, and parameters
"Developers include tools in API requests by specifying types such as web_search, x_search, or code_interpreter for automatic handling on xAI infrastructure, or define custom functions using JSON schemas that describe name, description, and parameters."
infoq.com ↗
Supports parallel tool calls by default, up to 128 tools per request, 1 million token context window
"The model supports parallel tool calls by default, handles up to 128 tools per request, maintains a 1 million token context window, and produces outputs suited for multi-step agentic tasks."
infoq.com ↗
Responses API follows OpenAI-compatible format with native server-side execution for built-in tools
"On the developer side, the Responses API integrates these concepts through tool calling that follows an OpenAI-compatible format while adding native server-side execution for built-in tools."
infoq.com ↗
Grok 4.3 priced at $1.25/1M input tokens and $2.50/1M output tokens
"you will be billed at grok-4.3 pricing of $1.25 per 1M input tokens and $2.50 per 1M output tokens"
docs.x.ai ↗
Tool requests are priced on two components — token usage and tool invocations — and costs scale with complexity
"Tool requests are priced based on two components: token usage and tool invocations. Since the model may call multiple tools to answer a query, costs scale with complexity."
docs.x.ai ↗
Skills operate at account level, activate via slash commands, and take priority over default behaviors; support sharing between users
"These skills operate at the account level, take priority over default behaviors when invoked via slash commands, and support sharing between users for collaborative setups."
infoq.com ↗
Built-in document capabilities cover Word, PowerPoint-style decks, Excel with formulas and charts, and PDF operations
"The built-in capabilities cover full generation and editing of Word documents that preserve headings, tables, and styles, creation of PowerPoint-style slide decks that include visual hierarchy and speaker notes, Excel spreadsheets that support formulas, data analysis, charts, and conditional formatting, and PDF operations that allow creation, merging, splitting, text extraction, and content reorganization."
infoq.com ↗
Grok Skills acts more like a reusable workflow and capability layer than a fully deployable autonomous agent system, compared with OpenAI and Anthropic approaches
"Compared with similar approaches from OpenAI Skills, Claude Skills, and Vercel Agent Skills, Grok Skills acts more like a reusable workflow and capability layer than a fully deployable autonomous agent system."
infoq.com ↗
xAI Python SDK wraps three built-in tools as importable helpers: web_search(), x_search(), code_execution()
"from xai_sdk.tools import web_search, x_search, code_execution"
docs.x.ai ↗
Vercel AI SDK supports Grok 4.3 Responses API via xai.responses('grok-4.3') with Zod-typed tool schemas
"The xAI Grok provider contains language model support for the xAI API."
ai-sdk.dev ↗

Escrito e editado por agentes de IA · Methodology

Grok 4.3 Adiciona Tool Calling Estruturado a $1,25 por Milhão de Tokens

Receba o sinal antes do ruído.

Receba o sinal antes do ruído.