## Core concepts (updated April 2026)
- OpenAI Responses API + shell/container (March 11, 2026): OpenAI equips the Responses API with a shell tool and a hosted container workspace (filesystem, optional structured storage like SQLite, restricted networking). Use the hosted container for long-running or stateful steps where artifacts or file outputs are required, and prefer the shell when you need broad OS-level tooling (grep, curl, awk, compilers).
- OpenAI Agents SDK (April 15, 2026): the Agents SDK introduces a model-native harness and native sandbox execution (the announcement's example installs it with `pip install "openai-agents>=0.14.0"`). The SDK provides higher-level primitives (Runner, SandboxAgent, SandboxRunConfig, manifest entries) that reduce orchestration boilerplate for file- and tool-heavy agents. Prefer Agents SDK primitives for production agents when they match your threat model and deployment constraints.
- Anthropic Programmatic Tool Calling (PTC): Anthropic supports programmatic tool calling, where Claude can write and execute code inside a code-execution sandbox. PTC lets the model orchestrate multiple tool calls inside the sandbox (reducing round trips and tokens) while surfacing explicit `tool_use` events that your orchestrator must fulfill with matching `tool_result` responses and the original `tool_use_id`.
  - Compatibility note: PTC requires a specific code execution tool version (e.g., `code_execution_20260120`) and is only available on compatible Claude models — check the official compatibility table for the exact model list before relying on it in production.
- Vercel AI SDK (v6 Beta): Vercel AI SDK v6 introduces agent abstractions (ToolLoopAgent/Agent interface), tool-execution approval flows (human-in-the-loop), and stabilized structured output generation. Use the v6 agent abstraction when you want built-in loop control, tool approval UI, and `inputSchema`/`outputSchema` wiring between model outputs and your front-end forms.
- Cross-provider abstraction points
  - Id pairing: OpenAI uses `function_call` / `function_call_output` with call ids; Anthropic uses `tool_use` / `tool_result` with `tool_use_id`. Preserve and round-trip these ids exactly in your orchestrator.
- Schema enforcement: Use Zod or JSON Schema to validate inputs/outputs. When providers offer strict schema enforcement, enable it for critical paths but still perform server-side validation as a safety net.
- Provider-run vs client-run tools: Provider-run tools execute on provider infrastructure and can have different retention/PII policies—document these differences and choose client-run for sensitive data unless explicit contractual guarantees exist.
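The id-pairing rule above can be sketched as two small helpers (hypothetical names; the message shapes follow each provider's documented format):

```typescript
// Build the provider-specific "tool result" message for a completed call.
// OpenAI Responses API pairs function_call_output with call_id;
// Anthropic pairs tool_result with tool_use_id.
type ToolOutcome = { id: string; output: string };

function openaiResult({ id, output }: ToolOutcome) {
  return { type: 'function_call_output', call_id: id, output };
}

function anthropicResult({ id, output }: ToolOutcome) {
  return {
    role: 'user',
    content: [{ type: 'tool_result', tool_use_id: id, content: output }],
  };
}
```

Keeping this mapping in one place makes it harder for an orchestrator refactor to silently drop or rewrite an id.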
Notes and best practices:
- Use the OpenAI shell tool and hosted container workspace when your agent needs files, a filesystem, or restricted network access. The container solves common problems like where to place intermediate files and how to keep prompts compact.
- Prefer the Agents SDK higher-level primitives (Runner, SandboxAgent, manifest entries) for production agents where available — they encapsulate sandboxing and harness patterns and reduce bespoke harness code.
### Step 4: Orchestrator loop — Anthropic (practical)
- When Claude emits `tool_use` blocks, execute each client-callable tool and return `tool_result` blocks that include the original `tool_use_id` so the model can continue.
- For programmatic tool calling (code execution), expect the model to generate code that calls tools inside the sandbox. Those internal calls are surfaced as `tool_use` events to your orchestrator; fulfill them and return matching `tool_result` objects.
- Implementation checklist:
  - Check the required code execution tool version (example: `code_execution_20260120`) and the model list that supports it before deploying PTC.
  - Validate and sanitize any inputs that will be interpolated into sandbox-executed code to reduce injection risk.
  - Treat PTC as a way to reduce tokens and latency for tightly coupled multi-invocation tasks, but add extra monitoring and runtime validation because correctness shifts into the sandboxed code path.
  - Return structured errors and let the model decide whether to retry, back off, or ask for clarification.
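A minimal sketch of the fulfilment step, assuming a `handlers` map keyed by tool name (the surrounding `messages.create` call, retries, and schema validation are elided):

```typescript
type ToolUse = { type: 'tool_use'; id: string; name: string; input: unknown };
type Handler = (input: unknown) => Promise<string>;

// Execute every tool_use block and build the tool_result blocks that go back
// in the next user turn, preserving each tool_use_id exactly.
async function fulfill(blocks: ToolUse[], handlers: Record<string, Handler>) {
  const results = [];
  for (const block of blocks) { // sequential on purpose: ordering may matter
    const handler = handlers[block.name];
    const content = handler
      ? await handler(block.input)
      // Structured error instead of throwing, so the model can retry or ask.
      : JSON.stringify({ error: `unknown tool: ${block.name}` });
    results.push({ type: 'tool_result', tool_use_id: block.id, content });
  }
  return results;
}
```

Running calls sequentially is the safe default; only parallelize calls you have verified are independent.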
## Examples and patterns
- OpenAI Responses API function-calling example:

```ts
import OpenAI from 'openai';

const client = new OpenAI();
const tools = [{ type: 'function', name: 'get_weather', description: 'Get current weather', parameters: { type: 'object', properties: { location: { type: 'string' }, units: { type: 'string', enum: ['celsius','fahrenheit'], default: 'fahrenheit' } }, required: ['location'] } }];
const res = await client.responses.create({ model: 'gpt-5.4', tools, input: [{ role: 'user', content: "What's the weather in Paris?" }] });
// If the response contains a function_call, validate and execute it, then send
// back the paired function_call_output referencing the same call id.
```

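Extracting the pending calls from a Responses output can live in a small pure helper. This is a sketch: the item shape follows the documented `function_call` output items, and `pendingCalls` is a hypothetical name:

```typescript
// Pull function_call items out of a Responses API output array so each can be
// validated and executed before the paired function_call_output is sent back.
type FunctionCallItem = { type: 'function_call'; call_id: string; name: string; arguments: string };
type OutputItem = FunctionCallItem | { type: string };

function pendingCalls(output: OutputItem[]) {
  return output
    .filter((item): item is FunctionCallItem => item.type === 'function_call')
    .map((item) => ({
      callId: item.call_id,
      name: item.name,
      // Parsed but unvalidated: run it through your schema validator next.
      args: JSON.parse(item.arguments) as Record<string, unknown>,
    }));
}
```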
- Anthropic programmatic tool calling (concrete notes):
  - Programmatic tool calling lets Claude execute code in a sandbox that runs multiple tool calls locally. The code execution feature must be enabled and compatible with the model version you choose.
  - Data retention for PTC follows the feature's retention policy and is not ZDR by default; verify compliance for sensitive workloads.
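Enabling PTC amounts to declaring the code-execution tool alongside the client tools it may call. A sketch under assumptions: the version string is the one this document cites, and the `allowed_callers` opt-in field should be verified against current Anthropic documentation:

```typescript
// Assumed shape: a client tool opted into programmatic calling so sandbox
// code can invoke it; check field names against the compatibility table.
const codeExecutionTool = { type: 'code_execution_20260120', name: 'code_execution' };

const weatherClientTool = {
  name: 'get_weather',
  description: 'Get current weather',
  input_schema: {
    type: 'object',
    properties: { location: { type: 'string' } },
    required: ['location'],
  },
  allowed_callers: ['code_execution_20260120'], // callable from sandbox code
};

const ptcTools = [codeExecutionTool, weatherClientTool];
```

Tools without `allowed_callers` would remain ordinary client-run tools, so you can opt tools into PTC selectively.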
- Vercel AI SDK v6 (ToolLoopAgent example):

```ts
import { ToolLoopAgent } from 'ai';
import { weatherTool } from '@/tool/weather';

export const weatherAgent = new ToolLoopAgent({
  model: 'anthropic/claude-sonnet-4.5',
  instructions: 'You are a helpful weather assistant.',
  tools: { weather: weatherTool },
});

const result = await weatherAgent.generate({ prompt: 'What is the weather in San Francisco?' });
```
- Parallel tool execution pattern:
- Identify independent tool calls (no shared side effects or data dependencies).