Operational guide for building and evaluating multi-agent orchestration (async subagents, Custom GPTs, sandbox management) with up-to-date references to LangChain Deep Agents v0.5 and OpenAI Custom GPTs.
Design and implement multi-agent systems that coordinate specialized AI agents to solve complex tasks through handoffs, tool routing, and shared state.
When to use
Building a system where one LLM call is insufficient (e.g. research → plan → execute → review)
Splitting work across specialized agents (code agent, search agent, reviewer agent)
Implementing human-in-the-loop workflows with approval gates
Creating pipelines where upstream agent output feeds downstream agent input
Routing deterministically between agents based on task classification
When NOT to use
A single system prompt with tools can handle the entire workflow
The task is a simple one-shot completion (summarize, translate, classify)
You only need function calling without inter-agent communication
The "agents" are really just prompt variants — use a single agent with conditional instructions instead
Core concepts
Agent types
| Type | Role | Example |
| --- | --- | --- |
| Router | Classifies and dispatches to specialists | Triage agent for support tickets |
| Specialist | Handles a narrow domain with focused tools | SQL agent, code-review agent |
| Orchestrator | Manages multi-step pipelines and state | Project planner that sequences sub-agents |
| Guardrail | Validates outputs before they reach the user | Safety classifier, PII redactor |
Model selection & granularity
Match model capability to the agent role. Use smaller, cheaper models for high-volume routing/triage (e.g., mini/nano or equivalent) and larger, more capable models for specialist reasoning or multimodal work.
Recent community signals show open models are viable for many agent tasks. LangChain’s analysis indicates some open models (e.g., GLM-5, MiniMax M2.7) now match closed models on core agent tasks like file ops and tool use; evaluate them on your workloads before committing (see LangChain: "Open Models have crossed a threshold" https://blog.langchain.com/open-models-have-crossed-a-threshold/).
Consider model latency and token budget per hop; design the graph to minimize unnecessary round-trips (summarize between hops when appropriate).
For long-running or deep tasks, prefer non-blocking subagent patterns (async subagents) so supervisors can continue interacting with users while background work completes. LangChain Deep Agents v0.5 formalizes async subagents and the task lifecycle (start/check/update/cancel/list) — adopt similar lifecycle semantics when implementing background tasks (https://blog.langchain.com/deep-agents-v0-5/).
Agent authorization and credentials
Two common authorization patterns: per-user delegation (agents act using the end-user's credentials) and fixed-agent credentials (agents use a service account or fixed key). Choose per workload.
Always explicitly include an auth/credential field in handoff metadata when routing to agents that will access protected resources.
When passing user credentials, include scope-limited tokens and refresh/expiration metadata to enable safe short-lived access.
Handoff protocol
Handoffs transfer control from one agent to another. The handoff carries:
target agent — which specialist to invoke
context — accumulated conversation state (or pointer/token to external store)
instructions — what the target should focus on
metadata — routing reason, priority, constraints
auth — credential or auth type required (e.g., "user-delegated" or "service-account")
model_hint — optional preferred model or model class for the target
// Vendor-agnostic handoff definition (pattern)
const handoffPayload = {
  target: "code_agent",
  contextPointer: "redis://ctx:12345", // or inline context
  instructions: "Implement feature X using the research summary",
  metadata: { reason: "needs_implementation", priority: "high" },
  auth: { type: "service-account", tokenRef: "vault://tokens/agent-sa" },
  model_hint: "capability:reasoning-large",
};
Notes:
Keep the schema vendor-agnostic (target, contextPointer, instructions, metadata, auth, model_hint) so routing logic can span multiple providers and platforms (OpenAI, Vercel AI Gateway, Anthropic/Claude deployments).
Prefer pointers (contextPointer) for long-lived context to avoid token bloat; fetch and rehydrate only the slices the specialist needs.
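As a sketch, the orchestration layer can validate a handoff payload against the vendor-agnostic schema before dispatch. The field names follow the example above; the validator itself is a hypothetical helper, not part of any SDK:

```python
REQUIRED_FIELDS = {"target", "instructions"}
OPTIONAL_FIELDS = {"contextPointer", "context", "metadata", "auth", "model_hint"}

def validate_handoff(payload: dict) -> list[str]:
    """Return a list of validation errors (empty list means dispatchable)."""
    errors = []
    missing = REQUIRED_FIELDS - payload.keys()
    if missing:
        errors.append(f"missing required fields: {sorted(missing)}")
    unknown = payload.keys() - REQUIRED_FIELDS - OPTIONAL_FIELDS
    if unknown:
        errors.append(f"unknown fields: {sorted(unknown)}")
    if "context" not in payload and "contextPointer" not in payload:
        errors.append("provide either inline context or a contextPointer")
    auth = payload.get("auth")
    if auth is not None and auth.get("type") not in {"user-delegated", "service-account"}:
        errors.append("auth.type must be 'user-delegated' or 'service-account'")
    return errors
```

Rejecting malformed handoffs at the adapter boundary keeps provider-specific quirks from leaking into specialist agents.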
State management
| Approach | Strengths | Trade-offs |
| --- | --- | --- |
| External store (e.g., Redis or a database) | Persistent, queryable, can store pointers and compact summaries | Adds infra dependency; secure access needed |
| Vector DB + persistent memory | Natural for retrieval-augmented agents and long-lived contexts (LangChain + MongoDB patterns) | Requires vector index maintenance and cost for storage/search |
| Structured context object | Typed, compact | Requires schema discipline and migrations |
| Tool-based state | Agents read/write via tools (file store, DB APIs); natural LLM interface | Needs transactional semantics if concurrent |
Pattern: prefer pointers (contextPointer) + on-demand retrieval for long-running chains to avoid token bloat.
Use a dedicated vector DB and memory layer for agents that require personal or long-lived context; enforce access controls per-credential.
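The pointer pattern can be sketched with an in-memory dict standing in for the external store. `save_context` and `rehydrate` are hypothetical helpers; the `redis://` key format follows the handoff example:

```python
# In-memory stand-in for an external context store (e.g., Redis).
_CONTEXT_STORE: dict[str, dict] = {}

def save_context(pointer: str, context: dict) -> None:
    _CONTEXT_STORE[pointer] = context

def rehydrate(pointer: str, slices: list[str]) -> dict:
    """Fetch only the slices a specialist needs, not the full history."""
    full = _CONTEXT_STORE.get(pointer, {})
    return {k: full[k] for k in slices if k in full}

save_context("redis://ctx:12345", {
    "research_summary": "Feature X requires OAuth and a job queue.",
    "full_transcript": "...thousands of tokens of conversation...",
    "user_prefs": {"tone": "concise"},
})
# The code specialist only needs the summary, not the whole transcript.
specialist_view = rehydrate("redis://ctx:12345", ["research_summary"])
```

The specialist receives a compact view, so each hop's token budget stays bounded regardless of how long the overall conversation runs.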
Async & long-running tasks
New orchestration patterns support async (non-blocking) subagents that return a task ID immediately and execute in the background. Treat these tasks as first-class entities with a lifecycle (start, check, update, cancel, list) and store task IDs and status in your external state store. LangChain Deep Agents v0.5 documents this pattern and the accompanying management tools (https://blog.langchain.com/deep-agents-v0-5/).
Design supervisors to avoid blocking on long-running subagents: allow the supervisor to continue interacting with the user, poll or subscribe to task completion events, and provide follow-up instructions to running tasks when necessary.
Ensure idempotency and cancellation semantics for background tasks. Provide meaningful timeouts and resource limits; surfaced task metadata should include expected duration and resource class.
from deepagents import AsyncSubAgent, create_deep_agent
agent = create_deep_agent(
    model="anthropic:claude-sonnet-4-6",
    subagents=[
        AsyncSubAgent(
            name="researcher",
            description="Performs deep research on a topic.",
            url="https://my-agent-server.dev",
            graph_id="research_agent",
        ),
    ],
)
# Async tools available on the main agent: start_async_task, check_async_task,
# update_async_task, cancel_async_task, list_async_tasks
Multiple async subagents can run concurrently; supervisors should persist task IDs, expose status to users, and provide update/cancel hooks. Use a task lifecycle table in your external store (task_id, owner, status, started_at, expected_duration, provenance).
Async subagents enable heterogeneous deployments: an orchestrator can delegate to remote agents that run different models, toolsets, or hardware. Implement strict authentication and encryption on Agent-Protocol endpoints and enforce runtime authorization checks on cross-host handoffs.
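A minimal sketch of the task lifecycle table described above, using an in-memory registry. A real supervisor would persist this in Redis or a database; the function names mirror, but are not, the deepagents tools:

```python
import time
import uuid

TASKS: dict[str, dict] = {}

def start_task(target: str, payload: dict, expected_duration_s: int = 300) -> str:
    task_id = str(uuid.uuid4())
    TASKS[task_id] = {
        "target": target, "payload": payload, "status": "running",
        "started_at": time.time(), "expected_duration_s": expected_duration_s,
        "updates": [],  # follow-up instructions sent to the running task
    }
    return task_id

def check_task(task_id: str) -> str:
    return TASKS[task_id]["status"]

def update_task(task_id: str, instruction: str) -> None:
    TASKS[task_id]["updates"].append(instruction)

def cancel_task(task_id: str) -> None:
    # Idempotent: cancelling twice (or after completion) is a no-op.
    if TASKS[task_id]["status"] == "running":
        TASKS[task_id]["status"] = "cancelled"

def list_tasks() -> list[str]:
    return list(TASKS)
```

The registry fields match the lifecycle table suggested above (task_id, owner/target, status, started_at, expected_duration), so a UI can surface status without touching the running subagent.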
Skills, Fleet, and reuse
Teams increasingly reuse "skills" (shareable, versioned behavior modules) across fleets of agents. Skills capture domain knowledge, tools, and test suites that you can attach to agents for consistent behavior (LangChain Fleet patterns: https://blog.langchain.com/march-2026-langchain-newsletter/).
OpenAI's Custom GPTs provide another packaging pattern: purpose-built, customizable assistants that can encapsulate agent behavior, prompts, and tools for distribution or productization. Custom GPTs support tailored instructions, uploaded knowledge, and enabled tools (OpenAI Academy: "Using custom GPTs", Apr 10 2026 — https://openai.com/academy/custom-gpts).
Treat skills and Custom GPTs as code: version them, include automated tests, and attach identity/permission metadata so they can be safely shared across teams.
Agent identity and permissions: assign each agent an identity and an access policy (which skills it may use, which credentials it may request) and enforce at runtime. Enterprise platforms emphasize governance and permissions for company-wide agents.
Workflow
Step 0: Define evaluation and safety requirements before building
Draft an agent evaluation checklist (routing accuracy, task completion, guardrail precision) and include both offline and online tests (LangChain agent evaluation checklist patterns).
Specify telemetry, failure modes, and human escalation paths. Consider integrating safety bug-bounty signals and anomalous-behavior reports into your issue triage.
Step 1: Define your agent graph
Map out which agents exist and how control flows between them.
Step 3: Run the orchestration loop — secure execution
For agents that execute code, run untrusted code in isolated sandboxes. Use timeouts and resource limits.
Tag streaming data with agent identity so consumers can attribute output to the correct agent.
async function* orchestrate(userMessage: string) {
  // Async generator: `yield` streams messages to the caller as they arrive.
  const result = await run(triageAgent, userMessage, {
    maxTurns: 15,
    stream: true,
  });
  for await (const event of result) {
    if (event.type === "agent_handoff") {
      console.log(`Handoff: ${event.from} → ${event.to}`);
    }
    if (event.type === "tool_call") {
      console.log(`Tool: ${event.name}(${JSON.stringify(event.args)})`);
    }
    if (event.type === "message") {
      yield event.content;
    }
  }
}
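For the sandboxing guidance in this step, a minimal local sketch runs untrusted code in a separate interpreter with a hard timeout. This gives process isolation only; a real deployment would use Vercel Sandbox or an equivalent isolated environment with filesystem/network restrictions and resource limits:

```python
import subprocess
import sys

def run_untrusted(code: str, timeout_s: float = 5.0) -> dict:
    """Run untrusted Python in a separate interpreter with a hard timeout.
    Process isolation only; real sandboxes also restrict filesystem/network."""
    try:
        proc = subprocess.run(
            [sys.executable, "-I", "-c", code],  # -I: isolated mode, ignores env/site
            capture_output=True, text=True, timeout=timeout_s,
        )
        return {"ok": proc.returncode == 0, "stdout": proc.stdout, "stderr": proc.stderr}
    except subprocess.TimeoutExpired:
        return {"ok": False, "stdout": "", "stderr": f"timed out after {timeout_s}s"}
```

The timeout is the load-bearing part: an agent stuck in a loop fails the run instead of blocking the orchestrator.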
Step 4: Add guardrails and safety processes
Align guardrails with authoritative model behavior specifications and monitor for agentic vulnerabilities (prompt injection, hidden tool-use, and data exfiltration). The Model Spec emphasizes instruction precedence and treating untrusted data cautiously — apply these principles when composing multi-agent systems.
Vendor guidance (OpenAI, Anthropic) and community safety patterns should inform guardrail ordering and policy enforcement.
Participate in, or subscribe to, safety bug bounty channels to receive reports about agentic failures and prioritize fixes.
Tooling & libraries
LangChain patterns: evaluation checklists, middleware for agent harness customization, Fleet for managing agent fleets and shareable skills, and Deep Agents for long-running orchestration. Deep Agents v0.5 adds async subagents and an Agent Protocol for cross-deployment interoperability; adopt task lifecycle patterns (start/check/update/cancel/list) and store task IDs in your state system (https://blog.langchain.com/deep-agents-v0-5/).
Vercel provides sandboxed environments and has added CLI management for sandboxes (see Vercel changelog entry: "Use and manage Vercel Sandbox directly from the Vercel CLI", Apr 8 2026). Use Vercel Sandboxes or equivalent isolated environments to run untrusted agent workloads and tests; consult the changelog for exact CLI commands and requirements (https://vercel.com/changelog).
Prefer a stable, vendor-agnostic handoff schema in your orchestration layer to minimize friction when SDKs change. Normalize vendor-specific fields in an adapter layer and implement provider adapters for OpenAI, Anthropic (Claude), Vercel AI Gateway, and any in-house agent servers.
Use vector DBs, Redis, or managed memory layers that integrate with your agent harness. Monitor open-source model releases and evaluate them on your agent-specific tasks before committing to a vendor.
Examples
Example 1: Customer support pipeline
// Specialists are defined first so the triage agent can reference them.
const billingAgent = new Agent({
  name: "billing_agent",
  instructions: "Handle billing inquiries. You can look up invoices and issue refunds.",
  tools: [lookupInvoice, issueRefund, updateSubscription],
  authorization: "service-account",
});
// techAgent and accountAgent are defined the same way.

const supportTriage = new Agent({
  name: "support_triage",
  instructions: `Classify the support ticket:
- billing → billing_agent
- technical → tech_agent
- account → account_agent
- unknown → ask clarifying question`,
  handoffs: [
    handoff(billingAgent),
    handoff(techAgent),
    handoff(accountAgent),
  ],
});
Example 2: Code generation with review loop (with sandboxing)
async function codeWithReview(task: string) {
  let result = await run(codeAgent, task);
  let attempts = 0;
  while (attempts < 3) {
    const review = await run(reviewAgent, result.output);
    if (review.output.includes("APPROVED")) break;
    // Feed review findings back to the code agent (execute any code in a sandbox)
    result = await run(codeAgent, `Fix these issues:\n${review.output}`);
    attempts++;
  }
  return result.output;
}
Example 3: Async background work with task lifecycle
// Supervisor launches a long-running research task
const taskId = await startAsyncTask({ target: "research_agent", input: { query } });
// Store taskId in Redis or DB; supervisor returns to user and continues
// Later: check status or retrieve result
const status = await checkAsyncTask(taskId);
if (status.done) {
  const result = await getAsyncTaskResult(taskId);
}
// Update or cancel when needed
await updateAsyncTask(taskId, { refine: "Focus on peer-reviewed sources only" });
await cancelAsyncTask(taskId);
Cross-vendor routing
Vendors (Anthropic, OpenAI, Vercel, etc.) expose tools and router patterns. Keep the handoff payload vendor-agnostic and normalize vendor-specific fields in the adapter layer. Where possible, target Agent-Protocol-compliant endpoints for remote agents (LangChain Agent Protocol: https://github.com/langchain-ai/agent-protocol).
async function vendorAgentRouter(userMessage: string) {
  const routerPrompt = `You are a router. Analyze the request and return a JSON handoff object with these fields: {agent, context, priority, routing_reason}. Do not call vendor tools directly from this prompt.`;
  const response = await callRouterModel({
    model: routerModel, // use a fast router model
    prompt: routerPrompt + "\n" + userMessage,
  });
  const handoff = JSON.parse(extractJson(response));
  // Normalize and dispatch using the vendor adapters
  await dispatchToAgent(handoff);
}
Decision tree
Is one LLM call enough?
├── Yes → Single agent with tools (no orchestration needed)
└── No
    ├── Is the flow linear (A → B → C)?
    │   ├── Yes → Pipeline pattern (chain agents sequentially)
    │   └── No
    │       ├── Does routing depend on input classification?
    │       │   ├── Yes → Router + Specialists pattern
    │       │   └── No
    │       │       ├── Do agents need to iterate (code → review → fix)?
    │       │       │   ├── Yes → Loop pattern with max iterations
    │       │       │   └── No → Parallel fan-out pattern
    │       │       └── Need human approval mid-flow?
    │       │           ├── Yes → Add approval gates between stages
    │       │           └── No → Fully autonomous pipeline
    └── Need guardrails?
        ├── Yes → Wrap entry/exit with guardrail agents and monitor per authoritative model-spec guidance
        └── No → Direct orchestration
Additional checks:
- Which model class for each agent? (fast/mini for routers, larger for specialists)
- Which authorization pattern is required? (user-delegated vs service-account)
- Do any agents execute code? If yes, run in sandboxes with resource limits (see Vercel changelog entry for CLI sandbox management)
- Where is state stored? Use pointers for long-lived context and vector DBs for retrieval memory
- Do you need shareable skills across teams? Consider Fleet + skills patterns for governance and reuse
Edge cases and gotchas
Infinite loops: Always set maxTurns — agents handing off to each other can loop forever
Context bloat: Each handoff passes the full conversation; for long chains, summarize before handoff
Error propagation: If a specialist fails, the orchestrator must handle the error — don't let it silently drop
Model mismatch: Router agents can use cheaper/faster models; specialists may need more capable ones
Streaming across handoffs: The stream must indicate which agent is currently responding
Tool overlap: If two specialists share a tool, ensure they don't conflict on shared state
Latency multiplication: Each handoff adds a full LLM round-trip; budget 2-5s per hop where possible
Guardrail ordering: Input guardrails run before the agent; output guardrails after — both can block
Agentic vulnerabilities: Monitor for prompt injection, hidden tool-use, and data exfiltration; leverage safety bug bounty learnings and chain-of-thought monitoring where possible
Long-running task hygiene: For async subagents and background tasks, enforce task TTLs, record provenance, and provide cancellation and idempotency hooks (LangChain Deep Agents async guidance: https://blog.langchain.com/deep-agents-v0-5/).
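The infinite-loop gotcha above can be mitigated with a small guard that caps total turns and detects repeated handoff edges. A sketch; the limits are illustrative:

```python
class HandoffGuard:
    """Stops runaway agent loops: caps total turns and detects ping-pong handoffs."""

    def __init__(self, max_turns: int = 15, max_repeats: int = 2):
        self.max_turns = max_turns
        self.max_repeats = max_repeats
        self.history: list[tuple[str, str]] = []

    def allow(self, source: str, target: str) -> bool:
        if len(self.history) >= self.max_turns:
            return False
        # The same A→B handoff recurring too often suggests a loop.
        if self.history.count((source, target)) >= self.max_repeats:
            return False
        self.history.append((source, target))
        return True
```

Pair this with the SDK-level maxTurns setting; the edge-repeat check catches A↔B ping-pong loops well before the global turn cap fires.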
Evaluation criteria
| Criterion | How to measure |
| --- | --- |
| Routing accuracy | % of requests dispatched to the correct specialist |
| Task completion rate | % of end-to-end tasks resolved without human escalation |
| Handoff efficiency | Average number of handoffs per task (lower is better) |
| Latency budget | Total wall-clock time from input to final output |
| Error recovery rate | % of tool failures gracefully retried or escalated |
| Context preservation | Does the specialist have all needed info after handoff? |
| Guardrail precision | False positive rate on blocked legitimate requests |
Run both offline evals (static datasets, unit tests) and online evals (shadow mode, canary traffic). Use LangChain evaluation patterns and vendor SDK testing tools to build reproducible agent tests.
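As a sketch, the first two criteria can be computed offline from labeled data. `toy_route` is a stand-in for the router under test, not a real classifier:

```python
def routing_accuracy(route, dataset: list[tuple[str, str]]) -> float:
    """dataset: (ticket_text, expected_agent) pairs; route: ticket_text -> agent name."""
    correct = sum(1 for text, expected in dataset if route(text) == expected)
    return correct / len(dataset)

def avg_handoffs(traces: list[list[str]]) -> float:
    """traces: per-task lists of agents visited; handoffs = visits - 1."""
    return sum(len(t) - 1 for t in traces) / len(traces)

# Toy keyword router standing in for the triage agent.
def toy_route(text: str) -> str:
    if "refund" in text or "invoice" in text:
        return "billing_agent"
    if "crash" in text or "error" in text:
        return "tech_agent"
    return "account_agent"

dataset = [
    ("My invoice is wrong", "billing_agent"),
    ("App crash on login", "tech_agent"),
    ("Change my email", "account_agent"),
    ("I want a refund", "billing_agent"),
]
accuracy = routing_accuracy(toy_route, dataset)
```

In practice, swap `toy_route` for a call to the real router model and run the same harness in CI so routing regressions surface before deployment.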
Research-backed changes
LangChain Deep Agents v0.5 (Apr 2026) introduces async (non-blocking) subagents, expanded multi-modal filesystem support, and an Agent Protocol for cross-deployment interoperability. It documents AsyncSubAgent usage patterns and the task-management tools (start_async_task, check_async_task, update_async_task, cancel_async_task, list_async_tasks) — adopt the lifecycle semantics for background tasks in your orchestrator (https://blog.langchain.com/deep-agents-v0-5/).
OpenAI published guidance on Custom GPTs (Apr 10, 2026) that formalizes a packaging option for repeatable assistant/agent behaviors — Custom GPTs support tailored instructions, uploaded knowledge, and enabled tools (OpenAI Academy: "Using custom GPTs", Apr 10 2026 — https://openai.com/academy/custom-gpts).
Vercel added explicit CLI support for managing sandboxes to run isolated workloads (see Vercel changelog entry: "Use and manage Vercel Sandbox directly from the Vercel CLI", Apr 8 2026 — https://vercel.com/changelog). Use sandboxed environments for untrusted code execution and validate CLI requirements from the changelog.
Continue to monitor vendor changelogs, LangChain releases, and the Agent Protocol repo for protocol-level changes that affect cross-vendor routing and long-running orchestration.