Sub Agents — Context Isolation

Context Perspective: A Sub Agent creates an isolated context to tackle a specific sub-task, preventing contamination of the main context.

The previous chapter covered orchestration patterns—how to organize steps. This chapter looks at the execution unit: when the main Agent needs a clean environment for a sub-task, it spawns a Sub Agent.

The Problem: Context Gets Dirty

Remember "context pollution" from The First Principle?

The longer a conversation goes, the longer the messages array gets. Early explorations, rejected solutions, irrelevant tool outputs… all piling up. When you ask the Agent to perform a precise sub-task in this noise—say, "write an integration test based on the latest API schema"—the LLM’s attention gets diluted. It might reference outdated code or follow a deprecated convention.

You need a clean room.

That said, not every task needs one. If a single Command trigger is enough, use that; it's simpler. A Skill loads once and stays active. If the sub-task is independent and has a clear goal, spawn a Sub Agent right away; there's no need to wait until the context gets noisy.

Spawning a Sub Agent

A Sub Agent is that clean room.

The main Agent can spawn one or more Sub Agents. Each Sub Agent’s messages history starts from zero—it can’t see what the main Agent discussed with you. But "clean" doesn’t mean "blank": Sub Agents typically inherit the main Agent’s System Instructions. The project standards, coding conventions, and safety rails you wrote in CLAUDE.md—the Sub Agent follows those too.

What’s isolated is the conversation history, not the project rules.

One more thing that’s easy to miss: the Sub Agent’s initial prompt is usually constructed by the main Agent automatically. You give the main Agent a big task. The main Agent analyzes it, decides "this sub-task needs isolated handling," and constructs an initial prompt for the Sub Agent. You can guide this through System Instructions—for example, "when delegating, always include file paths and constraints." The more precise your instructions, the better the prompt it constructs.
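The split between inherited rules and fresh history can be sketched in a few lines. This is a minimal illustration, not any particular tool's API: the `AgentContext` shape and the `spawnSubAgent` helper are hypothetical names introduced here.

```typescript
// Hypothetical shapes for illustration; real agent frameworks will differ.
type Role = "user" | "assistant" | "tool";

interface Message {
  role: Role;
  content: string;
}

interface AgentContext {
  system: string;      // System Instructions (project rules)
  messages: Message[]; // conversation history
}

// Spawn a Sub Agent context: project rules are inherited, history is not.
function spawnSubAgent(parent: AgentContext, initialPrompt: string): AgentContext {
  return {
    system: parent.system, // inherited as-is
    messages: [{ role: "user", content: initialPrompt }], // starts from zero
  };
}

const mainAgent: AgentContext = {
  system: "[Project System Instructions]\n- TypeScript strict mode\n- Tests use Vitest",
  messages: [
    { role: "user", content: "Let's explore three ways to restructure auth..." },
    { role: "assistant", content: "Option A was rejected because..." },
  ],
};

const subAgent = spawnSubAgent(mainAgent, "Write an integration test for createUser.");
console.log(subAgent.messages.length); // 1
```

The main Agent's two earlier messages never reach the Sub Agent; only the system text and the constructed prompt do.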

What a Good Initial Prompt Looks Like

The most common mistake when delegating to a Sub Agent is dumping the entire chat history.

The right approach — a focused task description with just three things:

Goal: Be specific. "Fix the login bug in the auth module."
Constraints: State the boundaries. "Do not touch the DB schema. Do not add new dependencies."
Key Context: Provide only what's necessary. "Relevant files are A and B. Error logs are in C."

Dumping context is lazy. The Sub Agent will get lost in the noise.
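One way to keep a delegation prompt to those three parts is to build it from a small structure rather than from the chat history. The sketch below assumes a hypothetical `TaskBrief` type and `buildInitialPrompt` helper; the field names simply mirror the three items above.

```typescript
// Hypothetical brief: the only inputs the prompt is allowed to draw from.
interface TaskBrief {
  goal: string;
  constraints: string[];
  keyContext: string[];
}

// Render the brief as a plain-text initial prompt with three sections.
function buildInitialPrompt(brief: TaskBrief): string {
  const bullets = (items: string[]): string =>
    items.map((item) => `- ${item}`).join("\n");

  return [
    `Goal: ${brief.goal}`,
    `Constraints:\n${bullets(brief.constraints)}`,
    `Key Context:\n${bullets(brief.keyContext)}`,
  ].join("\n\n");
}

const prompt = buildInitialPrompt({
  goal: "Fix the login bug in the auth module.",
  constraints: ["Do not touch the DB schema.", "Do not add new dependencies."],
  keyContext: ["Relevant files are A and B.", "Error logs are in C."],
});

console.log(prompt);
```

Because the function has no parameter for history, there is no path by which the full conversation can leak into the prompt.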

How It Works

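The main Agent delegating to a Sub Agent boils down to three steps: the main Agent constructs an initial prompt from its own context, the Sub Agent executes independently in an isolated context, and a summary is returned and injected back into the main Agent’s context.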

── Inside the Sub Agent ──

The Sub Agent’s messages start from zero, but its system prompt inherits the project’s System Instructions:

Round 1: Receiving the task

```json
// → REQUEST (Sub Agent → LLM API)
{
  "system": "You are a code assistant.\n\n[Project System Instructions]\n- TypeScript strict mode\n- Tests use Vitest\n- No any types\n...",
  "messages": [
    { "role": "user", "content": "You are a QA Engineer. Here is the API schema: {...}\nWrite an integration test for createUser..." }
  ]
}
```

Notice messages has exactly one entry—clean, focused, no baggage from the main Agent’s history.

Round 2: Context grows after tool calls

The Sub Agent reads the schema, writes a test file, runs the test, sees a failure, and corrects:

```json
// → REQUEST (Sub Agent → LLM API, Round 2)
{
  "messages": [
    { "role": "user", "content": "You are a QA Engineer..." },
    { "role": "assistant", "tool_calls": [{ "name": "read_file", "arguments": { "path": "src/api/v2/schema.json" } }] },
    { "role": "tool", "content": "{ \"endpoints\": { \"createUser\": { ... } } }" },
    { "role": "assistant", "tool_calls": [{ "name": "write_file", "arguments": { "path": "tests/createUser.test.ts", "content": "..." } }] },
    { "role": "tool", "content": "File written." },
    { "role": "assistant", "tool_calls": [{ "name": "bash", "arguments": { "command": "vitest run createUser" } }] },
    { "role": "tool", "content": "FAIL: expected 201 but got 500..." },
    { "role": "assistant", "content": "Test failed with 500. Fixing test mock..." }
  ]
}
```

The Sub Agent might go through a dozen rounds of tool calls internally—reading specs, writing code, running tests, fixing bugs. All of this happens in the isolated context. The main Agent can’t see it and isn’t disturbed by it.
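The round-by-round growth can be pictured as a loop that keeps appending to the Sub Agent's own array. The sketch below is an assumption-heavy stand-in: `Model` replaces the real LLM API call with a scripted stub, the tool outputs are made up, and `runSubAgent` is a hypothetical name.

```typescript
interface ToolCall {
  name: string;
  arguments: Record<string, string>;
}

interface LoopMessage {
  role: "user" | "assistant" | "tool";
  content?: string;
  toolCall?: ToolCall;
}

// Stand-ins for the LLM API and the tool runtime (both hypothetical).
type Model = (messages: LoopMessage[]) => LoopMessage;
type Tools = Record<string, (args: Record<string, string>) => string>;

// Run the loop inside the Sub Agent's own messages array.
function runSubAgent(initialPrompt: string, model: Model, tools: Tools, maxRounds = 12): LoopMessage[] {
  const messages: LoopMessage[] = [{ role: "user", content: initialPrompt }];
  for (let round = 0; round < maxRounds; round++) {
    const reply = model(messages);
    messages.push(reply);
    if (!reply.toolCall) break; // plain-text answer: the Sub Agent is done
    const tool = tools[reply.toolCall.name];
    const output = tool ? tool(reply.toolCall.arguments) : `Unknown tool: ${reply.toolCall.name}`;
    messages.push({ role: "tool", content: output });
  }
  return messages;
}

// Scripted stub model: two tool calls, then a final answer.
const script: LoopMessage[] = [
  { role: "assistant", toolCall: { name: "read_file", arguments: { path: "src/api/v2/schema.json" } } },
  { role: "assistant", toolCall: { name: "bash", arguments: { command: "vitest run createUser" } } },
  { role: "assistant", content: "Tests created, covering 201 and 400." },
];
let step = 0;
const stubModel: Model = () => script[step++] ?? { role: "assistant", content: "done" };

const stubTools: Tools = {
  read_file: () => "(schema contents)",
  bash: () => "PASS",
};

const transcript = runSubAgent("You are a QA Engineer...", stubModel, stubTools);
console.log(transcript.length); // 6
```

Every entry lands in `transcript`, which belongs to the Sub Agent. Nothing in this loop touches the main Agent's history.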

The Final Return

When the Sub Agent finishes, it returns a summary to the main Agent—not dozens of raw messages, but a compressed result. Think of it like git stash: stash your current complex context, do an atomic task on a clean branch, then switch back with the output.

What the main Agent receives is just: "Tests created, covering 201 and 400, file at tests/createUser.test.ts." Whatever struggles the Sub Agent went through in between, the main Agent doesn’t need to know.
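A rough sketch of that hand-back, under stated assumptions: `runIsolated` is a hypothetical stand-in that returns a canned transcript, and the `[Sub Agent result]` prefix is an invented convention. How a real tool labels the returned summary will vary.

```typescript
interface Msg {
  role: "user" | "assistant" | "tool";
  content: string;
}

// Hypothetical stand-in for a full Sub Agent run; returns its whole internal transcript.
function runIsolated(initialPrompt: string): Msg[] {
  return [
    { role: "user", content: initialPrompt },
    { role: "tool", content: "FAIL: expected 201 but got 500..." },
    { role: "assistant", content: "Test failed with 500. Fixing test mock..." },
    { role: "tool", content: "PASS" },
    { role: "assistant", content: "Tests created, covering 201 and 400." },
  ];
}

// Delegate a task: only the final message crosses back into the main context.
function delegate(mainMessages: Msg[], initialPrompt: string): Msg[] {
  const subTranscript = runIsolated(initialPrompt);
  const last = subTranscript[subTranscript.length - 1];
  const summary = last ? last.content : "(no result)";
  return [...mainMessages, { role: "tool", content: `[Sub Agent result] ${summary}` }];
}

const mainBefore: Msg[] = [{ role: "user", content: "Add tests for the user API." }];
const mainAfter = delegate(mainBefore, "Write an integration test for createUser...");

console.log(mainAfter.length); // 2
```

The main context grows by one message, however many rounds the Sub Agent needed.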

Isolation Often Matters More Than Concurrency

Most introductions to Sub Agents lead with concurrency: you can run several tasks at once, things finish faster. That's true, but it's only half the story.

Concurrency is just a scheduling mode; isolation is a way of managing context. The common problem isolation solves: the main context is already long and noisy, and a precise sub-task shouldn't have to run inside that mess.

Four scenarios make this concrete.

Exploratory tasks. When you send an agent to investigate an unfamiliar library or validate whether an approach is even viable, the work generates a lot of intermediate noise — attempts, errors, dead ends. Dumping all of that into the main context easily disturbs later decisions. Before handing such a task to an isolated Sub Agent, agree up front on what comes back: conclusion, evidence, risks — not the full exploration trail.

Review tasks. Code review often fits isolation too. It has its own analytical path, which doesn't always align with the implementation thread. Mixed into the same context, the two interfere with each other, and the review can be pulled off course by the implementer's framing. Isolation isn't only about saving context; it's also about reducing bias. The main thread only receives the review conclusions, evidence, and recommendation priorities.

Bulk operations. Batch renames, formatting passes, file migrations — these pile up large volumes of tool-call records, while the main thread usually only needs the final state, the impact scope, and the exceptions. Stuffing the entire process verbatim into the main context is usually a waste of context space. When delegating to a Sub Agent, make the risk boundaries explicit: return the change list, the failed items, the unhandled files — not just a curt "done."

Long-output tasks. Some tool calls return enormous output: test reports, compile logs, large API response bodies. In tools that support summarized return, a Sub Agent can digest that mass and bring back the error category, key evidence, and next-step suggestions. The point isn't taking up less space; it's compressing noise into signal.

These four scenarios share a common feature: the main thread doesn't need to consume the full execution process — only the result and the necessary evidence. Drill in further if needed, but that depends on whether the tool retains the trace.
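The "agree up front on what comes back" idea can be written down as a return contract per scenario. The types below are a sketch: the field names are taken from the four descriptions above, while the `SubAgentReport` union and the `summarize` helper are hypothetical.

```typescript
// One report shape per scenario; the Sub Agent must fill in one of these.
interface ExplorationReport { kind: "exploration"; conclusion: string; evidence: string[]; risks: string[]; }
interface ReviewReport { kind: "review"; conclusions: string[]; evidence: string[]; priorities: string[]; }
interface BulkReport { kind: "bulk"; changed: string[]; failed: string[]; unhandled: string[]; }
interface LongOutputReport { kind: "long-output"; errorCategory: string; keyEvidence: string[]; nextSteps: string[]; }

type SubAgentReport = ExplorationReport | ReviewReport | BulkReport | LongOutputReport;

// Turn a structured report into the single line that enters the main context.
function summarize(report: SubAgentReport): string {
  switch (report.kind) {
    case "exploration":
      return `Conclusion: ${report.conclusion} | Evidence: ${report.evidence.length} item(s) | Risks: ${report.risks.join("; ")}`;
    case "review":
      return `Review: ${report.conclusions.length} finding(s) | Top priority: ${report.priorities[0] ?? "none"}`;
    case "bulk":
      return `Changed ${report.changed.length}, failed ${report.failed.length}, unhandled ${report.unhandled.length}`;
    case "long-output":
      return `Error category: ${report.errorCategory} | Next: ${report.nextSteps.join("; ")}`;
  }
}

const bulkLine = summarize({
  kind: "bulk",
  changed: ["src/a.ts", "src/b.ts"],
  failed: ["src/c.ts"],
  unhandled: [],
});

console.log(bulkLine); // Changed 2, failed 1, unhandled 0
```

A bulk report with an empty `failed` list and one that omits failures are different things; making the field mandatory is what rules out the curt "done."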

This is also the boundary between this chapter and the previous one. If the question is "should this be done off to the side," that's isolation. If the question becomes "between several results, which goes first, which depends on which," that's orchestration. This chapter handles the earlier step: whether to isolate, and what isolation gives you.

Don't reach for concurrency for its own sake; first ask whether this stretch of execution will pollute the main thread.

Connecting Back to the First Principle

A Sub Agent’s performance depends on two things:

The quality of System Instructions: The project rules you wrote in CLAUDE.md—the Sub Agent consumes those too. Good rules mean the Sub Agent’s behavior aligns with project standards. This is the knowledge feeding rule layer at work inside Sub Agents.
The quality of the initial prompt the main Agent constructs: This loops back to The First Principle—context quality determines output quality. A good initial prompt must be:
- Self-contained: Not reliant on hidden information from the parent context.
- Complete: Including all necessary background materials (code snippets, file paths, clear objectives).
- Focused: Only information relevant to the sub-task, no noise.

What you can do: give the main Agent clear instructions and sufficient background. The better the raw material the main Agent has, the better the prompt it constructs for the Sub Agent.
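The three properties can be approximated with a crude check. Everything in the sketch below is an assumption made for illustration: the phrase list, the file-path pattern, and the length limit are arbitrary heuristics, and `checkInitialPrompt` is a hypothetical helper, not a feature of any tool.

```typescript
interface PromptCheck {
  selfContained: boolean; // no references to conversation the Sub Agent cannot see
  complete: boolean;      // mentions at least one concrete file path
  focused: boolean;       // stays under a size budget
}

// Crude, hypothetical heuristics; tune the patterns and the limit per project.
const HIDDEN_REFERENCES = /\b(as discussed|as mentioned|see above|earlier in (this|the) conversation)\b/i;
const FILE_PATH = /[\w.-]+\/[\w./-]+\.\w+/;
const MAX_CHARS = 4000;

function checkInitialPrompt(prompt: string): PromptCheck {
  return {
    selfContained: !HIDDEN_REFERENCES.test(prompt),
    complete: FILE_PATH.test(prompt),
    focused: prompt.length <= MAX_CHARS,
  };
}

const good = checkInitialPrompt(
  "Write an integration test for createUser. Schema: src/api/v2/schema.json. Do not add new dependencies."
);
const bad = checkInitialPrompt("Fix the bug as discussed.");

console.log(good, bad);
```

A check like this cannot judge quality. It only catches the most obvious failure: a prompt that leans on context the Sub Agent never receives.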

Key Takeaways

Context flow: The main Agent extracts information from its own context to construct the initial prompt → Sub Agent executes independently in isolated context → returns a summary that gets injected back into the main Agent’s context. Runtime isolation and full logging coexist—they don’t conflict.
Risk: Isolation is a double-edged sword. If the main Agent omits a key constraint when constructing the prompt, the Sub Agent works without critical information and may produce non-compliant code. Over-splitting also has costs—each Sub Agent needs to rebuild context from scratch, and coordination overhead accumulates.
Auditability: Each Sub Agent’s full session log is saved independently and can be traced. When a summary looks wrong, you can drill down into the Sub Agent’s complete context to investigate. A summary is compression, not truth.
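The coexistence of isolation and logging can be sketched as a summary that carries a pointer to its own session log. This is an in-memory illustration under assumed names (`SessionLogStore`, `TracedSummary`, `finish`); where and how a real tool stores session logs is not specified here.

```typescript
interface LogEntry {
  role: string;
  content: string;
}

interface TracedSummary {
  summary: string;   // what enters the main context
  sessionId: string; // where the full trail can be found
}

// Hypothetical in-memory store; a real tool would persist logs elsewhere.
class SessionLogStore {
  private sessions = new Map<string, LogEntry[]>();
  private nextId = 1;

  save(transcript: LogEntry[]): string {
    const id = `sub-${this.nextId++}`;
    this.sessions.set(id, [...transcript]); // copy so later edits don't alter the log
    return id;
  }

  load(id: string): LogEntry[] {
    return this.sessions.get(id) ?? [];
  }
}

// Finish a Sub Agent run: store the full log, return only summary + id.
function finish(store: SessionLogStore, transcript: LogEntry[]): TracedSummary {
  const last = transcript[transcript.length - 1];
  return { summary: last ? last.content : "", sessionId: store.save(transcript) };
}

const store = new SessionLogStore();
const result = finish(store, [
  { role: "user", content: "Write an integration test for createUser..." },
  { role: "tool", content: "FAIL: expected 201 but got 500..." },
  { role: "assistant", content: "Tests created, covering 201 and 400." },
]);

console.log(result.sessionId, store.load(result.sessionId).length); // sub-1 3
```

When a summary looks wrong, the `sessionId` is the handle for drilling down into the complete context.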

Next up: Human-in-the-Loop—your role in the workflow: when to let go, when to step in.
