Article

Supervisor, Router, and Planner-Executor Patterns

Routers dispatch, planners break work into a roadmap, and supervisors retain control across the run. The right orchestration pattern depends on where authority should live.

Once you move beyond one generic agent loop, the next design question is not which framework should I use?

It is:

Where should control live?

That is the real question behind supervisor, router, and planner-executor patterns.

These are not interchangeable multi-agent buzzwords.

They solve different orchestration problems.

The short answer is this:

That is the simplest useful distinction.

This article builds on Planning and Task Decomposition, ReAct and the Basic Reasoning Loop, Tool Use: How Agents Take Action, When to Use a Workflow Instead of an Agent, and Structured Outputs, Guardrails, and Execution Boundaries. Those pieces explain planning, local reasoning loops, tool execution, bounded autonomy, and runtime control. This one explains how those components get arranged once one undifferentiated loop is no longer enough.

Why Orchestration Patterns Start to Matter

For simple tasks, one agent may be enough.

If the job is:

then extra orchestration often just adds cost and confusion.

But once the work starts to involve:

the main question becomes architectural.

Who decides what happens next?

Is the task handed off once and then owned by something else?

Is the plan made upfront and then executed mechanically?

Or is there a central controller that keeps deciding throughout the run?

That is where these patterns become useful.

They are not just implementation styles.

They are different answers to the question of where authority lives across the task lifecycle.

Router Pattern: Choose the Lane

A router pattern is the lightest of the three.

Its job is to decide where the work should go.

That usually means:

The key point is that the router normally does not stay deeply involved after the handoff.

It dispatches.

Then it steps back.

That is why a router is best understood as a traffic controller, not a project manager.

What a Router Is Good At

Routers work best when the main challenge is classification.

For example:

In those cases, the real problem is:

Which lane should handle this request?

That is a routing problem.

Router vs Supervisor

This is the distinction many readers need most.

A router says:

This belongs over there.

A supervisor says:

I am still responsible for this task, and I will decide who does each part, when, and how the outputs come back together.

That is a much bigger job.

When Not to Use a Router

Do not use a router when the task needs ongoing cross-specialist coordination.

If the system must:

then the problem is no longer just dispatch.

At that point, a router alone is too thin.

Also note that not every router needs an LLM.

If the lanes are stable and the criteria are obvious, rules are often better.

That is the same lesson from When to Use a Workflow Instead of an Agent: do not spend dynamic reasoning on a choice that can be encoded simply.

Planner-Executor Pattern: Separate Strategy From Action

The planner-executor pattern splits the work into two roles.

One component creates the roadmap.

Another component executes the roadmap.

That is the core idea.

The planner answers:

The executor answers:

So planner-executor is not mainly about dispatching between specialists.

It is about separating strategic decomposition from tactical action.

Why Teams Use It

This pattern is attractive because it creates more structure than a raw ReAct loop.

Instead of rethinking the whole task after every small step, the system can build an initial roadmap and then work through it.

That often helps with:

The system does more of its thinking upfront.

That can make execution more predictable.

Planner-Executor vs ReAct

This distinction needs to be explicit.

ReAct is a local reasoning loop:

Planner-executor is a higher-level orchestration split.

It decides that planning and doing should be handled as separate roles.

So the difference is not:

That is too simplistic.

A planner-executor system might use:

The real distinction is structural.

ReAct keeps reasoning tightly interleaved with execution.

Planner-executor moves more of the reasoning into an earlier planning phase.

Main Tradeoff

Planner-executor is often stronger than a pure loop when the task can be decomposed well.

But it can become brittle if the world changes in the middle of execution.

If the plan is wrong, incomplete, or overtaken by new evidence, the system may need replanning.

That is why this pattern works best when the environment is uncertain enough to need planning, but not so unstable that the plan becomes obsolete immediately.

Supervisor Pattern: Retain Control Across the Run

A supervisor pattern is the heaviest of the three.

The supervisor stays in charge while the work is happening.

It does not just hand the task off once.

It manages the lifecycle.

That usually means the supervisor can:

So the key difference is authority retention.

The router transfers authority.

The planner embeds authority into the plan.

The supervisor keeps authority centrally and uses it dynamically.

That is why the supervisor pattern looks most like a manager.

Supervisor vs Orchestrator-Worker

Many framework docs describe something like orchestrator-worker.

In practice, that usually lands closest to the supervisor family.

Why?

Because the orchestrator is not merely classifying and dispatching once.

It is typically:

That is supervisor behavior.

The exact labels vary by framework.

The underlying control shape is what matters.

When a Supervisor Makes Sense

Use a supervisor when the work needs:

This is most useful on ambiguous tasks where the right sequence is not obvious upfront and one simple handoff is not enough.

What It Costs

Supervisor systems are powerful.

They are also expensive.

They usually mean:

That is why many teams overuse them.

A supervisor feels architecturally advanced.

But if the task only needed routing or a bounded planner-executor split, the supervisor is often overhead rather than progress.

The A.L.M. Lens

The cleanest way to distinguish these patterns is to use one comparison lens.

Call it The A.L.M. Lens.

A.L.M. stands for:

The point of the lens is simple:

Ask when decisions are made and who still owns the task after each decision.

Router

The router chooses the lane and exits.

Planner-Executor

The planner decides the roadmap first.

The executor carries it out.

Supervisor

The supervisor keeps deciding as the task unfolds.

That is why this lens is useful.

It does not depend on LangGraph terminology, AutoGen terminology, CrewAI terminology, or any vendor taxonomy.

It explains the patterns by control shape.

That is the durable distinction.

Quick Comparison

PatternWhen decisions happenWho owns the task after the first decision?Best forMain failure mode
Routerupfrontthe destination workflow or specialistclassification and dispatchpretending dispatch is enough when the task really needs coordination
Planner-executormostly upfront, with occasional replanningthe plan during executiondecomposable work with a legible roadmapbrittle execution when new evidence invalidates the plan
Supervisorcontinuously through the runthe supervisorambiguous, cross-specialist, adaptive workcoordination overhead, latency, and cost

How These Patterns Map to Real Systems

The labels vary across frameworks.

The control shape is more stable than the product vocabulary.

That is why it helps to map the pattern names back to the common public implementations.

So if you see terms like:

do not ask whether the names match perfectly.

Ask which control shape they represent.

That question is more transferable than any one framework taxonomy.

One Example Across All Three: Support Operations

Use one example and keep it stable.

Suppose a company is building an AI support system for incoming customer issues.

The system receives this request:

My laptop was charged twice, the replacement order is missing, and support already promised a refund.

Now compare the three patterns.

Router Version

A router classifies the issue and sends it to:

Whichever lane it chooses becomes the new owner.

This is strong when the main goal is triage.

It is weak when the issue clearly spans several domains.

Planner-Executor Version

A planner breaks the task into a roadmap such as:

  1. verify the duplicate charge
  2. check replacement-order status
  3. inspect prior support history
  4. decide whether refund conditions are already met
  5. execute the approved remediation steps

An executor or set of executors then performs those steps in sequence.

This is strong when the workflow can be structured well enough upfront.

It is weaker when new evidence changes the sequence materially halfway through.

Supervisor Version

A supervisor keeps ownership and dynamically delegates:

This is strong because the task is cross-functional and ambiguous.

It is expensive because the coordination itself becomes a major part of the run.

Hybrid Version

In real systems, the best design often combines patterns.

For example:

That is normal.

These patterns are composable.

The mistake is to treat them as mutually exclusive product categories instead of reusable control shapes.

Which Pattern Should You Start With?

Start with the smallest pattern that matches the real control problem.

That is the practical rule.

Start With a Router When

Start With Planner-Executor When

Start With a Supervisor When

Do Not Start With a Supervisor By Default

This deserves a direct warning.

Many teams hear multi-agent and jump straight to a supervisor architecture.

That is often premature.

If the system can succeed with:

then start there.

That is easier to:

And it aligns with the broader site principle that autonomy should be earned, not assumed.

The Practical Rule

If you only remember one distinction, use this one:

Routers dispatch. Planners structure. Supervisors stay responsible.

That is the cleanest first-principles summary.

And if you need the design rule behind it, use this:

Choose the pattern based on where authority should live after the first decision.

If authority should transfer, route.

If authority should be embedded into an upfront roadmap, plan and execute.

If authority must stay active and central throughout the run, supervise.

That is the real architecture choice.

The next topics naturally move from here into human-in-the-loop control design and then into evaluation and observability, because once orchestration patterns are clear, the next question is how to control and measure them.

FAQ

Isn’t a router just a simple supervisor?

No.

A router mainly decides where the task should go.

A supervisor keeps ownership of the task and continues coordinating while the work is happening.

That ongoing authority is the main difference.

Is planner-executor the same thing as multi-agent?

No.

Planner-executor is a role split, not a synonym for multi-agent.

You can have:

The core idea is the separation between roadmap creation and execution.

When should I use a router instead of a supervisor?

Use a router when the main problem is dispatch:

Use a supervisor when the main problem is coordination:

How is planner-executor different from ReAct?

ReAct is a local loop where reasoning and action stay tightly interleaved.

Planner-executor moves more of the reasoning into an earlier planning phase and then executes against that roadmap.

So ReAct is an execution pattern.

Planner-executor is an orchestration pattern.

Can one system use all three patterns?

Yes.

That is common in production.

A system might:

These are composable patterns.

Which pattern is cheapest?

Usually:

That is not a law, but it is the normal pattern because continuous coordination costs more than one dispatch decision or one upfront plan.

Where should the human sit in these patterns?

At high-risk transitions.

That includes:

The orchestration pattern does not remove the need for human control. It changes where that control should be inserted.

What comes after orchestration patterns in the learning path?

The next clean topics are:

That is where orchestration moves from architecture into production discipline.

Further Reading

If you want to compare this article against the main public pattern surface, start with these: