Murph's Take

When to Use One Big Agent vs. Many Small Ones: Tradeoffs Explained

The multi-agent vs. monolithic agent decision isn't aesthetic — it's architectural. The right answer depends on context coherence, failure tolerance, and the shape of the work.

Jason Murphy · March 29, 2026 · 8 min read

I've been building with Claude agents long enough to have made most of the architecture mistakes at least once. The one that takes the longest to see clearly is the monolith-vs-network question — when to keep everything in one context and when to distribute across multiple agents.

The answer isn't intuitive because it depends on factors that aren't obvious until you've shipped both configurations in production.

The Case for One Agent

The strongest argument for keeping work in a single agent is context coherence.

A language model reasons from what's in its context. Everything it knows about the current task — the goal, the constraints, the decisions made so far, the complications that have come up, the patterns it's noticed — lives in that context. When you add each new piece of information, the model can reason about how it relates to everything else. It maintains a coherent model of the task in progress.

When you split work across multiple agents, you introduce handoffs. At each handoff, some of that context gets lost. The summary you pass from Agent A to Agent B is accurate but compressed — it conveys state without necessarily conveying the reasoning that produced it. Agent B knows what Agent A decided but not why, and the "why" often matters for the decisions Agent B needs to make.
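One way to shrink the handoff tax is to make the payload carry reasoning, not just state. The sketch below is purely illustrative; the `Handoff` type and its field names are my assumptions, not any particular framework's schema.

```python
from dataclasses import dataclass, field

# Hypothetical handoff payload. The split mirrors the point above:
# "state" is easy to encode; "reasoning" is what usually gets lost.
@dataclass
class Handoff:
    # State: what most summaries already carry.
    outputs: dict
    status: str
    # Reasoning: the "why" that Agent B normally never sees.
    decisions: list = field(default_factory=list)            # "chose X because Y"
    dead_ends: list = field(default_factory=list)            # "tried Z, abandoned because W"
    discovered_constraints: list = field(default_factory=list)

def summarize_for_next_agent(h: Handoff) -> str:
    """Render the handoff so the receiving agent sees the why, not just the what."""
    lines = [f"Status: {h.status}"]
    lines += [f"Decision: {d}" for d in h.decisions]
    lines += [f"Dead end: {d}" for d in h.dead_ends]
    lines += [f"Constraint: {c}" for c in h.discovered_constraints]
    return "\n".join(lines)
```

Even a structure this simple forces the sending agent to externalize decisions and dead ends instead of silently dropping them at the boundary.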

For complex, interconnected work — multi-file refactoring, long documents that need to maintain coherence, strategic analysis that builds on itself — the context coherence of a single agent is worth a lot. You don't pay the handoff tax. The reasoning flows continuously through the task.

The Case for Many Agents

The strongest argument for distributing work across agents is independence.

When subtasks are genuinely independent — they can be completed without knowing the status or output of the others — parallelism is free leverage. Multiple agents working simultaneously on non-overlapping scopes can do in one unit of time what a single agent would take multiple units to complete.
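When the scopes really are independent, the orchestration code can be trivial. A minimal sketch, assuming `run_agent` stands in for whatever call actually invokes a scoped agent (that function and the scope names are hypothetical):

```python
from concurrent.futures import ThreadPoolExecutor

# Stand-in for invoking one scoped agent; in practice this would be
# an API call to a model with that scope's instructions.
def run_agent(scope: str) -> str:
    return f"completed: {scope}"

# Non-overlapping scopes: none needs the others' status or output.
scopes = ["frontend copy", "pricing table", "contact form"]

# Because the scopes are independent, they can run concurrently
# with no coordination beyond collecting results.
with ThreadPoolExecutor(max_workers=len(scopes)) as pool:
    results = list(pool.map(run_agent, scopes))
```

The design choice worth noticing: there is no cross-talk between workers. The moment one scope needs another's output, this pattern stops being free leverage and starts being a handoff problem.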

Independence also means isolated failure modes. If you have one agent with a large scope and it runs into a confusing situation, it can go sideways in ways that contaminate the whole task. If you have three agents with scoped tasks and one goes sideways, the other two complete successfully. The failure is contained. The fix is specific. This is standard distributed systems design applied to agent networks.

The other argument for many agents is capability specialization. Different subtasks might genuinely benefit from different system prompts, different context, different instructions. A research agent and an implementation agent and a review agent might have meaningfully different operating parameters. A single agent holding all of that simultaneously might be managing more context than it needs for any individual step.

The Design Test

The practical test I apply when deciding the architecture: are the decisions in later steps genuinely dependent on the outcomes of earlier steps in ways that are hard to encode in a handoff protocol?

If yes: one agent, or agents with very rich handoff protocols. If no: distributed agents with clean interfaces between them.

"Hard to encode in a handoff protocol" is doing a lot of work in that sentence. A status update is easy to encode: "Component X is done, here is the output." The nuance of why you made certain choices, what you tried that didn't work, what constraints you discovered during execution — that's harder. The more of that nuanced reasoning matters for subsequent steps, the more you pay for distributing the work.

What I've Built and What I've Learned

At Vibe Tokens we have 28 agent roles. That's not 28 simultaneous agents — it's 28 defined scope-and-authority configurations that can be instantiated for specific tasks.

What I've found: the most valuable specialization is at the level of role definition, not agent count. Each role definition is a carefully designed scope: what this agent knows, what it can decide, what it must escalate. Having clear role definitions makes single-agent tasks more focused (you're loading one specific role configuration, not all 28) and makes multi-agent coordination cleaner (each agent has a defined interface, not an ambiguous scope).
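The knows / decides / escalates triple can be made concrete as data. This is a sketch under assumptions, not our actual configuration format; the field names and the example role are illustrative.

```python
from dataclasses import dataclass

# Illustrative role definition: one scope-and-authority configuration
# that can be instantiated for a specific task.
@dataclass(frozen=True)
class Role:
    name: str
    knows: tuple      # context loaded for this role, nothing more
    decides: tuple    # decisions it may make on its own authority
    escalates: tuple  # conditions that must go to a human or parent agent

reviewer = Role(
    name="review",
    knows=("style guide", "diff under review"),
    decides=("approve", "request changes"),
    escalates=("security concern", "change outside the diff"),
)

def must_escalate(role: Role, condition: str) -> bool:
    """Explicit escalation check: ambiguity here is where scope creep starts."""
    return condition in role.escalates
```

Writing the escalation conditions down as data rather than prose is what makes the interface between agents checkable instead of ambiguous.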

The failure mode I keep seeing in multi-agent systems is scope creep between agents — one agent starts doing work that belongs to another because the interfaces weren't defined clearly enough. The fix isn't technical. It's definitional: clearer scope statements, more explicit handoff protocols, more specific escalation conditions.

The honest meta-lesson: agent architecture questions are mostly organizational design questions with a technical interface. The skills that make human team organization work — clear roles, clean handoffs, good coordination protocols — are the same skills that make agent networks work. The ones that make human organizations fail — ambiguous authority, poor communication, undefined escalation — produce the same failure modes in agent systems.

What's the shape of the work you're trying to automate — and does the distribution of context requirements match the distribution of agents you're considering?

Want this for your business?

Tell us what you're building. We'll map out exactly what to build and what it costs.

Start Your Project →

Frequently Asked

What's the core tradeoff between one agent and many?

One agent maintains full context coherence — it holds the complete picture of what it's doing and why, can reason across the whole task, and adapts to unexpected developments without coordination overhead. Many agents can work in parallel and have isolated failure modes — one agent failing doesn't take down the whole operation. The tradeoff is context coherence vs. parallelism and resilience.

When does one big agent outperform a network of specialized agents?

When the task requires continuous context across many steps — complex refactoring, multi-file architectural changes, long-form writing that needs to maintain voice and coherence throughout. When the decisions in later steps depend heavily on the outcomes of earlier steps in ways that are hard to encode in a handoff protocol. When the overhead of coordinating many agents would exceed the benefit of parallelism.

When does a network of specialized agents outperform a monolith?

When subtasks are genuinely independent — each agent can complete its scope without needing to know what the others are doing. When the operation needs to scale beyond a single context window. When you want isolated failure modes — one agent getting confused or going wrong shouldn't affect the others. When different subtasks require genuinely different capabilities or system prompts.

What's the most common failure mode in multi-agent systems?

Context loss at handoff. Agent A completes its scope and passes a summary to Agent B. The summary is accurate but loses the reasoning — why certain decisions were made, what alternatives were considered, what constraints were discovered. Agent B operates on the output without the judgment that produced it, and makes decisions that are locally reasonable but globally incoherent. The fix is designing handoff protocols that pass reasoning context, not just state.

Written by

Murph

Jason Matthew Murphy. Twenty years building digital systems for businesses. Former CardinalCommerce (acquired by Visa). Now running VibeTokens — AI-built websites and content for small businesses.

The window is open.

It won't be forever.

Start Your Project →