The Focused Version

The Core
Flywheel

Three tools. One loop. Most of the value. Learn the beginner-friendly core of the Agentic Coding Flywheel: Agent Mail for coordination, br for task structure, and bv for intelligent routing.

Scroll to explore
1

Why a Simpler Starting Point

The full Flywheel system has grown large enough that many people find it overwhelming on first contact. That reaction makes sense. But there is a much smaller core that already captures most of what makes the approach powerful.

The larger system includes planning workflows, memory systems, prompt libraries, launch tooling, safety tooling, skills, and a lot of accumulated operational detail. You do not need to absorb all of that up front.

The core loop uses just three tools. If you understand those three and use them together correctly, you already have the heart of the system.

Layer
Frontier models
What It Is For
Generating and refining the markdown plan
Layer
br (beads_rust)
What It Is For
Turning that plan into explicit task structure with dependencies
Layer
bv (beads_viewer)
What It Is For
Routing work through the dependency graph to find highest-leverage next bead
Layer
Agent Mail
What It Is For
Coordinating claims, reservations, progress, and handoff between agents

Separate the process into two layers: the planning substrate (frontier models used to create and refine the markdown plan) and the core operating loop (Agent Mail, br, and bv once the plan is ready to drive execution).

Who This Is For

This document is for a relatively smart software developer who is new to agentic coding and does not want to absorb the entire larger Flywheel guide up front. The goal is narrower: get you to the point where you can coordinate multiple agents without chaos, keep work organized as explicit tasks with dependencies, and keep agents working on the best next unblocked task instead of choosing randomly.

If that works well for you, the larger Flywheel stack becomes much easier to appreciate later.

Note
When this guide says "three-tool core," it means the durable execution substrate you keep live while the project is being carried out. It does not mean the planning phase happens in a vacuum. The first step is still creating an excellent markdown plan.
2

Five Terms You Need

If these five terms stay clear in your head, most of the rest of the guide gets much easier to follow.

Term
Bead
Meaning
A self-contained task with enough context, dependency information, and completion criteria that an agent can work it without guessing.
Term
Ready bead
Meaning
A bead whose blockers are cleared, so it can be started right now.
Term
Claim
Meaning
When an agent announces via Agent Mail that it is taking responsibility for a bead. Agents do this automatically.
Term
Reservation
Meaning
A coordination lock on files so two agents do not unknowingly collide. Agents manage these automatically via Agent Mail.
Term
Thread
Meaning
The Agent Mail conversation tied to one bead, where start, progress, and completion messages accumulate. Created automatically by agents.

How the Tools Work Together (Behind the Scenes)

You do not need to manually manage the coordination between these tools. When your AGENTS.md file is set up correctly, the agents handle the integration automatically: they use bead IDs as thread identifiers in Agent Mail, they announce claims and reserve files before editing, and they update bead status as they work. You configure this once in AGENTS.md and then the agents just do it.

Note
The human's job is to write a good plan, create good beads, launch agents with the right marching orders, and then tend the swarm. The agents handle the coordination plumbing themselves.
3

The Core Loop

The core loop is simple: generate a plan, encode it as beads, launch agents with marching orders, let them coordinate through Agent Mail while bv routes them toward the best next bead, and tend the swarm until the graph is done.

Core Loop

Six stages, one loop, compounding leverage

Each closed bead reshapes the graph. The next agent gets a better map. Click any stage to see the details.

Stage Detail
Plan

Create & refine markdown plan with multiple frontier models

Artifact

Markdown Plan

Command

Ask 3+ frontier models, synthesize

Planning tokens are far cheaper than implementation tokens

Stage

Normal Chat Coding vs. The Core Loop

Normal Chat Coding
The task mostly lives in chat
The Core Loop
The task gets formalized into a markdown plan and then beads
Normal Chat Coding
Coordination lives in scrollback or in the human's head
The Core Loop
Coordination is externalized through Agent Mail
Normal Chat Coding
Agents pick work based on local convenience
The Core Loop
bv routes work from the dependency graph
Normal Chat Coding
Progress is hard to inspect later
The Core Loop
The bead graph and message threads form a durable record
Normal Chat Coding
A crashed agent often takes its local state with it
The Core Loop
Another agent can resume from the bead and the thread

The core loop moves work out of ephemeral chat and into explicit, inspectable artifacts. That is the short answer to "why bother?"

Interactive Comparison

Why the tools matter

Four agents, six tasks, side by side. One side uses the core loop, the other does not. Press Start.

Without the Core Loop

All agents boot up and read the codebase

Agents

A1
reading codebase...
A2
reading codebase...
A3
reading codebase...
A4
reading codebase...

Tasks

T1
T2
T3
T4
T5
T6
Conflicts0
Idle Burn0
Completed0/6
With the Core Loop

All agents boot up and read the codebase

Agents

A1
reading codebase...
A2
reading codebase...
A3
reading codebase...
A4
reading codebase...

Tasks

T1
T2
T3
T4
T5
T6
T1 T2T1 T5T2 T4T3 T4
Conflicts0
Idle Burn0
Completed0/6

Four agents spin up in parallel and read the same codebase. Without coordination tools, they have no way to know what the others are doing.

Phase
4

The Three Tools Are a Single Machine

These three tools solve three different failure modes. Each helps on its own, but the value shows up most clearly when they form a stable loop together.

Agent Mail Solves Coordination

Without Agent Mail, multiple agents constantly collide: two agents edit the same files, nobody knows who is doing what, messages disappear into chat history, and work gets stranded when an agent crashes.

Agent Mail gives agents a shared coordination layer with identities, threads, inboxes, and file reservations. Agents announce what they are doing, reserve edit surfaces, and recover when another agent disappears. All of this happens automatically once your AGENTS.md tells agents to use Agent Mail.

br Solves Task Structure

Without br, work collapses into vague conversational intentions: "fix the auth stuff," "clean up the admin area," "someone should improve tests." That kind of tasking is too fuzzy for a swarm.

br turns work into explicit beads with status, priority, and dependencies. Once work is represented that way, multiple agents can make progress without constant human steering.

bv Solves Routing

Even with good beads, agents still need to know what to do next. Without bv, they choose work based on local convenience or whatever they most recently saw in context.

bv reads the bead graph and computes what is most worth doing next. That turns the swarm from "many agents doing work" into "many agents pushing the project forward efficiently."

Coordination Logic

Beads, Agent Mail, and bv are a single machine

Hover or tap to inspect each piece. Click again to remove it and watch the system lose a capability it cannot replace. This is the Coordination Triangle.

BeadsAgent Mailbv
Capability Map
Agent Mail

The high-bandwidth negotiation layer.

br

The durable, localized issue state.

bv

The graph-theory compass for triage.

System Architecture

The trio is not three nice-to-have tools. It is one operating system split into memory, communication, and leverage analysis. Remove any side of the triangle and the swarm loses determinism.

What Goes Wrong If You Skip One

If You Skip...
Agent Mail
What Usually Happens
Agents overlap, duplicate work, and lose shared situational awareness
If You Skip...
br
What Usually Happens
Work stays vague, hidden in chat, and hard to coordinate across agents
If You Skip...
bv
What Usually Happens
Agents pick tasks based on convenience instead of graph-aware leverage
5

The Artifact Ladder

One reason agentic coding feels confusing at first is that the active artifact keeps changing. The easiest way to stay oriented is to know what the current artifact means and what you do with it next.

Artifact Progression

What you produce at each stage

An idea becomes a plan, a plan becomes beads, beads become finished code. Click any stage to see the artifact, what it means, and your next move.

Active Stage
Raw Idea
Main Artifact

A rough description

What It Means

You know the goal, not the system.

What You Do Next

Turn it into a serious markdown plan.

Progression
Total Stages
6

Raw idea to finished code. Each stage adds precision.

Key Transition
Plan to Bead

Prose becomes executable memory. This is where agents stop guessing.

The Insight

Each completed bead reshapes the graph and unblocks new work. Finished beads create ready beads.

Plan Space, Bead Space, and Code Space

Plan space is where you decide the workflows, constraints, architecture, and testing expectations. Bead space is where you transform that thinking into executable memory for agents. Code space is where agents implement the local task that a bead defines.

The general rule is simple. Debates belong in plan space. Translation and dependency shaping belong in bead space. Implementation belongs in code space.

Critical
If the swarm starts discovering missing structure while coding, the answer is often to step back up into bead space or plan space instead of forcing more code through a weak task graph.

What a Good Plan Looks Like

A strong plan lets a fresh reader answer five questions without guessing: what are the main workflows? What constraints matter? What architecture are we choosing? How will we know it works? What failure cases must not disappear into hand-waving?

markdown
## Upload workflow
- Users drag Markdown files into the upload surface.
- The system parses frontmatter plus body text and stores a normalized note record.
## Constraints
- Unauthorized users must never see note content or note metadata.
- Failed ingestions must be preserved for operator review instead of discarded.
## Architecture choice
- Use a dedicated ingestion pipeline so parse failures can be persisted and retried.
- Keep search indexing separate from upload handling so indexing can be retried independently.
## Tests and failure handling
- Unit coverage for parsing and index mapping.
- E2E coverage for upload, failed-ingestion review, retry, search, and filtering.

It gives a fresh agent workflows, constraints, architecture, testing, and failure handling in one place. Before you turn the plan into beads, check that these five questions are answerable from the plan alone.

Escalation Ladder

When something feels wrong, use the smallest escalation that actually fits the problem:

  • Local code confusion — stay in code space and resolve it there
  • Weak or underspecified bead — step back into bead space and rewrite the bead
  • Wrong graph — fix the dependencies or add the missing bead
  • Missing plan work — step back into plan space and revise the markdown plan
  • Degraded agent — restart it with a fresh session
6

A Concrete Example: Atlas Notes

A small project makes the workflow easier to picture. Imagine building an internal tool called Atlas Notes: team members upload Markdown notes, the system tags and indexes them, users can search them quickly, and admins can inspect failed ingestions.

If you gave four agents only that vague description, they would step on each other and make mismatched assumptions. The core loop instead looks like this:

  1. 1You ask multiple frontier models to produce competing markdown plans, then synthesize them into one strong plan.
  2. 2You tell an agent to convert that plan into beads — upload pipeline, indexing, admin screen, auth, and end-to-end tests — with explicit dependencies.
  3. 3You launch 2-4 agents with marching orders. They read AGENTS.md, join Agent Mail, and start picking up beads using bv.
  4. 4You tend the swarm: check progress every 10-15 minutes, rescue confused agents, and add missing beads when needed.
  5. 5Agents implement, review their own work with fresh eyes, close beads, and move to the next one. You step in for strategic decisions.

What a Good Bead Looks Like

The bead is the unit of work agents actually execute. Weak beads force improvisation. Rich beads make execution mechanical. Here is a real bead from the ACFS project:

markdown
bd-01s: Add --deep flag to acfs doctor
Context:
Part of EPIC: Enhanced Doctor with Functional Tests.
What to Do:
Add --deep flag to doctor.sh that enables functional tests beyond
binary existence checks:
- Add DEEP_MODE=false global
- Parse --deep flag alongside existing --json
- --deep and --json can be combined
Acceptance Criteria:
- --deep flag parsed correctly
- Default doctor unchanged (fast, existence checks only)
- --deep runs additional functional tests
- Works with --json for structured output
Files to Modify:
- scripts/lib/doctor.sh: Argument parsing

The prose does not need polish. A fresh agent should be able to understand the task, the reason for it, and the acceptance criteria without reopening the whole markdown plan. You can browse real beads from actual Flywheel projects at FrankenEngine, FrankenTUI, and Asupersync.

Weak vs. Strong Artifacts

Quality thresholds get easier to feel when you compare weak and strong versions directly. The weak version names a topic. The strong version scopes the actual requirement, constraint, and testing obligation.

Interactive Comparison

Weak vs. Strong Artifacts

A weak artifact names a topic. A strong one carries scope, constraints, and a test plan. See the difference.

Weak1 line
1- Add auth
No scope, no constraints, no tests
Scope
Constraints
Tests
Strong4 lines
1- Internal-only auth gates every note and note-metadata surface.
2- Unauthorized users must never see content, filenames, tags, or timestamps.
3- Admin review routes require explicit permission checks and should be
4 covered by e2e tests for allowed and denied access.
The Takeaway

The weak version names a topic. The strong version scopes the actual requirement, the constraint, and the testing obligation.

What the Agents Do Automatically

Once you launch agents with good marching orders, they automatically handle the coordination mechanics. A typical bead thread in Agent Mail looks like this — created entirely by agents, not by you:

text
[br-103] Start: Failed-ingestion admin screen
Claiming br-103. Reserving admin UI files plus retry handler path.
Will send update once list view is working and retry path is wired.
[br-103] Progress: Main path wired
List view and detail view working. Now handling edge cases and tests.
[br-103] Completed
Admin screen done. List view, detail view, and retry action wired.
Auth checks in place. E2E coverage for malformed upload → admin review → retry.

You do not write these messages. The agents create them because your AGENTS.md tells them to coordinate through Agent Mail and use bead IDs as thread anchors. Your job is to monitor these threads to see if work is flowing or stuck.

7

The Operating Rhythm

This section describes what you, the human, actually do. The agents handle the coordination plumbing (Agent Mail messages, file reservations, bead status updates). Your job is to create the conditions for them to succeed.

Operating Rhythm

From plan to production in five steps

Steps 3-5 repeat for every bead. You stop thinking about the process after the second cycle.

Repeating loop
Repeat from Route

Step 1: Create an Excellent Markdown Plan

Before beads or swarms or file reservations, create a serious markdown plan. Do not settle for one quick draft from one model.

Multi-Model Synthesis
I asked 3 competing LLMs to do the exact same thing and they came up with pretty different plans which you can read below. I want you to REALLY carefully analyze their plans with an open mind and be intellectually honest about what they did that's better than your plan. Then I want you to come up with the best possible revisions to your plan that artfully and skillfully blends the "best of all worlds" to create a true, ultimate, superior hybrid version of the plan: [Paste all plans here]
Context

GPT Pro web app with Extended Reasoning, or your strongest available model

Psychology

Different frontier models have different blind spots. Competitive synthesis forces the model to admit where others are better and merge the strongest ideas.

At minimum, you want: the user-facing workflows, the important constraints, the major architectural decisions, and the testing expectations.

Step 2: Tell an Agent to Convert the Plan into Beads

You do not need to manually create every bead yourself. Tell a coding agent to do the conversion:

Plan to Beads Conversion
OK so please take ALL of that and elaborate on it and use it to create a comprehensive and granular set of beads for all this with tasks, subtasks, and dependency structure overlaid, with detailed comments so that the whole thing is totally self-contained and self-documenting. The beads should be so detailed that we never need to consult back to the original markdown plan document. Remember to ONLY use the `br` tool to create and modify the beads and add the dependencies. Use ultrathink.
Context

Claude Code with Opus

Psychology

Beads become the active source of truth for execution. Once they're strong enough, you never look back at the markdown plan.

Then polish the beads 4-6 times with fresh review passes. Each round catches things the previous round missed. This is the "measure twice, cut once" of the methodology.

Insight
Beads are executable memory. The markdown plan is the best artifact for whole-system thought. Beads are the plan after it has been transformed into a format optimized for distributed execution. Weak beads force improvisation. Rich beads make execution mechanical.

Step 3: Launch Agents with Marching Orders

Once beads are polished and your AGENTS.md is solid, start up a swarm of agents. Give each one these marching orders:

Swarm Marching Orders
First read ALL of the AGENTS.md file and README.md file super carefully and understand ALL of both! Then use your code investigation agent mode to fully understand the code, and technical architecture and purpose of the project. Then register with MCP Agent Mail and introduce yourself to the other agents. Be sure to check your agent mail and to promptly respond if needed to any messages; then proceed meticulously with your next assigned beads, working on the tasks systematically and meticulously and tracking your progress via beads and agent mail messages. Don't get stuck in "communication purgatory" where nothing is getting done; be proactive about starting tasks that need to be done, but inform your fellow agents via messages when you do so and mark beads appropriately. When you're not sure what to do next, use the bv tool mentioned in AGENTS.md to prioritize the best beads to work on next; pick the next one that you can usefully work on and get started. Use ultrathink.
Context

Every agent in the swarm gets this as their initial prompt

Psychology

Every agent is fungible and a generalist. The specifics come from AGENTS.md and the beads, not from the prompt. This generic prompt works for every project.

Stagger agent starts by at least 30 seconds to avoid the "thundering herd" problem where all agents grab the same bead. Start smaller than your ego wants to: 1 agent to learn, 2 to feel coordination, 4 for real swarm behavior.

Step 4: Tend the Swarm

Now you are the operator. On roughly a 10-15 minute cadence, check on the swarm:

  1. 1Run bv --robot-triage and check whether the top recommendation still makes sense.
  2. 2Glance through Agent Mail threads — are agents making progress or stuck?
  3. 3Look for beads stuck in in_progress without movement.
  4. 4If an agent seems confused after compaction, send: "Reread AGENTS.md so it's still fresh in your mind."
  5. 5If an agent is truly degraded, kill it and start a fresh one.

That is usually enough to keep the loop healthy without turning the human into a full-time traffic cop.

Step 5: Review, Close, Repeat

After agents finish each bead, have them review their own work:

Fresh-Eyes Review
Great, now I want you to carefully read over all of the new code you just wrote and other existing code you just modified with "fresh eyes" looking super carefully for any obvious bugs, errors, problems, issues, confusion, etc. Carefully fix anything you uncover. Use ultrathink.
Context

After each bead is implemented — run until no more bugs found

Psychology

Forces a mode switch from writing to adversarial reading while the code is still fresh. One of the cheapest quality multipliers in the whole method.

Then they move to the next bead using bv to find the most impactful one. The cycle repeats until the graph is done.

8

The Human's Job

The human is not supposed to micromanage every code edit or manually coordinate Agent Mail threads. The human is there to keep the structure clean enough that the agents can work effectively inside it.

Interactive Exhibit

Who Does What in the Core Loop

You design the system and tend the swarm. Agents do the coordination work.

Breakdown
5 you2 agents1 both
YouAgentsBoth
You
Create the Plan
Who

You

Tools

GPT Pro, Claude, Gemini web apps

Ask multiple frontier models for competing plans, then synthesize into one strong design document

This is where 85% of your thinking goes. No code yet.

What You Do

  • Create the plan and beads — this is where most of your time and thinking goes
  • Write a good AGENTS.md — this is the operating manual that makes everything else work
  • Launch agents with marching orders — the same generic prompt every time
  • Keep the bead graph honest — notice when a missing task or dependency must be added
  • Restart or redirect agents when they drift, get loopy, or lose context
  • Ask the hard question periodically (see below)

What the Agents Do (Not You)

  • Register with Agent Mail and discover other active agents
  • Claim beads and announce what they are working on
  • Reserve files before editing to prevent conflicts
  • Update bead status (in_progress, closed) as they work
  • Use bv to find the next best bead when they finish one
  • Send progress updates and completion messages in Agent Mail threads

All of this is configured once in your AGENTS.md. You do not need to manually invoke Agent Mail calls, update bead statuses, or thread bead IDs into messages. The agents do it because the operating manual tells them to.

The Reality Check

When the swarm looks active but you suspect it is not actually closing the real gap, stop and ask:

Where are we on this project? Do we actually have the thing we are trying to build? If not, what is blocking us? If we intelligently implement all open and in-progress beads, would we close that gap completely? Why or why not?

If the answer is "no," the fix is usually not more implementation effort. Revise the bead graph, add missing work, or step back into planning.

Minimum Viable AGENTS.md

Even in the smaller core-loop version, you still need a minimal AGENTS.md. It does not have to be a giant doctrine document, but it should say:

  • What the repo is for
  • What the stack is
  • Any non-negotiable safety or style rules
  • How to use Agent Mail, br, and bv (include the prepared blurbs from each tool's docs)
Treat AGENTS.md as the swarm's durable operating manual. It tells a fresh or partially confused agent how to behave, what tools exist, and what "doing a good job" looks like in this repo.
Critical
After compaction, agents must re-read AGENTS.md. If an agent still seems confused or loopy after rereading it, stop trying to rescue the degraded state and start a fresh session instead.
9

Common Failure Modes

Symptom
Agents keep overlapping
Likely Cause
AGENTS.md doesn't emphasize Agent Mail claims and reservations enough
Fix
Strengthen the coordination section in AGENTS.md
Symptom
Agents choose random work
Likely Cause
AGENTS.md doesn't explain bv usage clearly
Fix
Add bv usage instructions to AGENTS.md with the robot-flag commands
Symptom
A task keeps stalling
Likely Cause
The bead is underspecified or missing a dependency
Fix
Rewrite the bead or add the missing dependency
Symptom
The swarm feels busy but confused
Likely Cause
The markdown plan was too weak
Fix
Go back up a level and improve the plan before continuing

Agent Disappeared Mid-Bead

When an agent vanishes mid-bead, the recovery path should be boring:

  1. 1Check the Agent Mail thread for the last meaningful progress update.
  2. 2Launch a fresh agent with the standard marching orders.
  3. 3The new agent will read AGENTS.md, discover the abandoned bead via bv, and pick it up.
  4. 4If the bead was partially completed, the new agent can continue from the code state plus the thread history.
Note
If recovery feels hard, the bead or the AGENTS.md was probably too thin. That is a signal to write richer beads and a more thorough operating manual.
10

What It Feels Like Once It Clicks

At some point, the workflow stops feeling like extra ceremony and starts feeling like a calmer control surface:

  • Less duplicated work, because agents manage ownership and reservations automatically
  • Less "what should I do next?" drift, because bv keeps answering that question for the agents
  • Easier restart after context loss, because the work lives in beads and threads instead of only in chat history
  • Easier handoff, because any agent can read the bead, read the thread, and continue

That operator feeling is a good sign. It usually means the artifacts are carrying the work instead of your short-term memory.

Why This Captures Most of the Value

People often assume the magic of the Flywheel comes from the total number of tools. It does not. Most of the value comes from three things:

  1. 1Work is explicit instead of implicit
  2. 2Coordination is externalized instead of living in human memory
  3. 3Task choice is graph-aware instead of random

Those three properties are already present in the core loop. That is why the smaller system gets you surprisingly far.

When Not to Use the Core Loop

You probably do not need it for a tiny one-file change with no real dependency structure, a purely local experiment, or a quick one-agent cleanup that does not need externalized coordination.

The loop earns its keep when work has enough structure, enough ambiguity, or enough parallelism that explicit planning, explicit tasks, and explicit coordination start paying for themselves.

11

Helper Utilities: DCG, UBS & CASS

Once the core loop is running smoothly, three helper utilities significantly improve safety, quality, and learning. They are multipliers on top of the core loop, not prerequisites.

Destructive Command Guard

DCG

A Claude Code hook that blocks dangerous git and filesystem commands before execution. Sub-millisecond latency, mechanical enforcement.

Usage

Works automatically. When a dangerous command is blocked, use safer alternatives or ask the user to run it manually.

dcg test 'rm -rf /' --explain
Ultimate Bug Scanner

UBS

Multi-language bug scanner with guardrails. Run it on changed files before every commit to catch injection, unquoted variables, and other hazards.

Usage

ubs <changed-files> before every commit. Exit 0 = safe. Exit >0 = fix and re-run.

ubs $(git diff --name-only --cached)
Cross-Agent Session Search

CASS

Indexes prior agent conversations so solved problems can be reused. Finds patterns, decisions, and solutions across session history.

Usage

Search before reinventing a solution. If an agent solved a similar problem before, CASS will find it.

cass search 'auth middleware' --robot --limit 5

What You Can Ignore for Now

If you are just getting started, you do not need to master all of this immediately:

  • Large-scale session memory systems like CASS and CM
  • Big prompt libraries
  • Advanced launch tooling like ntm
  • The full exhaustive planning doctrine
  • Every supporting tool in ACFS

Those things help. Some help a lot. But they are multipliers on top of the core loop, not prerequisites. You can run the core loop with separate terminal tabs and no special session manager.

What You Should Not Ignore

Even in the smaller version, a few principles still matter a lot:

  • Do not start a swarm with only vague goals — make a real plan first
  • Do not treat beads as tiny throwaway todo lines — they need rich context
  • Do not skip the bead polishing rounds — single-pass beads are never optimal
  • Do not rely on chat scrollback as your coordination system — that is what Agent Mail is for

If you violate those, the workflow quickly degrades back into ordinary multi-agent chaos.

12

Getting Started

The First 30 Minutes

  1. 1Pick one real project, not a toy.
  2. 2Ask multiple frontier models for competing markdown plans.
  3. 3Synthesize them into one strong plan.
  4. 4Tell an agent to create beads from the plan with dependencies.
  5. 5Polish the beads 4-6 times with fresh review passes.
  6. 6Run bv --robot-triage to verify the graph makes sense.
  7. 7Launch 2-4 agents with the standard marching orders.
  8. 8Tend the swarm. Check every 10-15 minutes.

Start smaller than your ego wants to:

Mode
1 agent
What It Is Good For
Learn the artifact flow without coordination overhead
Mode
2 agents
What It Is Good For
Feel the first real coordination benefits
Mode
4 agents
What It Is Good For
Experience meaningful swarm behavior where routing and handoff matter

Try This Now

If you want to feel the method instead of only reading about it:

  1. 1Pick one real repo
  2. 2Write one serious markdown plan
  3. 3Tell an agent to create two real beads with one dependency
  4. 4Run bv --robot-next and check that the recommendation makes sense
  5. 5Launch a second agent with the marching orders and watch them coordinate

Those five steps are enough to make the core loop stop feeling theoretical.

The Cheat Card

If you want the loop on one screen, keep this:

  1. 1Plan with multiple models
  2. 2Synthesize into one markdown plan
  3. 3Tell an agent to create beads
  4. 4Polish beads 4-6 times
  5. 5Write a good AGENTS.md
  6. 6Launch agents with marching orders
  7. 7Tend the swarm every 10-15 minutes
  8. 8Have agents do fresh-eyes review after each bead
  9. 9Repeat until the graph is done

When to Graduate to the Full Flywheel

Move up to the full guide when one or more of these becomes true:

  • Your projects are large enough that you want much richer planning workflows
  • You want stronger AGENTS.md operating manuals with comprehensive tool documentation
  • You want repeatable prompt libraries and skills
  • You want better recovery from compaction and session loss
  • You want memory systems (CASS, CM) that improve the workflow over time

At that point, the bigger document stops feeling like overhead and starts feeling like leverage.

Ready for More?

Graduate to the Full Flywheel

Once the core loop feels natural, the full methodology adds richer planning workflows, memory systems, prompt libraries, and the complete Dicklesworthstone stack.