subagent-driven-development - SKILL.md Agent Skill

name: subagent-driven-development description: Use when executing implementation plans with independent tasks in the current session

Skill: Subagent-Driven Development

When

You have an implementation plan with mostly-independent tasks and want to execute them in-session via fresh subagents with two-stage review.

NOT for:

If no subagent dispatch capability is available → use executing-plans instead (single-agent sequential)
If the work is iterative self-correction without structured plan tasks → use loop instead

CLI Primer: arcs --commands --json for discovery. Mutating commands run directly — no token.

Flow

flowchart TD
    A[Load plan + extract all tasks] --> B{Diagram .mmd exists?}
    B -->|yes| C[arcs diagram ready → get parallel-safe nodes]
    B -->|no| D[Pick next task sequentially]
    C --> E[Read node metadata: skill, scope, acceptance, verify]
    D --> F[Construct subagent prompt]
    E --> F
    F --> G[Dispatch implementer subagent]
    G --> H{Subagent status?}
    H -->|DONE / DONE_WITH_CONCERNS| I[Dispatch spec-reviewer subagent]
    H -->|NEEDS_CONTEXT| J[Provide context, re-dispatch]
    H -->|BLOCKED| K{Assess blocker}
    J --> G
    K -->|context gap| J
    K -->|needs stronger model| L[Re-dispatch with capable model]
    K -->|task too large| M[Split task, update plan]
    K -->|plan wrong| N[Escalate to human]
    L --> G
    I --> O{Spec compliant?}
    O -->|no| P[Implementer fixes spec gaps] --> I
    O -->|yes| Q[Dispatch code-quality reviewer]
    Q --> R{Quality approved?}
    R -->|no| S[Implementer fixes quality issues] --> Q
    R -->|yes| T[Mark task done]
    T --> U{More tasks?}
    U -->|yes| B
    U -->|no| V[Dispatch devil-advocate PHASE: completion — the single full-project pass]
    V -->|PASS| W[Report completion]
    V -->|BLOCK| X[Re-dispatch scoped fixes per FAILURES] --> V

Gate cap: two consecutive completion BLOCKs → stop and escalate to human; never loop the V→X cycle a third time.

Under the ARCS orchestrator: the orchestrator's devil-advocate PHASE: execute gate replaces the code-quality reviewer step (the gate runs the scoped VERIFY and the drift check); spec review remains. Running standalone, keep both reviewer stages as drawn.

Retry & Escalation

flowchart TD
    A[Subagent fails] --> B{First attempt?}
    B -->|yes| C[Re-dispatch with more context or stronger model]
    B -->|no| D{Same failure mode?}
    D -->|yes| E[Escalate to human — don't loop]
    D -->|no| C
    C --> F{Success?}
    F -->|yes| G[Continue pipeline]
    F -->|no| D

Diagram-First Dispatch

When the plan has a .mmd file:

arcs diagram ready <slug> <planId> → all returned nodes are dispatch-safe in parallel
Use per-node %% metadata (skill, scope, files, acceptance, verify) to construct prompts
After completion: arcs task transition <slug> <taskId> done --diagramNodeId=T001 --planId=<planId>
Re-run diagram ready to discover newly-unblocked nodes
If node metadata is incomplete, fall back to reading the plan body for that task

Ownership: Dispatcher owns .mmd updates. Implementer subagents MUST NOT edit diagrams.

Sub-Agent Prompt Construction

Every implementer subagent prompt MUST include:

Section	Content
Goal	Exact task description from plan (full text, not summary)
Context	Where this task fits in the plan; what came before
Scope	File boundaries — what to touch, what NOT to touch
Acceptance	Done criteria copied verbatim from plan/diagram
Verify	Exact command to run before claiming done — scoped to the task's files, never the full suite
Skill	Which work-mode skill to load (from diagram metadata or inferred)
Return	Structured Return envelope (below) — brief prose findings first, JSON block last

Do NOT make the subagent read the plan file. Provide full text in the prompt.

Model Selection

Task complexity	Model tier
1-2 files, clear spec, mechanical	Fast/cheap
Multi-file integration, pattern matching	Standard
Architecture, design, review	Most capable

Prompt Templates

./implementer-prompt.md
./spec-reviewer-prompt.md
./code-quality-reviewer-prompt.md

Structured Return

All sub-agents MUST return a JSON block as the LAST thing in their message — brief prose findings first, JSON block last, nothing after it:

{
  "status": "DONE | DONE_WITH_CONCERNS | BLOCKED | NEEDS_CONTEXT",
  "summary": "<1-2 sentences>",
  "payload": { "<role-specific fields per prompt template>": "..." }
}

Role payloads: implementer → filesChanged/filesCreated/verification{command,result,scopeReason}/concerns/scopeChanges; spec reviewer → compliant/issues; quality reviewer → approved/issues. Orchestrator parses status for routing, payload for action. Mapping to the orchestrator's Standard Return Envelope: DONE→done, DONE_WITH_CONCERNS→done + concerns surfaced, BLOCKED→blocked, NEEDS_CONTEXT→blocked.

Include in every dispatch prompt:

"Return format: brief prose findings first, then the JSON envelope (status + typed payload) from your role's prompt template as the LAST thing in your message — nothing after it."

Git State Discipline

Sub-agents MUST NOT run git stash — ever, under any circumstance
Sub-agents MUST NOT run git checkout on shared branches
Sub-agents commit their changes atomically (scoped to task files) before reporting back
Other agents may be working concurrently — do not assume a clean worktree
Use git diff HEAD -- <your-files> to verify YOUR changes only — bare git diff is unreliable in parallel
If you see unexpected changes in files outside your scope: ignore them — they belong to another agent

Verification Scoping

Sub-agents lint and test only files they touched:

Scope	Command	NOT this
Lint	`biome check src/changed.ts`	`biome check .`
Test	`vitest run test/changed.test.ts`	`vitest run` / `npm test`
Type check	`tsc --noEmit` — read-only signal; out-of-scope errors are report-only	fixing type errors outside your scope

Sub-agents NEVER run the full suite — not even for pervasive changes. If a change is pervasive (shared types, config, build), record it in scopeChanges/concerns; the orchestrator defers full-project verification to the devil-advocate completion gate. Type errors or test failures in files outside your scope are report-only — never fix them; the authoritative project-wide tsc run belongs to that gate. Sub-agent must state scopeReason in return payload.

Parallelism Rules

Parallel implementers are allowed when tasks touch zero shared files.

Independence check: Orchestrator verifies no file overlap before dispatch. If overlap → serialize.
Batch limit: Maximum 4 concurrent subagents per round. Queue remaining.
Prompt construction: Per the Sub-Agent Prompt Construction table above — all rows required.
Conflict detection: After fan-out completes, check for conflicting edits before committing.
Shared context: Fetch once (e.g., project brief), inject into all subagent prompts — don't make each agent re-fetch.

When to serialize instead:

Tasks share source files (even different functions in same file)
Task B's approach depends on Task A's output
Both tasks modify test fixtures or shared mocks

Constraints

Fresh subagent per task — never reuse session context
Spec review BEFORE code quality review (never reverse)
Parallel implementers only when zero file overlap (orchestrator verifies)
Never skip re-review after fixes
Never ignore BLOCKED/NEEDS_CONTEXT status — something must change
Never start on main/master without explicit user consent
If reviewer finds issues → implementer fixes → reviewer re-reviews → repeat until approved
DONE_WITH_CONCERNS: read concerns before proceeding; address if correctness/scope related
Scope changes discovered by subagents: report in summary, dispatcher handles diagram regeneration