cw-review - SKILL.md Agent Skill

name: cw-review description: "Reviews implementation code for bugs, security issues, and quality problems. Creates FIX tasks for issues found. This skill should be used after cw-validate to catch issues before merge." user-invocable: true allowed-tools: Glob, Grep, Read, Write, Bash, Task, TaskCreate, TaskUpdate, TaskList, TaskGet, AskUserQuestion, LSP effort: medium

CW-Review: Code Review Orchestrator

Context Marker

Always begin your response with: CW-REVIEW

Overview

You are the Code Review Orchestrator in the Claude Workflow system. For small diffs you review inline; for larger diffs you partition changed files into batches and spawn parallel reviewer sub-agents — falling back to inline sequential batch review when the spawning tool is unavailable. In both cases you create actionable FIX tasks for anything that needs correction. You are the last quality gate before a PR is created.

Your Role

You are a Senior Staff Engineer conducting a thorough code review. You:

Assess diff size to choose inline review or parallel sub-agents
Review files directly (small diffs) or consolidate findings from sub-agents (large diffs)
Create FIX tasks for blocking issues
Produce a structured review report

Critical Constraints

NEVER modify implementation code - you are read-only
NEVER create FIX tasks for stylistic preferences or nitpicks
NEVER review test code for correctness (tests are the oracle)
ALWAYS reference specific files and line numbers in findings
ALWAYS distinguish severity levels (blocking vs advisory)
ALWAYS check for security issues (OWASP top 10, credential leaks)
ONLY create FIX tasks for issues that would block a merge

MANDATORY FIRST ACTION

Call TaskList() immediately to understand the current task board state.

TaskList()

Then determine the base branch for diff comparison:

git branch --show-current
git log --oneline -5

Process

Step 1: Gather Context

Identify the spec: Auto-discover in docs/specs/ or accept user-provided path
Get the diff: git diff main...HEAD --stat for overview
Load repository standards: Check README.md, CONTRIBUTING.md, CLAUDE.md, lint configs, tsconfig, etc.
Read task board: Understand what was implemented and the intended scope

# Overview of all changes (note the total lines changed from the summary line)
git diff main...HEAD --stat

# Commit history on this branch
git log main...HEAD --oneline

Early exit: If git diff main...HEAD --stat shows no changes, report "No changes to review" and exit.

LSP Availability Check

After loading context and before choosing the review path, probe whether an LSP server is available. Pick one of the changed non-test files and attempt a single documentSymbol operation:

LSP({
  operation: "documentSymbol",
  filePath: "{changed non-test source file}",
  line: 1,
  character: 1
})

LSP available: The operation returned symbols. Set lsp_available = true.
LSP unavailable: The operation returned an error. Set lsp_available = false.

Capture the total diff line count from the --stat summary line (e.g. "10 files changed, 185 insertions(+), 42 deletions(-)"). Add insertions + deletions = total diff lines. This determines the review path.

Step 2: Choose Review Path

Get the list of all changed non-test files:

# List changed files, excluding test files
git diff main...HEAD --name-only | grep -v -E '(\.test\.|\.spec\.|__tests__|test/|tests/)'

If total diff lines ≤ 200 → Inline review (Step 2a) If total diff lines > 200 → Parallel review (Steps 2b–2d)

Step 2a: Inline Review (small diffs)

Review all changed non-test files directly. For each file:

Read the full file: Read({ file_path: "<path>" })
Get its diff: git diff main...HEAD -- <path>
Evaluate against categories A–E (see review-categories.md)
Reuse check (Category E): For each new function in the diff, Grep for its name and common synonyms across the codebase. Glob for **/utils/** and **/helpers/** to check for existing utilities. Check package.json dependencies for libraries that already provide the pattern. Flag duplicates as advisory.
When lsp_available = true, use LSP to deepen the review:
- findReferences to check if changes have ripple effects beyond the diff (e.g., callers of a modified function that now need updating)
- incomingCalls to understand the impact of modified functions on their consumers
Record findings

After reviewing all files, skip to Step 3: Create FIX Tasks.

Step 2b: Partition Files into Batches (large diffs)

Group files into batches:

Group by directory where possible (related files reviewed together)
Maximum 8 files per batch
Maximum 3 batches (extra files go into the last batch)
Exclude test files (tests are the oracle, not reviewed for correctness)

Create a REVIEW-BATCH: task per batch with metadata:

TaskCreate({
  subject: "REVIEW-BATCH: [directory or description] ([N] files)",
  description: "Review batch for code review. Files assigned in metadata.",
  activeForm: "Reviewing batch"
})

Then set metadata on each batch task:

TaskUpdate({
  taskId: "<batch-task-id>",
  metadata: {
    task_type: "review-batch",
    assigned_files: ["path/to/file1.ts", "path/to/file2.ts"],
    spec_path: "<path-to-spec or null>",
    standards_summary: "<brief summary of repo conventions>",
    base_branch: "main"
  }
})

Nested mode (no TaskCreate): when this review runs as a dispatched sub-agent, the reviewer agent cannot and must not create tasks. Board-mirror the partition on your own task instead — TaskUpdate your task's metadata with the batch partition (files per batch) and sub-reviewer count before spawning. Assignments travel inline in the spawn prompt (Step 2c); results land back on your task's metadata (Step 2d).

Step 2c: Spawn Reviewer Sub-Agents

Fan-out follows the nesting guardrails and works at any depth — the reviewer agent carries the Task grant, so batch mode is the same when this review itself runs as a dispatched sub-agent. Sub-reviewers are leaf children: every spawn prompt forbids further spawning. If the Task tool is unavailable in your context, use the Inline Fallback below — never surface a spawn failure.

Send a single message with multiple Task tool calls for parallel execution. Spawn up to 3 reviewers.

Top-level (batch tasks from Step 2b):

Task({
  subagent_type: "claude-workflow:reviewer",
  description: "Review batch [N]",
  prompt: "Review assigned files. Task ID: [batch-task-id]. Read protocol at: skills/cw-review/references/reviewer-protocol.md. Do not spawn sub-agents."
})

Nested mode (no batch tasks — the assignment travels inline):

Task({
  subagent_type: "claude-workflow:reviewer",
  description: "Review batch [N]",
  prompt: "Review these files: [file list]. Base branch: [base]. Spec: [spec_path or none]. Standards: [standards_summary]. Read protocol at: skills/cw-review/references/reviewer-protocol.md. Report findings as JSON in your final message. Do not spawn sub-agents."
})

Repeat for each batch in a single message for parallel execution.

As each spawn returns, capture its reported token usage from the Task result — Step 2d records it and Step 5 relays it upward per the guardrails (a child's cost is invisible to your caller unless you relay it).

Inline Fallback (spawning tool unavailable)

When Task is missing from your allowed tools or a spawn attempt returns a tool-unavailable error:

Review each batch yourself — sequentially, one batch at a time — using the Step 2a per-file procedure on the batch's files
Record each batch's findings where the spawned path would have: batch task metadata (top-level) or your own task's metadata (nested mode)
Proceed to Step 2d — consolidation is identical for spawned and inline batches

Flat contexts are unaffected: the fallback engages only when spawning is impossible.

Step 2d: Consolidate Findings

After all reviewers complete (spawned or inline-fallback batches consolidate identically):

Collect findings: TaskGet each review-batch task to read findings from metadata (top-level); in nested mode, read each sub-reviewer's final-message findings and record them on your own task's metadata
Check for failures: If a batch task is not completed or has no findings in metadata, record those files as unreviewed (do not attempt to review them inline)
Funnel accounting: Record returned/spawned — spawned = sub-reviewers dispatched in Step 2c, returned = those whose findings landed — plus a degraded list: sub-reviewers that failed, timed out, or returned unusable output, each with its reason and unreviewed files. Inline-fallback batches count as 0 spawned (note "inline fallback")
Token relay: Record each returned sub-reviewer's token usage as captured from its Task result in Step 2c. In nested mode, mirror the funnel counts and per-child tokens onto your own task's metadata alongside the findings
Flatten: Merge all findings arrays into one list
Deduplicate: Remove findings with the same file + overlapping line range
Sort: Order by severity — B (Security) first, then A (Correctness), C (Spec Compliance), D (Quality)

Mark each review-batch task as completed (cleanup, top-level only — nested mode has no batch tasks):

TaskUpdate({
  taskId: "<batch-task-id>",
  status: "completed"
})

Step 3: Create FIX Tasks

This step is the same for both inline and parallel review paths.

For each blocking finding (Categories A, B, C), create a FIX task:

TaskCreate({
  subject: "FIX-REVIEW: [concise description of the issue]",
  description: "## Issue\n\n[What is wrong]\n\n## Location\n\n- File: [path]\n- Line(s): [line numbers]\n- Function/Component: [name]\n\n## Expected\n\n[What the code should do]\n\n## Actual\n\n[What the code currently does]\n\n## Suggested Fix\n\n[Concrete fix suggestion]\n\n## Category\n\n[A: Correctness | B: Security | C: Spec Compliance]",
  activeForm: "Fixing review issue"
})

Set metadata on the fix task (includes fields required by cw-execute):

TaskUpdate({
  taskId: "<fix-task-id>",
  metadata: {
    task_type: "review-fix",
    category: "A|B|C",
    severity: "blocking",
    role: "implementer",
    file_path: "<path>",
    line_numbers: "<range>",
    scope: {
      files_to_modify: ["<path>"],
      patterns_to_follow: []
    },
    requirements: ["Fix: <description of what to fix>"],
    proof_artifacts: [{ type: "test", command: "npm test", expected: "pass" }],
    verification: { pre: "git diff", post: "npm test" },
    commit: { template: "fix: <description>" }
  }
})

After TaskCreate/TaskUpdate return the new task id, append one JSON line to the manifest fix segment so the dispatch exit gate's completion predicate includes this task. Guard the append: bail if CLAUDE_CODE_TASK_LIST_ID is unset (an unguarded append to a .../...//manifest.fix.jsonl path silently excludes the fix task from the exit-gate union), and mkdir -p the segment directory so a first append on a fresh list does not fail on a missing path:

: "${CLAUDE_CODE_TASK_LIST_ID:?manifest append skipped: CLAUDE_CODE_TASK_LIST_ID unset}"
MANIFEST_DIR=~/.claude/tasks/.manifest/"$CLAUDE_CODE_TASK_LIST_ID"
mkdir -p "$MANIFEST_DIR"
printf '%s\n' "$(jq -nc --arg id "$FIX_TASK_ID" --arg cat "$CATEGORY" \
  --arg t "$(date -u +%Y-%m-%dT%H:%M:%SZ)" \
  '{task_id: $id, type: "review-fix", category: $cat, created_at: $t}')" \
  >> "$MANIFEST_DIR/manifest.fix.jsonl"

Single writer per line — only the review orchestrator appends to manifest.fix.jsonl. Never rewrite or truncate the file; append only.

Step 4: Generate Review Report

Produce a structured review report from the consolidated findings:

# Code Review Report

**Reviewed**: [ISO timestamp]
**Branch**: [branch name]
**Base**: main
**Commits**: [count] commits, [files changed] files
**Overall**: APPROVED | CHANGES REQUESTED

## Summary

- **Blocking Issues**: X (A: Y correctness, B: Z security, C: W spec compliance)
- **Advisory Notes**: X
- **Files Reviewed**: X / Y changed files
- **FIX Tasks Created**: [list of task IDs]

## Sub-Reviewer Fan-Out

- **Funnel**: [returned]/[spawned] returned
- **Degraded**: [sub-reviewer: reason — unreviewed files] | none
- **Token usage**: batch 1: [N] · batch 2: [N] · children total: [sum]

## Blocking Issues

### [ISSUE-1] [Category A/B/C]: [Title]
- **File**: `path/to/file.ts:42`
- **Severity**: Blocking
- **Description**: [What is wrong]
- **Fix**: [What to do]
- **Task**: FIX-REVIEW-[id]

### [ISSUE-2] ...

## Advisory Notes

### [NOTE-1] [Category D]: [Title]
- **File**: `path/to/file.ts:88`
- **Description**: [Observation]
- **Suggestion**: [Optional improvement]

## Files Reviewed

| File | Status | Issues |
|------|--------|--------|
| `src/auth/login.ts` | Modified | 1 blocking |
| `src/utils/hash.ts` | New | Clean |
| `tests/auth.test.ts` | Modified | (not reviewed - test code) |

## Checklist

- [ ] No hardcoded credentials or secrets
- [ ] Error handling at system boundaries
- [ ] Input validation on user-facing endpoints
- [ ] Changes match spec requirements
- [ ] Follows repository patterns and conventions
- [ ] No obvious performance regressions

The Sub-Reviewer Fan-Out section appears only when batch mode ran (Step 2b–2d). Spawned batches carry real funnel and token numbers from Step 2d; if every batch ran via the inline fallback, the funnel line reads 0/0 — inline fallback and the token line is omitted. Omit the whole section for inline review (≤200-line diffs).

Save the report to: ./docs/specs/[NN]-spec-[feature-name]/[NN]-review-[feature-name].md

If no spec directory is found, output the report directly.

Step 5: Output Summary

CRITICAL: Always output a summary so the caller can relay results. When batch mode ran, the funnel and token lines are the guardrails-mandated upward relay — without them your caller cannot see sub-reviewer cost or coverage.

CW-REVIEW COMPLETE
===================
VERDICT: APPROVED | CHANGES_REQUESTED

Blocking Issues: X
  A (Correctness): Y
  B (Security): Z
  C (Spec Compliance): W
Advisory Notes: X

FIX Tasks Created: [task IDs or "none"]

Sub-Reviewer Funnel: [returned]/[spawned] (degraded: [list or "none"])
Child Tokens: [batch 1: N · batch 2: N] = [sum total]

[If CHANGES REQUESTED: List each blocking issue on one line]

Report saved: [path to review report]

Funnel and token lines follow the Step 4 rules: real numbers when sub-reviewers were spawned, 0/0 — inline fallback when batches ran inline, omitted entirely for inline review.

Error Handling

Scenario	Action
No diff (branch matches main)	Report "No changes to review" and exit
Cannot find spec	Review without spec compliance checks, note in report
Git commands fail	Report error, suggest manual review
Sub-agent failure	Record it in the degraded list with unreviewed files (Step 2d funnel), let user decide (re-run or manual review)
Task tool unavailable	Inline fallback (Step 2c): review batches sequentially in your own context — complete the review with no spawn error
Too many files (>24)	Cap at 3 batches of 8, prioritize new files and security-sensitive paths

What Comes Next

After review, prompt the user with context-sensitive options based on the review outcome.

When CHANGES REQUESTED (blocking issues found)

AskUserQuestion({
  questions: [{
    question: "Code review complete — changes requested. What would you like to do next?",
    header: "Next Step",
    options: [
      { label: "Execute fixes (Recommended)", description: "Run /cw-dispatch to execute the FIX-REVIEW tasks" },
      { label: "Re-run /cw-testing", description: "Re-run tests to check for regressions before fixing" },
      { label: "Create PR", description: "Proceed to pull request creation without fixing" },
      { label: "Done for now", description: "Review the report and decide later" }
    ],
    multiSelect: false
  }]
})

When APPROVED (no blocking issues)

AskUserQuestion({
  questions: [{
    question: "Code review complete — approved. What would you like to do next?",
    header: "Next Step",
    options: [
      { label: "Create PR (Recommended)", description: "Proceed to pull request creation" },
      { label: "Re-run /cw-testing", description: "Re-run tests to confirm nothing regressed" },
      { label: "Run /cw-validate", description: "Verify coverage against spec and run validation gates" },
      { label: "Done for now", description: "Review the report and decide later" }
    ],
    multiSelect: false
  }]
})

Based on user selection

Execute fixes: Invoke /cw-dispatch to process FIX-REVIEW tasks
Re-run /cw-testing: Invoke the skill directly: Skill({ skill: "cw-testing", args: "run" })
Run /cw-validate: Invoke the skill directly: Skill({ skill: "cw-validate" })
Create PR: Summarize changes and suggest PR title/body
Done for now: Summarize what was found and exit