name: mk:cook description: 'Orchestrates single-task implementation pipeline: plan → test → build → review → ship. Use for feature work, plan execution, or fixes scoped to a single task. NOT for green-field product builds (see mk:autobuild); NOT for auto-invoked workflow orchestration (see mk:workflow-orchestrator).' source: local version: 1.0.0 argument-hint: '[task|plan-path] [--interactive|--fast|--parallel|--auto|--no-test|--tdd|--verify|--strict|--no-strict]' keywords: - cook - implement-pipeline - plan-execution - feature-build - seven-phase - single-task when_to_use: Use when implementing a single task end-to-end (plan→test→build→review→ship). NOT for green-field product builds (see mk:autobuild) or auto-invoked workflow orchestration (see mk:workflow-orchestrator). user-invocable: true owner: lifecycle criticality: critical status: active runtime: claude-code
Cook — Full Implementation Pipeline
End-to-end implementation following the 7-phase workflow. TDD is opt-in via --tdd.
Usage
/cook <natural language task OR plan path>
/cook "Add user auth" --fast
/cook "Build payment processor" --tdd # Strict TDD enforced
/cook tasks/plans/260329-feature/plan.md --auto
Flags: --interactive (default) | --fast | --parallel | --auto | --no-test | --tdd | --verify | --strict | --no-strict
Modifier flags (layer on any mode): --verify [LIGHT] (light browser/artifact check) | --strict [HEAVY] (full evaluator pass) | --no-strict (suppress auto-strict)
Concrete cost depends on the inner harness, model tier, and target surface; treat [LIGHT] vs [HEAVY] as relative ordering only.
--verify is advisory (does not block ship). --strict is a hard gate (FAIL blocks ship and routes back to Phase 3).
TDD mode (--tdd flag)
When --tdd is detected in the invocation, the cook skill MUST write on to .claude/session-state/tdd-mode via a Bash tool call BEFORE any other workflow step:
mkdir -p .claude/session-state && echo on > .claude/session-state/tdd-mode
This sentinel file is read by pre-implement.sh, tdd-detect.sh, and downstream agents to detect TDD mode. Without --tdd, the sentinel is absent and the workflow runs in default mode (Phase 2 optional, no RED-phase gate).
MEOWKIT_TDD=1 env var (set in CI / shell rc) is the highest-precedence opt-in and overrides the sentinel.
When loading an existing plan, scan plan.md and phase files for tdd: true, ## Tests Before, or ## Regression Gate. If any are present but this invocation lacks --tdd and MEOWKIT_TDD is not enabled, warn:
Plan contains TDD sections but cook is not in TDD mode. Re-run with --tdd to enforce RED-first execution, or continue in default mode with TDD guidance only.
HARD GATE
Do NOT write implementation code until a plan exists and Gate 1 is approved.
In TDD mode (--tdd / MEOWKIT_TDD=1): do NOT skip Test RED phase — write failing tests BEFORE implementation.
In default mode: Phase 2 is optional; the developer may implement directly per the approved plan.
Exception: --fast mode skips research but still requires plan + (in TDD mode) TDD-flavored tests.
User override: If user explicitly says "just code it" or "skip planning", respect their instruction.
Anti-Rationalization
Generic implementation-phase rationalizations live in .claude/rules/anti-rationalization.md — read before any non-trivial implementation step. Cook-specific addition (TDD mode):
| Thought | Reality |
|---|---|
| "Tests can come after" | In TDD mode (--tdd), no — failing tests define the spec. In default mode, yes — tests after is permitted. Choose your mode explicitly. |
Before starting work, read references/failure-catalog.md for common implementation failure modes (cross-linked to ## Gotchas below).
Smart Intent Detection
See references/intent-detection.md for full detection logic.
Green-field product build (new kanban app, full SaaS from scratch, multi-sprint autonomous build)? Use
mk:autobuildinstead. Cook handles single features, fixes, and refactors; harness owns the generator↔evaluator loop and adaptive scaffolding density.
| Input Pattern | Mode | Behavior |
|---|---|---|
Path to plan.md / phase-*.md |
code | Execute existing plan |
| "fast", "quick" | fast | Skip research, plan→test→code |
| "trust me", "auto" | auto | Auto-fix issues, human gates still enforced |
| 3+ features OR "parallel" | parallel | Multi-agent execution |
| "no test", "skip test" | no-test | Skip Test phase entirely (force off, even if --tdd) |
| Default | interactive | Full workflow with user approval at each gate |
--verify |
(modifier) | Light browser check after review (Phase 4.5) |
--strict |
(modifier) | Full evaluator after review (Phase 4.5) |
--no-strict |
(modifier) | Suppress auto-strict from scale-routing |
Process Flow (Authoritative)
flowchart TD
A[Phase 0: Orient] --> B{Has plan?}
B -->|Yes| D[Load Plan]
B -->|No| SP[Present scout summary to user]
SP --> C[Phase 1: Research + Plan]
C --> RQ{5 dims captured?}
RQ -->|No| C
RQ -->|Yes| G1[GATE 1: Human Approval]
G1 -->|approved| D
G1 -->|rejected| C
D --> E[Phase 2: Test - RED if --tdd, optional otherwise]
E --> F[Phase 3: Build GREEN]
F --> S[mk:simplify — MANDATORY]
S --> V[mk:verify — MANDATORY]
V --> G2[Phase 4: Review — GATE 2]
G2 -->|approved| BV{--verify or --strict?}
BV -->|"--verify"| P45A["Phase 4.5 --verify advisory"]
BV -->|"--strict"| P45B["Phase 4.5 --strict blocking"]
BV -->|neither| H[Phase 5: Ship]
P45A --> H
P45B --> BG{evaluator FAIL?}
BG -->|no| H
BG -->|yes| F
G2 -->|rejected| F
H --> I[Phase 6: Reflect]
This diagram is authoritative. If prose conflicts, follow the diagram.
Scout-First Contract (Phase 0)
Before any clarifying question or plan generation, present a 3–6 bullet codebase-context summary to the user:
- Project type / language / framework
- Existing modules/files relevant to the task
- Current patterns/conventions for similar features
- In-flight plans (
tasks/plans/) or docs (docs/) covering this area - Public APIs / schemas / contracts the task could affect
Skip when input is a plan.md / phase-*.md path — the plan already encodes scout output.
In --fast mode the summary is abbreviated (1–3 bullets) but still presented; "skip research" reduces external research depth, not codebase-context handoff to the user.
Exact-Requirements Contract (Phase 1)
mk:plan-creator MUST be able to answer all 5 dimensions in concrete sentences before returning a plan:
- Expected output — concrete artifact(s) the user will see (paths, behavior, endpoint, CLI flags)
- Acceptance criteria — specific inputs → outputs / edge cases
- Scope boundary — what is explicitly OUT of scope this round
- Non-negotiable constraints — stack, file locations, naming, compat, deadlines, performance
- Touchpoints — which existing files (from scout) will be modified / which contracts must stay stable
Every clarifying-question option MUST cite scout findings (e.g., file paths). Vague abstract options are a failure mode — see references/failure-catalog.md.
Skip when input is a plan.md / phase-*.md path (plan encodes the 5 dimensions).
Workflow Modes
| Mode | Research | TDD |
|---|---|---|
| interactive | Yes | RED-strict |
| auto | Yes | RED-strict |
| fast | Skip | Plan-level |
| parallel | Optional | RED-strict |
| no-test | Yes | Skip |
| code | Skip | RED-strict |
Gate 1 routing, parallelism, and full per-phase progression live in references/intent-detection.md (canonical Mode Behaviors table).
Gate 2: human approval mandatory in all modes — see .claude/rules/gate-rules.md for the full contract. If the reviewer surfaces a regression / side effect / broken workflow, follow the Regression Recovery Options pattern in references/review-cycle.md rather than silently patching.
Required Subagents
| Phase | Subagent | When |
|---|---|---|
| 0 Orient | mk:scout |
Codebase mapping |
| 1 Plan | mk:plan-creator, researcher |
Research + planning |
| 2 Test | tester via mk:testing |
TDD mode (--tdd): MUST spawn — write failing tests. Default mode: optional (skip unless requested) |
| 3 Build GREEN | developer |
Implementation |
| 3 Build GREEN | developer via mk:investigate |
Root-cause analysis when tests fail after 3 self-heal attempts |
| 3.5 Simplify | developer via mk:simplify |
MANDATORY after Phase 3 GREEN — run before verify |
| 3.6 Verify | mk:verify |
MANDATORY after simplify — unified build+lint+test+coverage check before Phase 4 |
| 4 Review | reviewer via mk:review |
MUST spawn — Gate 2 |
| 4.5 Verify | browser-automation subagent or HTTP verification tool |
Only if --verify flag (light browser check) |
| 4.5 Verify | evaluator via mk:evaluate |
Only if --strict flag or auto-triggered |
Concrete subagent name and HTTP tool depend on installed skill set; cook treats both as interfaces, not specific implementations.
| 5 Ship | shipper via mk:ship | MUST spawn — runs full pre-ship pipeline (git-manager invoked inside shipper for commit + PR) |
| 6 Reflect | analyst | MUST spawn — cost + pattern analysis |
| 6 Reflect | documenter | MUST spawn — sync-back + docs |
| 6 Reflect | mk:memory session-capture | MUST spawn — 3-category learning extraction |
During iterative build-test-fix cycles, follow references/loop-safety-protocol.md for checkpoint tracking, stall detection, and escalation triggers.
See references/subagent-patterns.md for Task() invocation patterns.
See references/workflow-steps.md for detailed per-phase instructions.
See references/review-cycle.md for review gate logic.
Simplify Step (Mandatory)
After Phase 3 (Build GREEN) completes, run mk:simplify to reduce complexity before Phase 4 (Review). This is mandatory — do not skip.
Rationale: Review quality improves when code is already simplified. Reviewers catch logic errors better when complexity is low. Running simplify after tests are green (but before review) ensures the reviewed code is the final, clean version.
Verify Step (Mandatory)
After mk:simplify completes, run mk:verify for a unified build→lint→type-check→test→coverage check. This confirms simplification didn't break anything and gives the reviewer a clean signal. By default mk:verify scopes lint and tests to the changed files (build and type-check stay whole-program); the complete whole-project run happens at ship time. Pass --full only when a full sweep is needed here.
If mk:verify FAILS after simplify: send back to developer to fix, then re-run verify before proceeding to Phase 4. Do not skip verify or proceed to review with a failing verify result.
Status Report (Post-Gate 2)
After Gate 2 verdict PASS and before Phase 5 ship, delegate to project-manager per .claude/rules/post-phase-delegation.md Rule 1 (background — include "Run in the background" in the prompt). Status report is co-located at {plan-dir}/status-reports/{YYMMDD}-status.md. Do not block on PM — continue to ship. Skipped automatically when MEOWKIT_PM_AUTO=off.
Related Rules
.claude/rules/gate-rules.md— Gate 1 (Plan) and Gate 2 (Review) hard-stop conditions this skill enforces across all modes.claude/rules/post-phase-delegation.md— PM delegation fire points and skip conditions.claude/rules-conditional/workflow-evidence-rules.md— workflow evidence index (traceability over existing Phase 0-6 outputs; mirrors the gate scripts, never approves). Seereferences/workflow-steps.md→ Workflow Evidence Index
Gotchas
Operational gotchas live here; pre-build failure modes and rationalizations live in references/failure-catalog.md.
- Skipping mk:simplify before review: Tests pass but code is still complex → run
/mk:simplifybetween Phase 3 and Phase 4 every time, no exceptions - Skipping Gate 1 on "simple" features: Features that seem simple grow during implementation. Always create a plan file; cancel it if truly trivial
- Auto-approve sneaking bugs past Gate 2: "auto" means automatic execution between gates, NOT automatic approval. Auto mode auto-fixes but NEVER auto-approves Gate 2 — gate-rules.md says NO exceptions. The workflow evidence index must be complete before Gate 2 is presented
- Context loss between phases: Long multi-phase workflows exceed the context window — the external-memory write after each completed plan phase is the load-bearing action that lets a fresh session resume the correct phase. This is a mandatory step, not a habit: see the Per-Phase Checkpoint in
references/workflow-steps.md(Phase 6) — flip the phase-file checkboxes and update plan.md Agent State before any next-phase transition - Parallel mode deadlocks: Phase dependencies cause deadlock when phase-03 waits for phase-02 results. Map dependency graph before spawning parallel agents
- Code mode on stale plans: Running old plan against changed codebase. Warn if plan.md is >14 days old
- Fast mode shallow test coverage: Skipping research means tests capture plan-level intent, not edge cases. Document: "fast mode = TDD-flavored, coverage may be lower"
- Missing model tier declaration: Expensive models on trivial tasks, cheap models on security-critical work. Always declare tier in Phase 0
- Forgetting memory read/write: Prior session learnings lost. Phase 0 reads canonical
.claude/memory/*.jsonstores only for fix memory; Phase 6 writes canonical JSON stores and regenerates generated views when needed - Subagent patterns using Agent() not Task(): Task() enables tracking, blocking, and progress. Always use Task() for phases 2-6
- --strict cost surprise:
--strictspawns the full evaluator (mk:evaluate) — a [HEAVY] pass. Auto-triggered by scale-routinglevel=high. Use--no-strictto suppress, or--verifyfor the [LIGHT] check - --strict vs --verify confusion:
--verify= light browser check (advisory).--strict= full evaluator with rubrics (FAIL blocks ship).--strictsupersedes--verify