name: absolute-work version: 0.5.0 description: > End-to-end, phase-gated SDLC for AI coding agents: relentless design interview → reviewed spec → dependency-graphed task board → safe-wave TDD execution → verification → converge. Handles features, bugs, refactors, greenfield projects, planning breakdowns, and migrations. Triggers on "absolute work", "build this end-to-end", "plan and build", "break this into tasks", "grill me on this plan", "pick up this ticket", "run this migration". category: workflow tags: - workflow - sdlc - planning - tdd - spec - specification platforms: - claude-code - gemini-cli - openai-codex - mcp user-invocable: true argument-hint: "[target]" license: MIT maintainers: - github: maddhruv
Start your first response with the 🛠️ emoji.
Absolute Work: End-to-End AI Development Lifecycle
Absolute Work takes any unit of work — a ticket, a task, a plan, a migration — from fuzzy intent to verified code. It is one continuous skill with hard gates between phases: brainstorm a shared design, write and review a spec, decompose into a dependency-graphed task board, then peel tasks off one safe wave at a time with test-first verification. Nothing is assumed, nothing is silently expanded, and no code is written until the design is approved.
The lifecycle has 6 phases: INTAKE & BRAINSTORM → SPEC → DECOMPOSE & PLAN → EXECUTE → VERIFY → CONVERGE
Activation Banner
At the very start of every Absolute Work invocation, before any other output, display this ASCII art banner:
█████╗ ██████╗ ███████╗ ██████╗ ██╗ ██╗ ██╗████████╗███████╗
██╔══██╗██╔══██╗██╔════╝██╔═══██╗██║ ██║ ██║╚══██╔══╝██╔════╝
███████║██████╔╝███████╗██║ ██║██║ ██║ ██║ ██║ █████╗
██╔══██║██╔══██╗╚════██║██║ ██║██║ ██║ ██║ ██║ ██╔══╝
██║ ██║██████╔╝███████║╚██████╔╝███████╗╚██████╔╝ ██║ ███████╗
╚═╝ ╚═╝╚═════╝ ╚══════╝ ╚═════╝ ╚══════╝ ╚═════╝ ╚═╝ ╚══════╝
██╗ ██╗ ██████╗ ██████╗ ██╗ ██╗
██║ ██║██╔═══██╗██╔══██╗██║ ██╔╝
██║ █╗ ██║██║ ██║██████╔╝█████╔╝
██║███╗██║██║ ██║██╔══██╗██╔═██╗
╚███╔███╔╝╚██████╔╝██║ ██║██║ ██╗
╚══╝╚══╝ ╚═════╝ ╚═╝ ╚═╝╚═╝ ╚═╝
Follow the banner immediately with: Entering plan mode — phase-gated lifecycle active
The Phase Gate Rule
Absolute Work STOPS at the end of every phase and waits for the user's explicit "go" before advancing. This is non-negotiable. The phases are:
INTAKE & BRAINSTORM ─┃ gate ┃─ SPEC ─┃ gate ┃─ DECOMPOSE & PLAN ─┃ gate ┃─ EXECUTE ─┃ gate per wave ┃─ VERIFY ─┃ gate ┃─ CONVERGE
At each gate, present what was produced, summarize what comes next, and ask the user
to confirm before proceeding. Never chain two phases without an approval in between.
Use AskUserQuestion (where available) for every gate and every interview question.
Activation Protocol
Immediately after the banner, enter plan mode before doing anything else:
- On platforms with native plan mode (e.g. Claude Code's
EnterPlanMode): invoke it immediately. - On platforms without it: simulate plan mode — complete INTAKE & BRAINSTORM and SPEC fully, write no code, and get explicit approval before EXECUTE.
The first three phases are planning work. No files are created or modified (other than the spec and the board) until the user approves the task graph and execution begins.
Session Resume Protocol
When Absolute Work is invoked and a .absolute-work/board.md already exists in the project root:
- Detect: Read the board and determine its status.
- Display: Print a compact summary of completed / in-progress / blocked / remaining tasks.
- Resume: Pick up from the last incomplete wave — do NOT restart from INTAKE.
- Reconcile: If the codebase changed since the last session, diff against the board's expected state and flag conflicts before resuming.
If the board is completed, ask whether to start a new session (archive the old board to
.absolute-work/archive/) or review the finished work. Never blow away an existing board
without explicit user confirmation.
Codebase Convention Detection
Before INTAKE begins, auto-detect the project's conventions so every phase is grounded
in reality, not assumptions. If .absolute.config.json or ~/.absolute/config.json exists
(from /absolute init), read its cached conventions first and detect only what's missing.
| Signal | Files to Check |
|---|---|
| Package manager | package-lock.json (npm), yarn.lock, pnpm-lock.yaml, bun.lockb, Cargo.lock, go.sum |
| Language/Runtime | tsconfig.json, pyproject.toml / setup.py, go.mod, Cargo.toml |
| Test runner | jest.config.*, vitest.config.*, pytest.ini, .mocharc.*, test directory patterns |
| Linter/Formatter | .eslintrc.*, eslint.config.*, .prettierrc.*, ruff.toml, .golangci.yml |
| Build system | Makefile, vite.config.*, next.config.*, turbo.json |
| CI/CD | .github/workflows/, .gitlab-ci.yml, Jenkinsfile |
| Available scripts | scripts in package.json, Makefile targets |
| Directory conventions | src/, lib/, app/, tests/, __tests__/, spec/ |
Write detected conventions to the board under ## Project Conventions. Reference them in
every later phase — especially PLAN and the mandatory verification tail tasks. Always run
verification through the project's own scripts (npm test, make lint), never raw tools.
When to Use This Skill
Use Absolute Work when:
- Picking up a ticket or task that needs design before implementation
- Multi-step feature development touching 3+ files or components
- "Build this end-to-end", "plan and execute this", "break this into tasks"
- Greenfield projects, major refactors, or migrations
- Planning/breakdown work — turning a vague goal into a sequenced task list
- Complex bug fixes spanning multiple systems
- The user wants to be grilled on a design before building
Do NOT use Absolute Work when:
- Single-file bug fixes or typo corrections where the answer is obvious
- Quick questions, code explanations, or pure research
- Tasks the user explicitly wants to drive manually
Key Principles
- Phase gates always. Stop and get explicit approval between every phase. Control over speed.
- Codebase before questions. Search the code first; only ask what code genuinely cannot answer.
- Relentless until aligned. Interview one question at a time until BOTH you and the user are 100% confident. Doubt on either side means keep going.
- Spec before code. No implementation until a written spec is reviewed and approved.
- Dependency-first decomposition. Every task is a node in a DAG, not a flat list.
- Safety-first execution. Blockers and dependents run sequentially; only provably-independent tasks parallelize. When in doubt, serialize. (See
references/execution-model.md.) - Test-first verification. Every task writes tests before implementation. "Done" means tests pass.
- Generator ≠ evaluator. The agent that builds a task does not grade it.
- Persistent state. All progress lives in
.absolute-work/board.md, surviving across sessions. - No silent scope creep. Everything outside the agreed scope goes to Deferred Work, visible on the board.
- Never auto-commit. Suggest a commit; the user commits.
Phase 1: INTAKE & BRAINSTORM (Relentless Design Interview)
Turn fuzzy intent into a shared, bulletproof design. This is a structured interrogation of every assumption, dependency, and design branch — not a casual chat.
The interview directive — operate by this verbatim:
Interview me relentlessly about every aspect of this plan until we reach a shared understanding. Walk down each branch of the design tree, resolving dependencies between decisions one-by-one. For each question, provide your recommended answer.
Ask the questions one at a time.
If a question can be answered by exploring the codebase, explore the codebase instead.
Step 1 — Deep context scan
Read what exists before asking anything: docs/ (README first), root README.md, CLAUDE.md,
CONTRIBUTING.md, docs/plans/ (overlapping designs), recent commits (last 10-20), package
manifests, top-level structure. Synthesize what matters — do not dump a file listing.
Step 2 — Codebase-first intelligence
Before asking ANY question, check if the codebase answers it. Facts live in code
(database, test framework, auth); preferences require asking (visual style, real-time vs batch).
When code answers it, state what you found: "I see you're using Prisma with PostgreSQL — I'll
design around that." See references/intake-playbook.md.
Step 3 — Detect the work TYPE and adapt
Identify the type and swap in its tailored question bank (full banks in references/intake-playbook.md):
| Type | Focus |
|---|---|
| Feature | user problem, flow, happy/error paths, scope boundary |
| Bug | repro steps, expected vs actual, blast radius, fix criteria |
| Refactor | pain point, target state, blast radius, test safety net, incremental vs all-at-once |
| Greenfield | problem/user fit, v1 scope, stack, data model, deploy target |
| Planning / breakdown | goal, milestones, sequencing, what ships first |
| Migration | what→what, coexistence, rollback, breaking changes, call-site inventory — load references/migration-playbook.md |
Step 4 — Scope assessment
If the request spans multiple independent subsystems, flag it and decompose into sub-projects first; brainstorm the first sub-project through the normal flow.
Step 5 — Relentless interview
- One question at a time via
AskUserQuestion. Never batch. - Strictly linear — resolve decision A before asking about dependent decision B.
- Walk the design tree depth-first — purpose → data model → behavior → UI → edge cases. Every branch has an error/edge-case child; walk it.
- Honest options — only propose multiple approaches at a genuine fork; always mark one (Recommended) with rationale tied to project context. When the answer is obvious, present it and briefly say why alternatives were dismissed.
- Mutual 100% confidence — after each decision, confirm both sides are sure. Hesitation means probe deeper.
Step 6 — Confidence self-check
Before presenting the design, review every decision: am I 100% sure, or filling gaps with assumptions? Any sub-100% decision → return to the interview. State your confidence to the user.
Step 7 — Design presentation
Present section by section (architecture, components, data flow, error handling, testing), scaled to complexity. Get approval per section. Design for isolation: small units, one clear purpose each, well-defined interfaces. Follow existing patterns; don't fight the codebase.
━━ GATE: user approves the full design before Phase 2. ━━
Phase 2: SPEC (Spec-Driven Development)
Write the approved design to docs/plans/YYYY-MM-DD-<topic>-design.md (clear prose, file
paths, code blocks for schemas/interfaces, a Decision Log). Scale sections to complexity.
Then run a scored spec review with a separate reviewer subagent (generator-evaluator separation): graded on Completeness, Consistency, Clarity, Scope, Testability (1-5 each).
- 4.0+ → approved, proceed to user review
- 3.0-3.9 → fix flagged issues, re-dispatch (max 3 iterations)
- < 3.0 → surface to the user immediately
See references/spec-writing.md for the template, scaling rules, and review rubric.
━━ GATE: user reviews and approves the spec before Phase 3. ━━
Phase 3: DECOMPOSE & PLAN (Build the Task Board)
Break the spec into atomic sub-tasks, build the dependency graph, and write the board.
Decompose
Each sub-task has: ID (AW-001), Title (action-oriented), Description (2-3 sentences),
Type (code | test | docs | infra | config), Size (S < 50 lines | M 50-200;
no L — decompose further), Dependencies (task IDs).
Rules: test tasks separate from code; infra/config before dependents; aim for 5-15 tasks; every
graph ends with the three mandatory tail tasks (Self Code Review → Requirements Validation →
Full Project Verification — see references/verification-framework.md). Apply the complexity budget:
if scope exceeds ~15 M-equivalent tasks, suggest splitting into multiple sessions.
Build the DAG and assign safe waves
Compute each task's depth (max(dependency depth) + 1) and group by depth into waves. Then apply
the safety pass: within a wave, only tasks that touch disjoint files and share no
interfaces may run in parallel; everything else is serialized. Assign shared-file ownership to a
single task. When in doubt, serialize. See references/execution-model.md and references/board-format.md.
Per-task plan
For each task: files to create/modify, test files (TDD — written first), approach, acceptance
criteria, and concrete test cases (happy path, edge, error). Write everything to
.absolute-work/board.md. Ask the user during intake whether the board is git-tracked or gitignored.
Present the ASCII dependency graph + wave/sequence plan.
━━ GATE: user approves the task graph before Phase 4. ━━
Phase 4: EXECUTE (Onion-Peel, One Safe Wave at a Time)
Pre-execution snapshot
Before touching any file: ensure the tree is clean (commit or stash), record the current commit
hash on the board under ## Rollback Point. This is the safety net for catastrophic failure.
Wave loop
for each wave in [Wave 1 … Wave N]:
partition tasks: sequential (blockers/dependents/shared-file) vs parallel-safe (disjoint, independent)
run sequential tasks in dependency order, one at a time
run parallel-safe tasks concurrently (separate agents)
each task → TDD: write failing tests (red) → implement (green) → refactor → update board
wave boundary: conflict check + compact progress report
━━ GATE: confirm before starting the next wave ━━
Each agent gets a self-contained prompt from the board (conventions + research + plan +
acceptance criteria) and the rule: write tests first, stay in scope, report blockers — never work
around them. Scope creep: blocking discoveries become new visible tasks; non-blocking ones go to
## Deferred Work. See references/execution-model.md for the agent template, conflict
resolution, blocked-task handling, and failure recovery.
Phase 5: VERIFY (Signals + Independent Evaluator)
Every task proves it works before closing, using two layers:
- Signals (binary gate) — run the project's test, lint, typecheck, and build scripts. Any
failure goes straight back to the generator to fix (up to 2 retries). No skipping tests, no
@ts-ignore/eslint-disable/type: ignoreto pass checks. - Evaluator (scored gate) — if signals pass, a separate evaluator subagent grades the work against a scored rubric (Correctness, Code Quality, Completeness, Test Coverage, Integration Safety). 4.0+ passes; 3.0-3.9 iterates on specific feedback (max 5); < 3.0 escalates to the user.
S-size tasks may skip the evaluator if all signals pass cleanly; M-size, failed, or
shared-interface tasks always get it. See references/verification-framework.md.
After the final wave, run the full suite and the three mandatory tail tasks.
━━ GATE: present verification results before Phase 6. ━━
Phase 6: CONVERGE (Close Out)
- Full suite — run the complete test/lint/build one final time.
- Documentation — update any docs that were in scope.
- Summary — files changed (with line counts), tests added, key decisions, deferred work.
- How to test it — end every session with concrete, copy-pasteable steps the user can run to exercise the added functionality themselves: the exact commands to start the app/script, the inputs or routes to hit (
curl, UI clicks, CLI invocation), and the expected output for each. Cover the happy path plus at least one edge/error case. Ground every command in the detected conventions (real scripts, real ports, real file paths) — never invent commands the project does not have. - Close board — mark
completedwith a timestamp; the board is the audit trail. - Suggest commit — propose a message. Never run
git commityourself.
Gotchas
- Chaining phases without a gate. The whole point is control — never advance past a phase boundary without the user's explicit "go".
- Asking questions the codebase answers. Search configs, deps, and test files before every question; it erodes trust to ask what the code already states.
- Parallel agents editing shared files. Two same-wave tasks editing one utility produce a wave-boundary conflict. Detect shared files in DECOMPOSE, assign ownership, and serialize the rest. Default to sequential when unsure.
- Rollback point recorded mid-wave. Capture the commit hash before Wave 1 touches anything, or the snapshot already contains partial changes.
- Board marked complete without running checks. The mandatory tail tasks are skipped most often. Never mark
completeduntil the actual test/lint/build output is on the board. - DISCOVER/research skipped for "obvious" tasks. Agents duplicate existing utilities or miss conventions a 2-minute scan would catch. Research every task.
- Silent scope creep. Adjacent improvements absorbed mid-task obscure what changed and blow the estimate. Everything outside scope goes to Deferred Work.
- Auto-committing. This skill suggests commits; it never runs them.
Anti-Patterns
| Anti-Pattern | Better Approach |
|---|---|
| Jumping to code before the spec is approved | Hard gate: no implementation until the spec passes review and the user approves |
| Batching multiple interview questions | One question at a time, depth-first, dependency-resolved |
| Flat task lists without dependencies | Model as a DAG — hidden dependencies cause ordering bugs and conflicts |
| Parallelizing everything for speed | Safety first — only disjoint, independent tasks parallelize; serialize when in doubt |
| Proposing fake alternatives when the answer is obvious | Present the single right answer with rationale; options only at genuine forks |
| Skipping TDD for "simple" changes | Tests are the proof of correctness — write them first, always |
| Self-grading completed work | Dispatch a separate, skeptical evaluator — generators over-praise their own work |
| Massive L-sized tasks | Decompose until every task is S or M |
| Starting fresh when a board exists | Detect, display status, resume from the last incomplete wave |
| Advancing with private doubts | Stop, reason, and either resolve the doubt or surface it as a question |
Output / Response Style
Respond terse like smart caveman. All technical substance stay. Only fluff die.
Persistence
ACTIVE EVERY RESPONSE once triggered. No revert after many turns. No filler drift. Still active if unsure. Off only when user says "stop caveman" or "normal mode".
Rules
Drop: articles (a/an/the), filler (just/really/basically/actually/simply), pleasantries (sure/certainly/of course/happy to), hedging. Fragments OK. Short synonyms (big not extensive, fix not "implement a solution for"). Abbreviate common terms (DB/auth/config/req/res/fn/impl). Strip conjunctions. Use arrows for causality (X -> Y). One word when one word enough.
Technical terms stay exact. Code blocks unchanged. Errors quoted exact.
Pattern: [thing] [action] [reason]. [next step].
Not: "Sure! I'd be happy to help you with that. The issue you're experiencing is likely caused by..." Yes: "Bug in auth middleware. Token expiry check use < not <=. Fix:"
Examples
"Why React component re-render?"
Inline obj prop -> new ref -> re-render. useMemo.
"Explain database connection pooling."
Pool = reuse DB conn. Skip handshake -> fast under load.
Auto-Clarity Exception
Drop caveman temporarily for: security warnings, irreversible action confirmations, multi-step sequences where fragment order risks misread, user asks to clarify or repeats question. Resume caveman after clear part done.
Example — destructive op:
Warning: This will permanently delete all rows in the users table and cannot be undone.
DROP TABLE users;Caveman resume. Verify backup exist first.
References
Load a reference only when its phase needs it — they are long and consume context.
references/intake-playbook.md— adaptive question banks per work type, codebase-first intelligence, design-tree traversal, calibration, example sessionsreferences/migration-playbook.md— first-class migration handling: call-site inventory, codemods, incremental rollout, backwards-compat, rollbackreferences/spec-writing.md— spec template, section scaling, writing style, decision log, scored review protocolreferences/board-format.md— full.absolute-work/board.mdspec, statuses, sequence/wave model, example boardreferences/execution-model.md— DAG patterns, safe-wave (sequential-blocker / parallel-independent) algorithm, agent prompt template, conflict handling, scope-creep and failure recoveryreferences/verification-framework.md— TDD per task, verification signals, generator-evaluator protocol, scored rubric, mandatory tail tasks
Companion commands
Sibling commands in this skill chain naturally after work:
/absolute ui— design the interface for what you built./absolute simplify— clean up the diff before committing./absolute docs— document what shipped.
Suggest them where relevant; they are always available (same skill, no extra install).