name: ~aod-deliver description: "Structured delivery retrospective for the AOD Lifecycle's Deliver stage. Validates Definition of Done, captures delivery metrics (estimated vs. actual duration), logs surprises, feeds new ideas back into discovery via GitHub Issues, and creates Institutional Knowledge entries. Use this skill when you need to close a feature, run a delivery retrospective, capture lessons learned, or complete the AOD lifecycle."
AOD Deliver Skill
Purpose
Close a completed feature with a structured retrospective that captures delivery metrics, surprises, and lessons learned. New ideas from the retrospective feed back into the Discover stage, completing the AOD Lifecycle loop.
Entry point: /aod.deliver
Prerequisites
- A feature branch exists with completed work (typically after
/aod.build [--no-security] [--no-simplify] [--no-docs]) .aod/spec.mdexists with the feature specification.aod/tasks.mdexists with task definitions.aod/scripts/bash/github-lifecycle.shis available for GitHub operations
Reference-Loading Contract
Several sections of this skill are stored in on-demand reference files under .claude/skills/~aod-deliver/references/. This mirrors the ~aod-run pattern.
Trigger idiom (used at every reference load site below):
MANDATORY: You MUST use the Read tool to load
references/X.mdbefore proceeding with [task]. Do NOT rely on memory of prior [X] content. If the file cannot be read, display an error and STOP.
Fail-closed rule: If a reference file cannot be read at a gate, display an error message and STOP execution. Do not attempt to continue from memory.
Re-read rule: If more than 50 lines of interstitial output have been generated since a reference was last read, re-read it before using its content.
Reference Decision Table
| Reference File | Holds | Load When |
|---|---|---|
references/deliver-flags.md |
Step 0 + Step 0g (flag parse + audit log) | Every run — at Step 0 |
references/delivery-lock.md |
Step 0.5 (acquire lock) + Step 14 (release lock) | Every run — at Step 0.5 and Step 14 |
references/close-and-document.md |
Step 12 (close issue) + Step 13 (prompt /aod.document) | Every run — at Step 12 |
references/render-tables.md |
Step 11c.1 Feature-139 render tables | Only when e2e_validation.status is set (E2E was entered) |
references/edge-cases.md |
Edge Cases, Common Rationalizations, Red Flags | Only when handling an edge case or checking a rationalization/red-flag |
Note on infrastructure references (deliver-flags, delivery-lock, close-and-document): these load on every run for size/maintainability, not lazy-loading. Only render-tables.md and edge-cases.md are conditional.
Step 0: Parse Arguments (Feature 139 — PRD FR-001, FR-002, FR-034)
MANDATORY: You MUST use the Read tool to load references/deliver-flags.md before proceeding with flag parsing. Do NOT rely on memory of prior deliver-flags content. If the file cannot be read, display an error and STOP.
Follow all sub-steps (0a through 0g) in references/deliver-flags.md. After completing Step 0, carry forward the following values into working context for downstream INLINE steps:
| Value | Downstream Consumer |
|---|---|
AOD_NO_TESTS_FOUND (true/false) |
Step 9d gate decision (Path A vs Path B) |
AOD_NO_TESTS_REASON (captured reason string) |
Step 9d Path A display + Step 10 delivery doc |
AOD_REQUIRE_TESTS_FOUND (true/false) |
Observability only (deprecated flag) |
invocation_command (full $0 $@ before parsing) |
Step 11c.1 e2e_validation.invocation_command field |
Step 0.5: Acquire Delivery Lock (US-7, Feature 139 — FR-026..FR-030)
MANDATORY: You MUST use the Read tool to load references/delivery-lock.md before proceeding with delivery lock acquisition. Do NOT rely on memory of prior delivery-lock content. If the file cannot be read, display an error and STOP.
Follow the Step 0.5 instructions in references/delivery-lock.md. After Step 0.5 completes, FEATURE and HEAL_BUDGET are available in working context for all downstream steps. The lock is now held — Step 14 MUST run on every clean exit path to release it.
Step 1: Validate Definition of Done
GitHub Lifecycle Update (early): If a GitHub Issue exists for this feature, update its stage label to stage:deliver using aod_gh_update_stage from .aod/scripts/bash/github-lifecycle.sh. This moves the issue to the Deliver column on the Projects board at the start of the delivery retrospective. If gh is unavailable, skip silently (graceful degradation).
Check that the feature meets the Definition of Done criteria:
- Read
.aod/tasks.mdand count incomplete tasks (lines matching- [ ]). - Read
.aod/spec.mdto extract the feature name and scope. - Check for open blockers: Search tasks.md for any items marked
BLOCKED.
Validation Results
If incomplete tasks exist: Display count and list them:
DEFINITION OF DONE — INCOMPLETE
Feature: {feature_name}
Incomplete Tasks: {count}
{list of incomplete task descriptions}
Options:
(A) Mark remaining tasks as complete and proceed
(B) Abort delivery — finish tasks first
Use AskUserQuestion to let the user choose.
If all tasks complete: Proceed to the structural-presence check below.
DEFINITION OF DONE — TASK COUNT PASSED
Feature: {feature_name}
Tasks: {total} complete, 0 remaining
Structural Presence Gate (FR-011 / AC-011a-c)
Before presenting the DoD verdict to the human, verify that the artifacts carry role-tagged sign-offs and the DoD-acknowledgment token. This is a structural presence check only — it confirms the DoD block was seeded and each role's APPROVED tag exists. It does not auto-judge the quality of those sign-offs; the DoD pass/fail quality verdict is always presented to the human in the next sub-step.
1. Resolve artifact paths:
FEATURE=$(git branch --show-current 2>/dev/null)
NNN="${FEATURE%%-*}"
SPEC_PATH=$(ls specs/${NNN}-*/spec.md 2>/dev/null | head -1)
PLAN_PATH=$(ls specs/${NNN}-*/plan.md 2>/dev/null | head -1)
TASKS_PATH=$(ls specs/${NNN}-*/tasks.md 2>/dev/null | head -1)
Use .aod/spec.md, .aod/plan.md, .aod/tasks.md as fallbacks if the specs/ paths are absent.
2. Source run-state.sh and call aod_state_signoff_present:
source .aod/scripts/bash/run-state.sh
aod_state_signoff_present "$SPEC_PATH" "$PLAN_PATH" "$TASKS_PATH"
GATE_RC=$?
3. Branch on exit code (Contract 1 — gate-contracts.md):
Exit 0 (PRESENT): Each artifact's required role sign-off(s) are present and APPROVED (spec→pm; plan→pm+architect; tasks→pm+architect+techlead) and the
<!-- DOD-ACK -->token is intasks.md. Proceed to the DoD quality verdict below.Exit 1 (ABSENT) — BLOCK close:
Determine which element is missing by checking each artifact individually:
# Re-run per-artifact to identify the missing element for the diagnostic source .aod/scripts/bash/run-state.sh # Per-artifact required roles (FR-020) — check only these for the diagnostic: # spec.md -> pm_signoff # plan.md -> pm_signoff, architect_signoff # tasks.md -> pm_signoff, architect_signoff, techlead_signoff, and <!-- DOD-ACK --> tokenDisplay the halt and stop the close:
CLOSE BLOCKED — structural presence check failed Missing element: {role_tag_or_dod_ack} in {artifact_path} Resolve the missing element and re-run /aod.deliver: - Role sign-off absent: run the appropriate /aod.spec, /aod.project-plan, or /aod.tasks governance step to obtain the missing APPROVED tag. - DoD-ACK token absent: ensure tasks.md was generated with the canonical 3-step DoD block (from /aod.tasks; see tasks-template.md).Halt: do not proceed to the DoD quality verdict or Step 2.
Exit 2 (ERROR) — BLOCK close (fail-closed, mirrors
aod_state_cache_is_freshcontract):CLOSE BLOCKED — artifact unreadable or indeterminate Gate error checking: {artifact_path} Treat as absent (fail-closed). Resolve the file issue and re-run /aod.deliver.Halt: do not proceed.
C-012 INVARIANT (critical): this gate is structural-PRESENCE only. It confirms that the sign-off blocks and DoD-acknowledgment token exist — it never auto-approves or auto-vetoes the quality of what those blocks say. The DoD pass/fail quality verdict is always rendered by the human, as described in the next sub-step.
DoD Quality Verdict (Human-Rendered — C-012)
Present the DoD pass/fail verdict to the human. The structural gate above confirmed presence; whether the delivered feature actually satisfies each criterion is the human's judgment to make.
Display:
DEFINITION OF DONE — QUALITY VERDICT (human decision)
Feature: {feature_name}
Tasks: {total} complete, 0 remaining
Structural check: PASSED (sign-offs and DoD-ACK present)
Constitution Principle VII (canonical DoD bar):
1. Pushed to Production — feature deployed and operational.
2. Tested — all automated tests pass (unit, integration, E2E, performance).
3. User Validated — real-world usage confirmed by actual users/stakeholders.
NOTE — Constitution VII.2 vs DEFINITION_OF_DONE.md step 2:
Constitution VII.2 says: "all automated tests pass (unit, integration, E2E, performance)"
DEFINITION_OF_DONE.md step 2 says: {current step-2 text from the file}
If these differ, the divergence is surfaced here for your review — do not
auto-resolve it. Constitution VII is canonical; surface any conflict to the
human so they can record a decision or confirm alignment.
Does this feature meet the DoD criteria?
(A) Yes — DoD passed, proceed to Step 2
(B) No — DoD not met, abort delivery
Use AskUserQuestion to let the user render the verdict.
- If (A): Proceed to Step 2.
- If (B): Abort and display:
"Delivery aborted — DoD not met. Complete the outstanding criteria and re-run /aod.deliver."
Step 2: Capture Delivery Metrics
Estimated Duration
Use AskUserQuestion:
Question: "How long did you originally estimate this feature would take?"
Header: "Estimate"
Options:
- "1-2 days": "A quick feature or fix"
- "3-5 days": "About a week of work"
- "1-2 weeks": "A moderate feature spanning multiple days"
- "3+ weeks": "A large feature requiring significant effort"
Allow "Other" for custom estimates (e.g., "4 sprints", "3 months").
Actual Duration
Compute automatically from the feature branch creation date:
# Get the date of the first commit on this branch (not on main)
git log main..HEAD --reverse --format="%ai" | head -1
If the branch has no commits diverged from main, use the earliest commit date on the current branch.
Calculate the difference between the branch start date and today's date. Express as:
- "N days" if < 14 days
- "N weeks" if >= 14 days and < 60 days
- "N months" if >= 60 days
Store both estimated_duration and actual_duration.
Step 3: Capture Surprise Log
Use AskUserQuestion:
Question: "What surprised you most during this feature? (One sentence minimum)"
Header: "Surprises"
Options:
- "Scope was larger than expected": "The feature required more work than initially scoped"
- "Dependencies were complex": "Integrations or dependencies added unexpected complexity"
- "Smooth sailing": "Everything went roughly as planned — no major surprises"
Allow "Other" for custom surprise statements (required — must be at least 1 sentence).
Validation: If the user provides empty or very short text (<10 chars), re-prompt: "Please provide at least one sentence describing what surprised you."
Store as surprise_log.
Step 4: Capture Next Ideas (Optional Feedback Loop)
Use AskUserQuestion:
Question: "Did this feature reveal any new ideas or follow-up work? (Optional — select 'None' to skip)"
Header: "Next ideas"
Options:
- "Yes — let me describe": "I have one or more ideas for follow-up features or improvements"
- "None": "No new ideas emerged from this feature"
If "Yes": Ask the user to describe each idea. For each idea provided:
MUST use the standalone
create-issue.shscript to create the GitHub Issue (do NOT callgh issue createdirectly — the script handles both issue creation and project board sync):bash .aod/scripts/bash/create-issue.sh \ --title "{idea_description}" \ --body "$BODY" \ --stage discover \ --type retroWhere
$BODYcontains:# {idea_description} ## ICE Score Impact: —, Confidence: —, Effort: — = **Not yet scored** ## Evidence Retrospective: Emerged during delivery of {feature_name} ## Metadata - Source: Retrospective - Priority: Not yet scored - Date: {YYYY-MM-DD} - Status: New (from retrospective) - Origin Feature: {feature_name}The script applies the
type:retrolabel automatically and adds the issue to the Projects board with the correct Status column.If
ghis unavailable, log the idea to stdout with guidance:NEW IDEA FROM RETROSPECTIVE (GitHub unavailable — capture manually): Idea: {idea_description} Suggested next step: Run `/aod.idea {idea_description}` to formally capture and score.
Store ideas as next_ideas[].
If "None": Skip and proceed to Step 5.
Step 5: Capture Lessons Learned
Use AskUserQuestion:
Question: "What is the key lesson learned from this feature that future developers should know?"
Header: "Lesson"
Options:
- "Technical pattern": "A reusable technical approach or architecture decision worth documenting"
- "Process improvement": "A workflow or process change that would help future features"
- "Tooling insight": "A tool, library, or configuration finding worth preserving"
Allow "Other" for custom lesson descriptions.
After category selection, prompt for the full lesson text: "Describe the lesson in 2-3 sentences. What was the problem, what did you learn, and how should it be applied?"
Validation: Require at least 20 characters of lesson text.
Store as lesson_category and lesson_text.
Step 6: Write Institutional Knowledge Entry
Append a new entry to docs/INSTITUTIONAL_KNOWLEDGE.md in the ## Knowledge Entries section.
Determine the next entry number by scanning existing ### Entry N: headers and incrementing.
### Entry {N}: {feature_name} — Delivery Retrospective
## [{lesson_category}] - {one_line_summary}
**Date**: {YYYY-MM-DD}
**Context**: Delivery retrospective for {feature_name}. Estimated: {estimated_duration}, Actual: {actual_duration}.
**Problem**:
{lesson_text — first sentence or clause describing the challenge}
**Solution**:
{lesson_text — remaining sentences describing the approach/learning}
**Why This Matters**:
Captured during structured delivery retrospective. {surprise_log}
**Tags**: #retrospective #delivery #{lesson_category_tag}
### Related Files:
- `.aod/spec.md` — Feature specification
- `.aod/tasks.md` — Task breakdown
---
Map lesson_category to tags:
- "Technical pattern" →
#architecture #pattern - "Process improvement" →
#process #workflow - "Tooling insight" →
#tooling #configuration - Other →
#general
Step 7: Post Delivery Metrics to GitHub Issue
If a GitHub Issue exists for this feature (search by feature name or branch):
- Find the issue:
aod_gh_find_issue "{feature_name}"or search by branch name - Add a comment with delivery metrics:
## Delivery Metrics | Metric | Value | |--------|-------| | Delivery Date | {YYYY-MM-DD} | | Estimated Duration | {estimated_duration} | | Actual Duration | {actual_duration} | | Surprise Log | {surprise_log} | | Lessons Learned | {lesson_category}: {one_line_summary} | | New Ideas | {count of next_ideas or "None"} | - Note: The issue was already transitioned to
stage:deliverin Step 1.
Step 8: Regenerate BACKLOG.md
Run .aod/scripts/bash/backlog-regenerate.sh to update the backlog snapshot with the newly delivered item. If gh is unavailable, skip silently.
Step 9: Run E2E Validation Gate
Run automated E2E validation against the feature's acceptance criteria before collecting test evidence. This step is non-fatal by default (ADR-006): if the tester agent fails to launch, crashes, or times out, log the error and proceed to Step 10. The E2E validation gate supports two modes:
- Soft gate (default): Warn on test failure, let the developer decide whether to proceed or abort
- Hard gate (
--require-tests): Block delivery if any test fails
The gate decision is made in sub-step 9d based on the require_tests flag passed from the command's flag parsing (Step 0).
9a: Read Active Pack Test Contract
Replace filesystem-based Playwright detection with a deterministic lookup against the active stack pack's STACK.md contract. See specs/130-e2e-hard-gate/contracts/deliver-step9-integration.md §Step 9a for the full contract; see specs/130-e2e-hard-gate/data-model.md §2 for the exit-code taxonomy and §3 for the e2e_validation.* payload shape (plan Decision 4).
1. Resolve the active pack:
Read .aod/stack-active.json and extract the .pack key (e.g., via jq -r '.pack' .aod/stack-active.json 2>/dev/null).
Branch on two distinct cases (FR-015 / AC-015a-b — do NOT collapse them back into one condition):
If the file is absent (
[ ! -f .aod/stack-active.json ]): skip Step 9 entirely and proceed directly to Step 10. This is the Feature-139 backward-compatibility silent path for projects not using stack packs (AC-015b). Do NOT write any audit line — the silent skip is deliberate and must not be overturned. Do NOT sete2e_validation.*fields — Step 11c falls through to the legacy path.If the file is present but malformed (file exists AND jq exits non-zero on
jq -r '.pack' .aod/stack-active.json 2>/dev/null, OR.packis null/empty): emit an audit line viaappend_opt_out_line(AC-015a), then skip Step 9 entirely and proceed directly to Step 10. Do NOT sete2e_validation.*fields.# Present-but-malformed path (FR-015 / AC-015a) if [ -f ".aod/stack-active.json" ]; then ACTIVE_PACK=$(jq -r '.pack // empty' .aod/stack-active.json 2>/dev/null) if [ -z "$ACTIVE_PACK" ]; then # File exists but pack key is null, empty, or jq failed — write audit line source .aod/scripts/bash/audit-log.sh TIMESTAMP=$(date -u +"%Y-%m-%dT%H:%M:%SZ") INVOKER=$(git config user.email 2>/dev/null) [ -z "$INVOKER" ] && INVOKER="unknown" FEATURE_ID=$(git branch --show-current 2>/dev/null) MODE="interactive" [ "${autonomous:-false}" = "true" ] && INVOKER="autonomous" && MODE="autonomous" REASON="E2E gate skipped: stack-active.json present but malformed (pack key null/empty or jq parse failure)" append_opt_out_line "$TIMESTAMP" "$INVOKER" "$FEATURE_ID" "$REASON" "$MODE" || { echo "[WARN] audit-log append failed for malformed stack-active.json; proceeding (see stderr)" >&2 } # Proceed to Step 10 — do not set e2e_validation.* fields fi fiNote:
append_opt_out_lineis called withkindsurfaced in the audit line ase2e_skip_malformed_stack_activeper data-model.md §3. Because the existing library does not accept akindparameter, thereasonstring above encodes the kind explicitly so the audit trail remains identifiable. Defends against.aod/stack-active.jsonschema drift (plan Decision 4, architect LOW-4).
2. Invoke the contract lint on the active pack's STACK.md:
LINT_STDERR=$(bash .aod/scripts/bash/stack-contract-lint.sh "stacks/$ACTIVE_PACK/STACK.md" 2>&1 >/dev/null)
LINT_EXIT=$?
On exit code 0, parse the extracted fields (TEST_COMMAND, E2E_COMMAND, E2E_OPT_OUT) from the lint's stdout for use in Step 9b.
3. Branch on exit code (taxonomy from data-model.md §2):
- Exit 0 (valid): If
E2E_OPT_OUTis set → storee2e_validation.status = "skipped",e2e_validation.skip_reason = "opt_out: $E2E_OPT_OUT". Do NOT populatee2e_validation.command(plan Decision 4 —commandis populated only for pass/fail). Proceed to Step 10. Else →E2E_COMMANDis set; pass it forward to Step 9b. - Exit 1 (runtime error), 2 (missing
test_command), 3 (XOR violation), 4 (unknown key), 5 (missing contract block): Storee2e_validation.status = "error",e2e_validation.error = "$LINT_STDERR". Do NOT populatee2e_validation.command. Skip Step 9b and proceed to Step 10. This is NOT a hard-block on delivery — the hard-block is reserved for real E2E test failures under--require-tests(Step 9d).
Output state after 9a:
e2e_validation.status(enum:"skipped"|"error"| unset when proceeding to 9b)e2e_validation.skip_reason(present whenstatus == "skipped"; the opt-out reason)e2e_validation.error(present whenstatus == "error"; the lint stderr diagnostic)e2e_validation.command(absent here — populated later only whenstatusresolves to"pass"or"fail"via Steps 9b/9c per data-model §3)E2E_COMMANDshell variable (populated only when proceeding to Step 9b; sourced from the parsed contract)
9a.5: AC-Coverage Pre-Gate (US-4, Feature 139 — FR-006..FR-011)
Before invoking the tester agent, verify every Acceptance Criterion in spec.md is mapped to at least one test scenario. This is the core promise of User Story 4: bind what we promised (spec ACs) to what we verified (scenarios) before the runner ever executes. Runs only when Step 9a did not short-circuit the gate (i.e., e2e_validation.status is still unset after 9a).
When this step runs:
E2E_COMMANDwas resolved successfully in Step 9a (pack has a real contract; not opt-out, not error)e2e_validation.statusis unset entering 9a.5
When this step is skipped:
- Step 9a short-circuited with
status = "skipped"orstatus = "error"— AC-coverage is irrelevant because the gate is already bypassed .aod/stack-active.jsonwas absent/malformed (Step 9a proceeded directly to Step 10, this step never reached)
1. Source the AC-coverage parser library:
source .aod/scripts/bash/ac-coverage-parse.sh
2. Resolve spec path:
From the current branch name, derive specs/{NNN}-*/spec.md via:
FEATURE=$(git branch --show-current)
NNN="${FEATURE%%-*}"
SPEC_PATH=$(ls -d specs/${NNN}-*/spec.md 2>/dev/null | head -1)
3. Run strict parse + coverage map:
ACS_JSON=$(parse_acs_strict "$SPEC_PATH")
PARSE_RC=$?
# Scenarios directory: tests/e2e/ is the canonical root; individual packs may
# declare alternatives but the parser scans recursively for @US-NN-AC-N tags
# across *.feature, *.test.{ts,tsx,js,jsx}, *.spec.{ts,tsx,js,jsx} regardless.
SCENARIOS_DIR="tests/e2e/"
COVERAGE_JSON=$(printf '%s' "$ACS_JSON" | build_ac_scenario_map - "$SCENARIOS_DIR")
MAP_RC=$?
4. Branch on parse exit code (taxonomy from ac-coverage-parse.sh header):
- Exit 0 (strict parse succeeded): Continue to step 5 (gate decision).
- Exit 2 (spec path missing/unreadable): Emit stderr warning and skip the pre-gate (store
e2e_validation.ac_coverage.status = "skipped",e2e_validation.ac_coverage.skip_reason = "spec_missing"). Proceed to Step 9b — existing tester flow surfaces the missing-spec condition. Do NOT halt here; delivery without spec is an upstream governance failure handled elsewhere. - Exit 3 (legacy prose ACs detected): Tier 2 descope branch (per tasks.md Phase 6 header):
- Default (halt): Emit Step 9e-style halt with
reason = "ac_coverage_fail",scenarios = [],recovery_status = "not_attempted". Human-readable stderr message:
Delivery halts with exit 10 (see Step 9e halt emission sub-step — AC-coverage fail reuses the same channels as e2e_fail, distinguished byHalted — legacy prose ACs detected in $SPEC_PATH. Retrofit spec ACs to Given/When/Then format before delivering. See: docs/guides/AC_COVERAGE_MIGRATION.mdreasonenum). - Tier 2 descope (build-phase flag, when burn-rate triggers): If
AOD_AC_COVERAGE_WARN_ONLY == "true"in the environment ANDheal_attempts == 0(no auto-fix configured), downgrade to warn-only: emit the same stderr message, storee2e_validation.ac_coverage.status = "warn_legacy_prose", and proceed to Step 9b. This branch exists for backward-compat during initial rollout and is opt-in via env; the Tier 2 descope trigger documents when it is acceptable to activate.
- Default (halt): Emit Step 9e-style halt with
- Exit 4 (invalid [MANUAL-ONLY] reason): Halt with
reason = "ac_coverage_fail". Stderr message names the AC ID and the minimum reason length (10 chars). Same halt channels as exit 3 default path. - Exit 1 (runtime error — jq/awk failure): Non-fatal per ADR-006. Store
e2e_validation.ac_coverage.status = "error",e2e_validation.ac_coverage.error = "parse_runtime_failure", proceed to Step 9b. The tester-gate still runs; we just lose the AC-coverage evidence in the delivery doc.
5. Gate decision based on coverage map (only when parse exit 0):
Parse uncovered_acs from the coverage JSON:
UNCOVERED_ACS=$(printf '%s' "$COVERAGE_JSON" | jq -c '.uncovered_acs')
UNCOVERED_COUNT=$(printf '%s' "$UNCOVERED_ACS" | jq 'length')
UNCOVERED_COUNT == 0(all ACs covered or manual-only): PASS the pre-gate. Storee2e_validation.ac_coverage = $COVERAGE_JSON(the full map includingtotal_acs,covered_count,uncovered_acs: [],manual_only_acs[],coverage_by_ac[]) for Step 10 rendering. Proceed to Step 9b.UNCOVERED_COUNT > 0: HALT via the Step 9e protocol. Before invoking the halt emission, set:HALT_REASON = "ac_coverage_fail"SCENARIOS_JSON = $UNCOVERED_ACS(reuse the halt-recordfailing_scenariosfield to carry the uncovered AC IDs — percontracts/halt-record.md, the array is an opaque string list that orchestrators pattern-match on byreason)RECOVERY_STATUS = "not_attempted"(auto-fix loop cannot add missing scenarios; human authorship required)HEAL_PR_URL = ""andHEAL_PR_NUMBER = ""(no heal-PR for AC-coverage failures; the fix is spec+scenario authorship, not patching)
Emit stdout line:
Halted — $UNCOVERED_COUNT acceptance criterion(s) without scenarios: $(printf '%s' "$UNCOVERED_ACS" | jq -r '. | join(", ")') Mark these ACs [MANUAL-ONLY] <reason> in spec.md, or author missing scenarios. See: docs/guides/AC_COVERAGE_MIGRATION.mdThen jump to Step 9e (halt signal emission) with the above context. Step 9e writes the halt record (channel 2) and returns exit 10 (channel 3) as it does for e2e_fail halts — the only difference is the
reasonenum value distinguishing the halt cause.
6. Store coverage payload for Step 10:
On pass path (UNCOVERED_COUNT == 0), the full $COVERAGE_JSON is stored as e2e_validation.ac_coverage. Step 10 (render doc) reads e2e_validation.ac_coverage.coverage_by_ac[] to render the AC→scenario table and e2e_validation.ac_coverage.manual_only_acs[] to render the Manual Validation subsection. Shape per ac-coverage-parse.sh build_ac_scenario_map:
{
"total_acs": 5,
"covered_count": 4,
"uncovered_acs": [],
"manual_only_acs": ["US-03-AC-2"],
"coverage_by_ac": [
{"ac_id": "US-01-AC-1", "scenarios": ["tests/e2e/login.feature:12"], "manual_only": false},
...
]
}
Output state after 9a.5:
e2e_validation.ac_coverage(full coverage map when parse+gate succeeded;status = "skipped"|"error"|"warn_legacy_prose"sub-object when degraded)HALT_REASONshell variable (set only on halt path; consumed by Step 9e)SCENARIOS_JSONshell variable (set only on halt path; carries uncovered AC list into halt record)
Reference contracts:
- AC parser:
.aod/scripts/bash/ac-coverage-parse.sh - Halt record:
specs/139-delivery-verified-not-documented/contracts/halt-record.md - Migration guide:
docs/guides/AC_COVERAGE_MIGRATION.md - Canonical AC ID format:
US-{NN}-AC-{N}(e.g.,US-01-AC-1)
9b: Invoke Tester Agent
Launch the tester agent to generate and execute BDD/Gherkin E2E scenarios from the feature's acceptance criteria.
Resolve spec path: Find
specs/{NNN}-*/spec.mdusing the feature number from the current branchLaunch tester agent via the Agent tool:
Agent tool: subagent_type: "tester" prompt: | Generate and execute BDD/Gherkin E2E scenarios for the feature specification. Input: - Feature spec: {spec_path} - E2E runner command: ${E2E_COMMAND} - Output directory: .aod/test-results/ Instructions: 1. Read the feature spec and extract acceptance criteria from all user stories 2. Generate Gherkin scenarios (.feature files) from the acceptance criteria 3. Execute scenarios by invoking the declared E2E runner command 4. Write test results to .aod/test-results/ Subagent Return Policy: - Return ONLY: status (pass/fail/error), item count ({passed}/{total}), and results file path (.aod/results/tester.md) - Write detailed findings (failing scenario names, error details, execution logs) to .aod/results/tester.md BEFORE returning - Max return: 15 lines / ~200 tokens timeout: 300000 # 5 minutesHandle timeout: If the agent does not return within 5 minutes:
- Display:
"E2E validation timed out after 5 minutes ({elapsed}s elapsed)" - Use AskUserQuestion: "(A) Continue waiting, (B) Abort E2E validation and proceed"
- If Abort: store
e2e_validation.status = "error",e2e_validation.error = "timeout", proceed to Step 10
- Display:
Handle agent failure: If the agent fails to launch or crashes:
- Display:
"E2E validation error: {error_message}" - Store
e2e_validation.status = "error",e2e_validation.error = "{error_message}" - Proceed to Step 10 (non-fatal per ADR-006)
- Display:
9c: Parse Test Results
Read and parse the tester agent's return value and detailed results.
- Parse agent return: Extract status (
pass/fail/error), item count ({passed}/{total}), and results file path - Read detailed results: If status is
fail, read.aod/results/tester.mdto extract:- Failing scenario names
- Error messages for each failure
- Total, passed, failed, and skipped counts
- Store results in the
e2e_validationdata structure:e2e_validation.status:"pass"|"fail"|"error"|"skipped"e2e_validation.total: Total test counte2e_validation.passed: Passed test counte2e_validation.failed: Failed test counte2e_validation.skipped: Skipped test counte2e_validation.failing_scenarios[]: List of failing scenario names (empty if all pass)
- Display summary:
If any failures, also display:E2E Validation: {status} Tests: {passed}/{total} passed ({failed} failed, {skipped} skipped)Failing scenarios: - {scenario_name_1} - {scenario_name_2} - Proceed to sub-step 9c.5 (Auto-Fix Loop) when
e2e_validation.status == "fail"ANDheal_attempts > 0; otherwise proceed directly to sub-step 9d (Gate Decision)
9c.5: Auto-Fix Loop (US-8, Feature 139 — FR-017..FR-023)
When tests fail and heal_attempts > 0 in .aod/config.json, attempt auto-recovery before gate decision. The loop lives strictly between Step 9c (Parse Test Results) and Step 9d (Gate Decision) — if tests pass at any attempt, execution jumps directly to Step 9d with gate_result = "pass". On exhaustion or scope-guard escalation, Step 9d fires the halt path (Step 9e → heal-PR).
Preconditions:
e2e_validation.status == "fail"(from Step 9c)e2e_validation.failed > 0ande2e_validation.failing_scenarios[]non-empty- Skip this step entirely when preconditions are not met
1. Read config:
HEAL_ATTEMPTS=$(jq -r '.deliver.heal_attempts // 0' .aod/config.json 2>/dev/null || echo 0)
HEAL_MULTIPLIER=$(jq -r '.deliver.heal_max_timeout_multiplier // 1.5' .aod/config.json 2>/dev/null || echo 1.5)
2. Short-circuit on HEAL_ATTEMPTS == 0:
If HEAL_ATTEMPTS is 0, skip the loop entirely (config-level opt-out):
- Set
e2e_validation.recovery_status = "skipped_via_config" - Set
e2e_validation.recovery_actions = [] - Emit NO heartbeat sentinel (no auto-fix work will run)
- Proceed directly to Step 9d — the gate decision will halt via Path B (heal-PR escalation) since tests still failed.
3. Initialize loop state:
recovery_status = "pending"— will be finalized on loop exitrecovery_actions = []— append one record per attempt regardless of outcomeFEATURE— current branch name (e.g.,139-delivery-verified-not-documented)
4. Loop (attempt in 1..HEAL_ATTEMPTS):
a. Heartbeat sentinel (FR-029): BEFORE any heal work begins for this attempt, call:
source .aod/scripts/bash/deliver-lock.sh
write_heartbeat_sentinel "$FEATURE" "$attempt" "$HEAL_ATTEMPTS"
Integrates with US-7 crash recovery: if the skill crashes mid-attempt, the next /aod.deliver invocation's detect_abandoned_sentinel (Step 0.5) will halt with exit 12. Heartbeat refresh is integral — every attempt overwrites the sentinel with a fresh last_heartbeat timestamp, so an abandoned process is distinguishable from an actively-running loop.
b. Invoke tester in mode: heal: pass the failing scenario, test_paths, framework, max_timeout_multiplier, attempt_number. The tester returns a proposed diff on stdout and a human-readable description on stderr. See .claude/agents/tester.md §9 "Operating Modes → Mode: heal" for the full input/output contract.
c. Scope-guard evaluation (wired in full by T056 — see Step 9c.5 scope-guard evaluation bullet below): call evaluate_scope_guard on the proposed diff. Library returns a JSON decision record with {decision, reason, violating_lines}.
d. Branch on scope-guard decision:
- **`allowed`**: apply the diff with `git apply --index`, commit using the T048 format, then re-run tests via the tester agent (non-heal mode — just execute and report). If tests pass → set `recovery_status = "recovered"`, break the loop, proceed to Step 9d with `gate_result = "pass"`. If tests still fail → append the attempt record and continue to the next iteration.
- **`rejected`**: abort the loop immediately. Append the attempt record with `scope_guard_decision: "rejected"` and `scope_guard_reason` populated. Set `recovery_status = "scope_guard_escalated"`. Surface the rejection reason into the heal-PR body (pass via `HEAL_PR_*` env vars when Step 9e invokes `heal-pr.sh`). Break loop.
e. Append attempt record to e2e_validation.recovery_actions[] (payload shape defined in T049 below). Each attempt adds exactly one record, regardless of outcome.
5. On loop exhaustion (all HEAL_ATTEMPTS used, tests still failing):
- Set
recovery_status = "escalated_to_heal_pr"(or"exhausted"— both enum values map to the same downstream behavior; use"exhausted"when loop iterated fully without scope-guard rejection) - Proceed to Step 9d — the gate decision will halt (Path B) and Step 9e will invoke
heal-pr.shwith the fullrecovery_actions[]context
6. Commit format enforcement (US-8, T048):
Every auto-fix commit MUST match this exact format:
e2e-heal(attempt {N}/{M}): {description}
Where {N} is the current attempt number (1-indexed), {M} is HEAL_ATTEMPTS, and {description} is the tester's stderr output trimmed to a single line.
After applying a scope-guard-allowed diff, compose and validate the commit message:
_validate_heal_commit_msg() {
local msg="$1"
if ! echo "$msg" | grep -qE '^e2e-heal\(attempt [0-9]+/[0-9]+\): .+'; then
echo "[auto-fix] rejecting commit — message does not match 'e2e-heal(attempt N/M): ...' format" >&2
return 1
fi
return 0
}
COMMIT_MSG="e2e-heal(attempt ${attempt}/${HEAL_ATTEMPTS}): ${description_from_tester_stderr}"
if _validate_heal_commit_msg "$COMMIT_MSG"; then
git commit -m "$COMMIT_MSG"
else
# Refuse to commit — treat as scope-guard-equivalent failure; append record and break loop.
recovery_status="escalated_to_heal_pr"
break
fi
Rationale: commit history becomes machine-auditable; specs/{NNN}-*/delivery.md Recovery Actions table (Step 11c.1 T037) filters by this prefix; rollback scripts can git revert by grepping the prefix.
7. Scope-guard evaluation (US-9 wiring, T056):
# Source scope-guard library
source .aod/scripts/bash/scope-guard.sh
# Resolve test_paths (stack pack or defaults)
TEST_PATHS=$(get_test_paths)
# Detect framework
FRAMEWORK=$(detect_framework "$(pwd)")
# Evaluate the proposed diff (diff content passed as a raw string is auto-materialized by the library)
SCOPE_GUARD_RESULT=$(evaluate_scope_guard "$PROPOSED_DIFF" "$TEST_PATHS" "$FRAMEWORK" "$HEAL_MULTIPLIER")
SCOPE_GUARD_DECISION=$(echo "$SCOPE_GUARD_RESULT" | jq -r '.decision')
SCOPE_GUARD_REASON=$(echo "$SCOPE_GUARD_RESULT" | jq -r '.reason // empty')
SCOPE_GUARD_VIOLATING_LINES=$(echo "$SCOPE_GUARD_RESULT" | jq -c '.violating_lines // []')
Branch logic per FR-018..FR-019 loop behavior:
SCOPE_GUARD_DECISION == "allowed": apply diff withgit apply --index, commit per the T048 format above, re-run tests.SCOPE_GUARD_DECISION == "rejected": DO NOT APPLY. Append arecovery_actions[]record withscope_guard_decision: "rejected"andscope_guard_reasonpopulated. Break loop. Setrecovery_status = "scope_guard_escalated".
The scope-guard library is deterministic (no LLM judgment) — same diff + same test_paths + same framework always yields the same decision. Contract: specs/139-delivery-verified-not-documented/contracts/scope-guard-decision.md.
8. Payload accumulation (US-8, T049):
The e2e_validation payload extends PRD 130's delivered schema with two additive fields:
{
"...PRD 130 existing fields...": "...",
"recovery_status": "pending | recovered | exhausted | scope_guard_escalated | skipped_via_config | escalated_to_heal_pr",
"recovery_actions": [
{
"attempt_number": 1,
"commit_sha": "abc123...",
"description": "Updated selector [data-testid=\"old\"] to [data-testid=\"new\"]",
"outcome": "passed | still_failing",
"scope_guard_decision": "allowed | rejected",
"scope_guard_reason": null
}
],
"ac_coverage": { ... }
}
Construction rules:
recovery_statusstarts"pending"when loop enters; receives a final value on exit (one of:recovered,exhausted,scope_guard_escalated,skipped_via_config,escalated_to_heal_pr)- Each attempt appends exactly one record to
recovery_actions[]regardless of outcome (allowed+passed, allowed+still_failing, rejected) commit_shais empty-string for rejected attempts (no commit was made)scope_guard_reasonisnullwhenscope_guard_decision: "allowed", populated fromSCOPE_GUARD_REASONwhenrejected- On
heal_attempts == 0skip:recovery_status = "skipped_via_config",recovery_actions = [] - This payload flows into Step 11c.1 where T037 (Wave 4b) renders the Recovery Actions table under "Execution Evidence"
Reference schema: specs/139-delivery-verified-not-documented/data-model.md.
Tester contract: .claude/agents/tester.md §9.
Scope-guard contract: specs/139-delivery-verified-not-documented/contracts/scope-guard-decision.md.
9. Proceed to Step 9d: Gate decision consumes e2e_validation.status (possibly now "pass" if the loop recovered), recovery_status, and recovery_actions[] to make its final halt/proceed determination.
9d: Gate Decision
Evaluate test results and determine whether delivery should proceed, based on gate mode and test outcome.
Inputs: e2e_validation.status, e2e_validation.failed, e2e_validation.failing_scenarios[], require_tests flag (from command Step 0), autonomous flag (from command Step 0).
Decision paths:
All tests pass (
e2e_validation.status == "pass"):- Display:
"E2E Validation Gate: PASSED — all tests passed" - Store
e2e_validation.gate_mode = "soft"or"hard"(based onrequire_testsflag) - Store
e2e_validation.gate_result = "pass" - Proceed to Step 10
- Display:
Tests fail + Status is "error" or "skipped" (from sub-steps 9a-9c):
- These statuses bypass the gate entirely — they were already handled in prior sub-steps
- Store
e2e_validation.gate_mode = "hard"(no valid gate evaluation ran; default to hard posture) - Store
e2e_validation.gate_result = "skip" - Proceed to Step 10
Tests fail + delivery halts — HARD-GATE DEFAULT (Feature 139, PRD FR-001/FR-002; no autonomous override):
The gate decision is identical in interactive and autonomous modes —
/aod.deliverhalts on test failure unless an explicit opt-out was accepted at Step 0 via--no-tests=<reason>. This replaces the former soft-gate default and autonomous override.Path A — Opt-out accepted (
AOD_NO_TESTS_FOUND == truefrom Step 0, regardless of mode):- Display:
E2E Validation Gate: SKIPPED via --no-tests opt-out Reason: {AOD_NO_TESTS_REASON} Failing scenarios (recorded for audit trail): - {scenario_name_1} - {scenario_name_2} - Store
e2e_validation.gate_mode = "skipped_via_opt_out" - Store
e2e_validation.gate_result = "skip" - Store
e2e_validation.opt_out_reason = AOD_NO_TESTS_REASON(surfaces in Manual Validation section of delivery.md) - Proceed to Step 10 (audit line already appended at Step 0 when the flag was accepted — T018 Wave 4 integration)
Path B — No opt-out → HALT (both modes, default):
- Display error:
E2E Validation Gate: HALTED — {failed_count} test(s) failed (hard-gate default) Failing scenarios: - {scenario_name_1} - {scenario_name_2} Delivery halted. Options: 1. Fix the failing tests and re-run /aod.deliver 2. Opt out explicitly with /aod.deliver --no-tests="<reason ≥10 chars>" (opt-outs are audit-logged to .aod/audit/deliver-opt-outs.jsonl) - Store
e2e_validation.gate_mode = "hard" - Store
e2e_validation.gate_result = "block" - Store
e2e_validation.failing_scenarios[](already populated at Step 9c) - Proceed to Step 9e (Halt Signal Emission) — do NOT proceed to Step 10
- Display:
9e: Halt Signal Emission (Feature 139 — PRD FR-011, FR-024..FR-025)
Invoked when Step 9d Path B halts delivery. Emits the halt across three channels so orchestrators and human operators can branch on any one. See specs/139-delivery-verified-not-documented/contracts/halt-record.md for the full contract.
When this step runs:
- Reached via Step 9d Path B (hard-gate halt) — either interactive or autonomous mode
- NOT reached when Path A (opt-out accepted) fires — that path proceeds to Step 10
- NOT reached when Path 1 (all pass) or Path 2 (error/skipped) fires
1. Source the halt-signal library:
source .aod/scripts/bash/halt-signal.sh
2. Resolve contextual fields:
FEATURE: current branch name (e.g.,139-delivery-verified-not-documented)HEAL_PR_URL: populated by heal-PR creation (Wave 4+ integration; empty string until T032 lands)HEAL_PR_NUMBER: same — empty integer until heal-PR wiringSCENARIOS_JSON: JSON array ofe2e_validation.failing_scenarios[], built viajq -c -n '$ARGS.positional' --args "${failing[@]}"RECOVERY_STATUS:"not_attempted"at Wave 3 floor (auto-fix loop lands in Wave 5 / US-8); will become"exhausted"or"scope_guard_escalated"once T047 wires the loopHALT_REASON:"e2e_fail"(enum value fromcontracts/halt-record.md§Channel 2);"ac_coverage_fail"and"abandoned_heal"are reserved for US-4 (Wave 4) and US-7 (Wave 4) paths
3. Channel 1 — stdout line (interactive + autonomous):
emit_halt_stdout "$HEAL_PR_URL" ".aod/results/tester.md"
The library emits either "Halted — heal-PR <URL> requires human review" (preferred form, when URL is populated) or the fallback "Halted — test failures logged at .aod/results/tester.md; heal-PR unavailable" when URL is empty. Per T016 (Feature 139), this stdout line is REQUIRED on every hard-gate halt — both modes.
4. Channel 2 — halt record file (autonomous mode only):
Only write the machine-readable halt record when autonomous == true — the record is the orchestrator's branch signal and interactive operators do not need it.
if [ "$autonomous" = "true" ]; then
write_halt_record \
"$FEATURE" \
"$HALT_REASON" \
"$HEAL_PR_URL" \
"$HEAL_PR_NUMBER" \
"$SCENARIOS_JSON" \
"$RECOVERY_STATUS"
fi
Write failure is non-fatal (ADR-006 non-fatal observability): the library returns non-zero and emits a stderr degradation notice, but Channels 1 and 3 still fire. Orchestrators tolerating the degraded signal path read stdout + exit code as sufficient.
5. Channel 3 — exit code 10:
Halt-for-review exit code is canonical 10 per PRD 139 additive exit-code taxonomy (extends PRD 130's 0-5; no collision).
exit "$(halt_exit_code)"
halt_exit_code echoes the literal 10. The caller (this skill at top level) performs the actual exit so library functions never terminate the process directly.
6. Do NOT proceed to Step 10:
Process terminates at exit 10. Step 10 (Collect Test Evidence), Step 11 (Generate Delivery Document), Step 12 (Close Issue), and Step 13 (Prompt for /aod.document) are all skipped. The failure is recorded for review; delivery must be re-attempted after fixes (or explicit opt-out) before the feature can be closed.
Integration notes (for later waves):
- T032 (Wave 4 / US-5): heal-PR creation populates
HEAL_PR_URLandHEAL_PR_NUMBERbefore Step 9e fires — until then, the fallback stdout form is emitted. - T047 (Wave 5 / US-8): auto-fix loop determines final
RECOVERY_STATUSbefore halt record is written — until then,"not_attempted"is correct. - T022-T023 (Wave 4 / US-3):
~aod-orchestrateand~aod-runskills read the halt record and branch.
Step 10: Collect Test Evidence
Collect and archive test artifacts for the delivery audit trail. This step is entirely non-fatal (ADR-006): if any sub-step fails, log the error and continue to Step 11. The delivery workflow must never be blocked by test evidence collection.
10a: Auto-Detect Test Artifacts
Scan these specific locations at the project root only (no recursive scanning):
.aod/test-results/— AOD convention directorytest-results/— project root test resultscoverage/— project root coverage reports- Files matching
junit*.xml,test-report.*,coverage.*in the project root
Collect all discovered files into a detected_files list with file paths and sizes.
10b: Confirm or Prompt
If files found: Display discovered files with sizes:
Test artifacts detected:
- test-results/junit.xml (12 KB)
- coverage/lcov.info (45 KB)
Archive these to specs/{NNN}-*/test-results/? (Y/n/add more)
Use AskUserQuestion with options:
- "Archive all" — proceed with detected files
- "Add more" — prompt for additional file paths, then archive all
- "Skip" — no test evidence archived
If no files found: Use AskUserQuestion:
Question: "No test artifacts found in standard locations. What would you like to do?"
Header: "Test evidence"
Options:
- "Provide paths": "Enter custom file paths to archive"
- "Skip": "No test evidence for this feature"
10c: File Size Check
Before copying, check file sizes:
- Warn if any individual file exceeds 10 MB
- Warn if total size exceeds 50 MB per feature
- Display warning but do NOT block archival — the developer decides
10d: Sensitive Data Warning
Display once before archival:
Reminder: Review test artifacts for sensitive data (API keys, tokens, PII) before archival.
10e: Copy Files to Archive
Copy confirmed files to specs/{NNN}-*/test-results/:
- Create directory if needed:
mkdir -p specs/{NNN}-*/test-results/ - Copy each file preserving original filenames
- If copy fails for any file, log the error and continue with remaining files
10f: Extract Summary Metrics (Best-Effort)
Extract metrics from recognized formats:
JUnit XML: Parse <testsuites> or <testsuite> root element attributes using xmllint --xpath:
xmllint --xpath "string(/testsuites/@tests)" file.xml 2>/dev/null || \
xmllint --xpath "string(/testsuite/@tests)" file.xml 2>/dev/null
Extract: tests, failures, errors, skipped attributes.
LCOV (.info/.lcov): Sum LF: and LH: records:
LF=$(grep "^LF:" file.lcov 2>/dev/null | cut -d: -f2 | paste -sd+ | bc)
LH=$(grep "^LH:" file.lcov 2>/dev/null | cut -d: -f2 | paste -sd+ | bc)
Compute coverage: (LH / LF) * 100.
Other formats: Archive as-is. Summary = "Manual review required".
If parsing fails for any format, use "Manual review required" as the summary.
10g: Store Test Evidence Data
Store the following for use in Step 11 (delivery document generation):
test_evidence.files[]: List of archived files with paths and summariestest_evidence.metrics: Aggregate metrics (tests run, passed, failed, coverage %)test_evidence.skipped: Boolean indicating if evidence was skippedtest_evidence.notes: Context string for the Notes field
Step 11: Generate Delivery Document
Generate a persistent delivery document from retrospective data collected in Steps 1-8, E2E validation results from Step 9, and test evidence from Step 10.
11a: Re-ground on Template
Re-read .aod/templates/delivery-template.md before generating the document (KB Entry 9 re-grounding). This ensures the output structure matches the standardized template exactly.
11b: Resolve Specs Directory
Resolve the specs directory from the branch name:
- Get the feature number from the branch:
git branch --show-current→ extract NNN prefix - Find the specs directory:
specs/{NNN}-*/ - If the directory does not exist, create it:
mkdir -p specs/{NNN}-{feature_kebab_name}/
Store the resolved path as specs_dir.
11c: Populate Delivery Document
Using the template structure from .aod/templates/delivery-template.md, populate all sections from retrospective data:
- Header: Feature number, name, today's date, branch name, PR number (from draft PR or git log)
- What Was Delivered: Read
.aod/spec.mdfor completed user stories and.aod/tasks.mdfor major completed tasks. Summarize as 3-7 user-visible outcomes (not implementation details). - How to See & Test: Extract verification steps from three sources and merge into numbered steps a developer can follow immediately:
- From
.aod/spec.md: Read each acceptance scenario's Then clause — each maps to one or more verification steps. - From
.aod/plan.md: Extract any test commands, run commands, or manual testing instructions mentioned in the plan. - From
.aod/tasks.md: Extract verification steps from task descriptions and checkpoint criteria. - Format: Number each step. Include specific CLI commands, file paths, or UI actions. Avoid vague instructions like "verify it works" — instead specify what to check and how (e.g., "Run
/aod.deliver 091and confirmspecs/091-*/delivery.mdexists").
- From
- Delivery Metrics: Table with estimated_duration, actual_duration, and variance (computed as over/under/on-target)
- Surprise Log: From
surprise_logcaptured in Step 3 - Lessons Learned: Table with
lesson_category,lesson_text, and KB entry reference from Step 6 - Feedback Loop: Count of
next_ideasand list of each idea with GitHub Issue number - Source Artifacts: Paths to spec.md, plan.md, tasks.md, and PRD (from spec.md frontmatter if available)
- Test Evidence: Populate both subsections of the Test Evidence section:
- E2E Validation Gate: Read
e2e_validation.status,e2e_validation.gate_mode,e2e_validation.gate_result,e2e_validation.total,e2e_validation.passed,e2e_validation.failed,e2e_validation.skipped, ande2e_validation.failing_scenarios[]from Step 9 output. Populate the E2E Validation Gate table and Failure Details field. Ife2e_validation.statusis"skipped", use "N/A" for counts; for Failure Details, rendere2e_validation.skip_reasonwhen present (Feature 130 rich-shape — opt-out reason) else fall back to "E2E validation skipped" (legacy payloads). Ife2e_validation.statusis"error", use "Error" for Status and the error message for Failure Details. - Archived Artifacts: Populate from
test_evidencedata collected in Step 10. Iftest_evidence.skippedis true, use "No test artifacts archived for this feature." If metrics were parsed successfully, populate the artifact table and Archived Artifact Metrics (tests run, passed, failed, coverage). If metrics could not be parsed, use "Manual review required" for the summary column.
- E2E Validation Gate: Read
- Documentation Updates: Agent table populated from Step 3 of the command (documentation agent results)
- Cleanup: Checklist items (left unchecked — will be checked during command Steps 7-11)
11c.1: Render Feature 139 Template Fields (US-6, FR-012..FR-014)
Populate the three Test Evidence subsections (Test Scenarios, Execution Evidence, Manual Validation) from the e2e_validation.* payload.
Condition: Only load the render-tables reference when e2e_validation.status is set (E2E validation was entered during Step 9). When e2e_validation.status is NOT set (Step 9 was skipped because .aod/stack-active.json was absent), render "N/A" for all Test Evidence subsections inline without loading the reference.
When e2e_validation.status IS set:
MANDATORY: You MUST use the Read tool to load references/render-tables.md before proceeding with Feature 139 template rendering. Do NOT rely on memory of prior render-tables content. If the file cannot be read, display an error and STOP.
Follow the Step 11c.1 instructions in references/render-tables.md to populate all three Test Evidence subsections from the accumulated e2e_validation.* payload.
When e2e_validation.status is NOT set (no-E2E close):
Render "N/A" inline for the Test Evidence section. Do NOT load references/render-tables.md.
11d: Write Delivery Document
Write the populated document to {specs_dir}/delivery.md using the Write tool.
Store the delivery document path in variable delivery_doc_path for use in Step 12.
11e: Non-Fatal Fallback Guard
If the file write in Step 11d fails (permissions, disk full, or any error), display the full delivery document content in the terminal as a fallback. The deliver workflow MUST NOT be blocked by a file write failure (FR-009, KB Entry 14).
Missing optional data handling: If surprise_log, next_ideas, or lesson_text are empty or unavailable, use "None" or "N/A" for those sections rather than leaving them blank or erroring.
11f: Mark Draft PR Ready
If a draft PR exists for the current branch, mark it ready for review:
# Find draft PR for current branch
PR_NUMBER=$(gh pr list --head "$(git branch --show-current)" --state open --json number --jq '.[0].number')
# Mark ready if found
if [ -n "$PR_NUMBER" ]; then
gh pr ready "$PR_NUMBER"
fi
If gh is unavailable or no draft PR exists, skip silently (graceful degradation). If no draft PR exists, create a regular PR: gh pr create --title "{NNN}: {Feature Name}" --body "...".
11g: Display Delivery Document
After writing (or after fallback display if write failed), show the full document content in the terminal so the developer can review it immediately.
AOD DELIVERY COMPLETE
Feature: {feature_name}
Delivery Document: {delivery_doc_path}
{Full content of the generated delivery.md}
Step 12: Close Issue and Transition to Done
MANDATORY: You MUST use the Read tool to load references/close-and-document.md before proceeding with issue close and /aod.document prompt. Do NOT rely on memory of prior close-and-document content. If the file cannot be read, display an error and STOP.
Follow Steps 12 and 13 in references/close-and-document.md. After Step 13 completes, proceed to Step 14 (lock release).
Step 14: Release Delivery Lock & Cleanup (US-7, Feature 139 — FR-031)
MANDATORY: You MUST use the Read tool to load references/delivery-lock.md before proceeding with delivery lock release. Do NOT rely on memory of prior delivery-lock content. If the file cannot be read, display an error and STOP.
Follow the Step 14 instructions in references/delivery-lock.md. This is the symmetric counterpart to Step 0.5's lock acquisition — it MUST run on every clean exit path (success, halt after Step 9e, opt-out in Step 9d Path A) as the final act of every invocation.
Edge Cases, Common Rationalizations, and Red Flags
When handling an edge case OR checking a rationalization/red-flag:
MANDATORY: You MUST use the Read tool to load references/edge-cases.md before proceeding with edge-case handling or rationalization/red-flag checks. Do NOT rely on memory of prior edge-cases content. If the file cannot be read, display an error and STOP.
Quality Checklist
- Definition of Done validated (all tasks complete or user override)
- Estimated duration captured from user
- Actual duration computed from branch creation date
- Surprise log captured (minimum 1 sentence)
- Next ideas prompted (optional; each creates GitHub Issue with
stage:discover) - Lessons learned captured with category and full description
- KB entry appended to INSTITUTIONAL_KNOWLEDGE.md with correct entry number
- Delivery metrics posted to GitHub Issue as comment
- Issue transitioned to
stage:deliverlabel (start of retrospective) - BACKLOG.md regenerated
- Test evidence collected or skipped (non-fatal, per ADR-006)
- Retrospective summary displayed with all metrics
- Issue transitioned to
stage:donelabel (end of retrospective) - GitHub Issue closed with closing comment
- User prompted to run
/aod.documentfor post-delivery quality review