verification-matrix - SKILL.md Agent Skill

name: verification-matrix
description: Choose the smallest correct validation set for a change in this repository. Use before or after edits to map touched files to the repo validation matrix and separate pre-existing failures from newly introduced ones.

This skill turns the repo's written validation rules into an explicit workflow.

Use it to choose the smallest correct set of checks instead of running full CI by reflex.

Do not use this skill:

when the user explicitly asks for full CI regardless of touched surface
as a substitute for targeted browser verification when runtime UI behavior is uncertain
as a substitute for deeper regression judgment in high-risk engine work

The validation matrix in AGENTS.md is the canonical policy:

docs, setup, and low-risk config -> npm run typecheck
UI, hooks, auth, and general TS changes -> npm run lint and npm run typecheck
engine, parser, or dictionary changes -> npm run test:unit, npm run regression, and npm run regression:gold
deploy or env-boundary changes -> npm run build

If a change spans multiple categories, run the union of the required checks.
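
The union rule can be sketched in a few lines of TypeScript. This is an illustration only, not code from the repository: the category keys (`docs`, `ui`, `engine`, `deploy`) and the `requiredChecks` helper are hypothetical names, and the commands are copied from the matrix above.

```typescript
// Hypothetical category keys; the commands mirror the matrix in AGENTS.md.
type Category = "docs" | "ui" | "engine" | "deploy";

const MATRIX: Record<Category, readonly string[]> = {
  docs: ["npm run typecheck"],
  ui: ["npm run lint", "npm run typecheck"],
  engine: ["npm run test:unit", "npm run regression", "npm run regression:gold"],
  deploy: ["npm run build"],
};

// Union of the required commands, de-duplicated, in first-seen order.
function requiredChecks(categories: readonly Category[]): string[] {
  const seen = new Set<string>();
  for (const category of categories) {
    for (const command of MATRIX[category]) {
      seen.add(command);
    }
  }
  return Array.from(seen);
}

console.log(requiredChecks(["ui", "deploy"]));
// ["npm run lint", "npm run typecheck", "npm run build"]
```

A change that touches both docs and UI resolves to two commands, not three, because `npm run typecheck` appears in both rows.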

Current package-script relationships:

npm run ci:check runs check:migrations, check:repo-boundaries, lint, typecheck, test, and build
npm run test runs test:unit and breadth regression
npm run regression:gold is separate from npm run test
npm run test:overlay is separate from npm run test and ci:check
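
One consequence of these relationships is that even full CI does not cover every matrix requirement. The sketch below expands the script composition described above to show this. It is a hedged illustration: `COMPOSITION` and `expandScript` are hypothetical names, and treating "breadth regression" as the `regression` script is an assumption.

```typescript
// Direct composition as described above. Using "regression" for the
// breadth regression run is an assumption, not confirmed by the source.
const COMPOSITION: Record<string, readonly string[]> = {
  "ci:check": ["check:migrations", "check:repo-boundaries", "lint", "typecheck", "test", "build"],
  test: ["test:unit", "regression"],
};

// Expand a script into the set of leaf scripts it ends up running.
function expandScript(script: string): Set<string> {
  const leaves = new Set<string>();
  const stack: string[] = [script];
  while (stack.length > 0) {
    const current = stack.pop() as string;
    const children = COMPOSITION[current];
    if (children === undefined) {
      leaves.add(current); // not a composite script, so it is a leaf
    } else {
      stack.push(...children);
    }
  }
  return leaves;
}

const ciLeaves = expandScript("ci:check");
console.log(ciLeaves.has("test:unit"));       // true
console.log(ciLeaves.has("regression:gold")); // false
console.log(ciLeaves.has("test:overlay"));    // false
```

Under these assumptions, an engine change still needs `npm run regression:gold` run explicitly, whether or not `ci:check` was also run.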

Run:

git status --short

Separate:

Do not assume the baseline is clean.
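
To feed the touched paths into the mapping step, the `git status --short` output can be parsed into a plain list. The sketch below is a minimal illustration: `parseStatusShort` is a hypothetical helper, the sample paths are made up, and quoted paths (which Git emits for unusual characters) are not handled.

```typescript
// Parse `git status --short` output into the list of touched paths.
// Each line is a two-character status, a space, then the path.
// Renames appear as "old -> new"; the new path is kept.
function parseStatusShort(output: string): string[] {
  const paths: string[] = [];
  for (const line of output.split("\n")) {
    if (line.length < 4) continue; // skip blank or malformed lines
    // Do not trim first: the status column may start with a space.
    const rest = line.slice(3);
    const arrow = rest.indexOf(" -> ");
    paths.push(arrow === -1 ? rest : rest.slice(arrow + 4));
  }
  return paths;
}

// Made-up sample output for illustration.
const sampleStatus = [
  " M lib/parser.ts",
  "?? docs/notes.md",
  "R  hooks/a.ts -> hooks/b.ts",
].join("\n");

console.log(parseStatusShort(sampleStatus));
// ["lib/parser.ts", "docs/notes.md", "hooks/b.ts"]
```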

Use the smallest matching category or union of categories.

Typical mappings in this repo:

docs and skill files, README.md, CLAUDE.md, AGENTS.md -> docs/setup
app/, components/, hooks/, most TypeScript modules -> UI/hooks/general TS
lib/auth/, lib/supabase/, proxy.ts -> auth/general TS
lib/engine/, lib/parser.ts, lib/dictionaries.ts -> engine/parser/dictionary
build or deploy contract files, env-validation scripts, deployment boundary files -> deploy/env boundary

When in doubt, choose the narrower valid category first and expand only if the change actually crosses boundaries.
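
The typical mappings can be approximated with a small path classifier. This is a sketch under stated assumptions, not the repo's tooling: `classifyPath` is a hypothetical helper, treating any `.md` file as docs/setup is an assumption, and deploy or env-boundary files are left unclassified because the list above does not name concrete paths for them.

```typescript
type Surface =
  | "docs/setup"
  | "UI/hooks/general TS"
  | "auth/general TS"
  | "engine/parser/dictionary";

// Returns "unclassified" when no listed rule matches; those paths
// (including deploy/env boundary files) need manual judgment.
function classifyPath(path: string): Surface | "unclassified" {
  const startsWithAny = (prefixes: readonly string[]): boolean =>
    prefixes.some((prefix) => path.startsWith(prefix));

  // More specific lib/ rules come first.
  if (startsWithAny(["lib/engine/"]) || path === "lib/parser.ts" || path === "lib/dictionaries.ts") {
    return "engine/parser/dictionary";
  }
  if (startsWithAny(["lib/auth/", "lib/supabase/"]) || path === "proxy.ts") {
    return "auth/general TS";
  }
  // Assumption: Markdown files are docs or skill files.
  if (path.endsWith(".md")) {
    return "docs/setup";
  }
  if (startsWithAny(["app/", "components/", "hooks/"])) {
    return "UI/hooks/general TS";
  }
  // "Most TypeScript modules" fall into the general TS category.
  if (path.endsWith(".ts") || path.endsWith(".tsx")) {
    return "UI/hooks/general TS";
  }
  return "unclassified";
}

console.log(classifyPath("lib/parser.ts")); // "engine/parser/dictionary"
console.log(classifyPath("README.md"));     // "docs/setup"
console.log(classifyPath("package.json"));  // "unclassified"
```

Returning "unclassified" rather than guessing keeps the classifier from silently widening or narrowing the check set.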

Recommend the minimal commands required by the mapped categories.

Examples:

skill or docs-only change -> npm run typecheck
dashboard hook refactor -> npm run lint and npm run typecheck
lib/parser.ts change -> npm run test:unit, npm run regression, and npm run regression:gold
env-validation or build-boundary change -> npm run build

Mixed-surface changes should use the union, not full CI by default.

When checks fail:

Do a focused triage before widening the run:

Record the exact command, exit code, and first useful error.
Rerun the narrowest reproducer when one exists, such as a single node --test file or the failing regression case.
Compare the failure to baseline evidence:
- pre-existing failure: same failure was already known, or reproduces without the current change surface
- introduced failure: points at touched files or disappears when the current change is removed
- unknown: evidence is insufficient; say what would prove it
Identify the likely seam: lint/type error, unit contract, regression output, build/env boundary, browser/runtime behavior, or external service.
Recommend the smallest next check or fix. Do not turn a narrow failure into full CI unless the failure shows broader blast radius.
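
The baseline comparison in the triage steps can be written down as a small decision function. This is a hedged sketch: the `FailureEvidence` shape and `classifyFailure` are hypothetical, and giving pre-existing evidence priority when signals conflict is an assumption of this example.

```typescript
type Verdict = "pre-existing" | "introduced" | "unknown";

// Hypothetical record of what was observed for one failing check.
interface FailureEvidence {
  command: string;                // exact command that failed
  exitCode: number;
  firstError: string;             // first useful error line
  knownBefore?: boolean;          // failure was already known
  failsWithoutChange?: boolean;   // undefined means not yet tried
  pointsAtTouchedFiles?: boolean; // error references a touched file
}

function classifyFailure(evidence: FailureEvidence): Verdict {
  // Assumption: pre-existing evidence wins when signals conflict.
  if (evidence.knownBefore === true || evidence.failsWithoutChange === true) {
    return "pre-existing";
  }
  if (evidence.failsWithoutChange === false || evidence.pointsAtTouchedFiles === true) {
    return "introduced";
  }
  // Not enough evidence; the report should say what would prove it.
  return "unknown";
}

console.log(
  classifyFailure({
    command: "npm run lint",
    exitCode: 1,
    firstError: "example error text",
    pointsAtTouchedFiles: true,
  }),
); // "introduced"
```

Leaving `failsWithoutChange` undefined until the check has actually been rerun without the change keeps "not tried" distinct from "tried and passed".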

Do not widen validation because it feels safer.

Run more than the matrix minimum only when:

Keep the result compact: