quality-and-verification

name: "quality-and-verification"
description: "5-level verification pyramid: static→unit→Playwright E2E (homepage-first, 6bp)→AI visual→post-deploy. 8-check quality gate. Multi-agent testing (functional/security/a11y/performance). Playwright v1.59+ AI agents (Planner/Generator/Healer). WCAG 2.2 AA via axe-core v4.11. Percy+Chromatic visual regression. ADA Title II 2027/2028 deadlines."
metadata:
  version: "2.1.0"
  updated: "2026-05-03"
  effort: "high"
  model: "sonnet"
license: "Rutgers"
compatibility:
  claude-code: ">=2.0.0"
  agentskills: ">=1.0.0"
submodules:
  - accessibility-gate.md
  - adversarial-testing.md
  - agentic-security.md
  - audio-video-sync.md
  - build-breaking-rules.md
  - chrome-and-browser-workflows.md
  - completeness-verification.md
  - computer-use-automation.md
  - contract-testing.md
  - e2e-accumulation.md
  - eval-driven-development.md
  - evidence-collection.md
  - performance-optimization.md
  - picovoice-eagle-biometric.md
  - security-hardening.md
  - semgrep-codebase-rules.md
  - slop-detection.md
  - spec-driven-development.md
  - stagehand-ai-fallback.md
  - stagehand-ai-testing.md
  - tdd-verification.md
  - testing-matrices.md
  - ui-completeness-sweep.md
  - visual-inspection-loop.md
  - visual-regression.md
  - wcag-2-2-2026.md
priority: 2
pack: "testing"
triggers:
  - "test"
  - "verify"
  - "qa"
  - "lighthouse"
paths:
  - "*"

Static — TS strict + ESLint + oxlint + Prettier + knip (dead code)
Unit — Vitest 3 (40% faster on 5k+ tests, Rust sharding, browser mode default)
Playwright E2E — homepage-first, 6 viewports × 3 browsers, hermetic, parallel
AI visual — vision rubric ≥8/10 per route, 6bp screenshots
Post-deploy — wrangler tail clean + console-error-free + axe-clean + Lighthouse green

Any fail = blocker. Fix-forward per rules/verification-loop.md.
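The "any fail = blocker" ordering can be sketched as a small runner. This is a minimal illustration, not the contents of rules/verification-loop.md: `Level`, `runPyramid`, and the `run` callbacks are hypothetical names, and stopping at the first failing level is an assumed design choice.

```typescript
// Hypothetical types and names; each `run` callback stands in for whatever
// command actually executes that level (tsc, vitest, playwright, ...).
type Level = {
  name: string;
  run: () => Promise<boolean>; // resolves true when the level passes
};

type PyramidResult = { passed: string[]; blocker: string | null };

// Runs the levels in order and stops at the first failure, so the slower
// levels never run on top of a broken cheaper level.
async function runPyramid(levels: Level[]): Promise<PyramidResult> {
  const passed: string[] = [];
  for (const level of levels) {
    let ok = false;
    try {
      ok = await level.run();
    } catch {
      ok = false; // a thrown error counts as a failed level
    }
    if (!ok) {
      return { passed, blocker: level.name };
    }
    passed.push(level.name);
  }
  return { passed, blocker: null };
}
```

A caller would treat any non-null `blocker` as a reason to stop and fix forward before rerunning from the failed level.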

Violating any = build fail.

Random snapshot sampling 30% per step (seeded hash, reproducible)
New-section AI vision: e2e/__seen-routes__.json gates first render of any unknown route
Rubric: layout sane / contrast WCAG AA / brand / no slop / ≥8/10 (Claude Sonnet 4.6 or GPT Image 2 vision)
Baselines in e2e/__snapshots__/
Pixelmatch tolerance 0.1% / 0.5% area

Per rules/e2e-visual-inspection.md.
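One way to get reproducible 30% sampling is to hash a seed together with the spec and step, then compare the result against the rate. The sketch below assumes an FNV-1a hash and a `seed:spec:step` key; both are illustrative choices, and `fnv1a` and `shouldSnapshot` are hypothetical names rather than anything defined in rules/e2e-visual-inspection.md.

```typescript
// FNV-1a, 32-bit, applied to UTF-16 code units. Small, dependency-free,
// and stable across runs and machines.
function fnv1a(input: string): number {
  let hash = 0x811c9dc5;
  for (let i = 0; i < input.length; i++) {
    hash ^= input.charCodeAt(i);
    hash = Math.imul(hash, 0x01000193) >>> 0;
  }
  return hash >>> 0;
}

// Returns true when this step should be snapshotted. The same
// (seed, spec, step) triple always gives the same answer.
function shouldSnapshot(
  seed: string,
  spec: string,
  step: number,
  rate = 0.3, // the 30% sampling rate from the list above
): boolean {
  const bucket = fnv1a(`${seed}:${spec}:${step}`) / 0x100000000; // in [0, 1)
  return bucket < rate;
}
```

Changing the seed reshuffles which steps are sampled, while a fixed seed keeps a failing run reproducible.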

Percy AI Visual Review — 3× faster review, 40% OCR-based noise filter, full-page + flows
Chromatic — component-level via Storybook
pixelmatch — local deterministic CI

Three-tier: local → PR → deploy.

Spawn parallel in single Agent call: functional, security, a11y, performance.

Each: 100-300 word brief, ≤200 word summary back. Per rules/agent-selection.md.
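The brief and summary limits can be modeled as a contract around a parallel fan-out. This is only a sketch: `Brief`, `Report`, `SpawnAgent`, and `runAgents` are hypothetical names, the real fan-out happens in the agent runtime rather than in `Promise.all`, and rules/agent-selection.md may define the limits differently.

```typescript
type Brief = { role: string; brief: string };
type Report = { role: string; summary: string };
// Hypothetical stand-in for whatever actually launches a sub-agent.
type SpawnAgent = (brief: Brief) => Promise<string>;

const wordCount = (text: string): number =>
  text.trim().split(/\s+/).filter(Boolean).length;

async function runAgents(briefs: Brief[], spawn: SpawnAgent): Promise<Report[]> {
  // Validate every brief before launching anything.
  for (const b of briefs) {
    const n = wordCount(b.brief);
    if (n < 100 || n > 300) {
      throw new Error(`${b.role}: brief is ${n} words, expected 100-300`);
    }
  }
  // map() starts all agents first; Promise.all then waits for all of them.
  return Promise.all(
    briefs.map(async (b) => {
      const summary = await spawn(b);
      const n = wordCount(summary);
      if (n > 200) {
        throw new Error(`${b.role}: summary is ${n} words, limit is 200`);
      }
      return { role: b.role, summary };
    }),
  );
}
```

Rejecting an oversized summary outright is a strict choice; truncating or asking the agent to retry would be equally reasonable.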

After every deploy, check browser console for CSP violations, JS errors, failed resources
ALL must be 0 before marking complete
Enforced by the console-error gate in rules/verification-loop.md
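The zero-tolerance check above can be sketched as a pure function over events collected from the browser. `BrowserEvent`, `GateResult`, and `consoleGate` are hypothetical names, and the text patterns used to spot CSP violations and failed resources are assumptions based on typical Chromium console messages, not part of rules/verification-loop.md.

```typescript
type BrowserEvent =
  | { kind: "console"; level: string; text: string }
  | { kind: "pageerror"; text: string }
  | { kind: "requestfailed"; url: string };

type GateResult = {
  csp: number;
  jsErrors: number;
  failedResources: number;
  pass: boolean;
};

// With Playwright, events could be collected roughly like this:
//   page.on("console", (m) => events.push({ kind: "console", level: m.type(), text: m.text() }));
//   page.on("pageerror", (e) => events.push({ kind: "pageerror", text: e.message }));
//   page.on("requestfailed", (r) => events.push({ kind: "requestfailed", url: r.url() }));
function consoleGate(events: BrowserEvent[]): GateResult {
  let csp = 0;
  let jsErrors = 0;
  let failedResources = 0;
  for (const e of events) {
    if (e.kind === "requestfailed") {
      failedResources++;
    } else if (e.kind === "pageerror") {
      jsErrors++;
    } else if (e.level === "error") {
      // Assumption: CSP violations and HTTP 4xx/5xx loads surface as console
      // errors with recognizable text; adjust the patterns per browser.
      if (/content security policy/i.test(e.text)) csp++;
      else if (/failed to load resource/i.test(e.text)) failedResources++;
      else jsErrors++;
    }
  }
  return { csp, jsErrors, failedResources, pass: csp + jsErrors + failedResources === 0 };
}
```

HTTP error responses such as 404 do not fire `requestfailed` in Playwright, which is why the sketch also inspects console error text for failed resources.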

e2e/FEATURES.md — row per feature
e2e/COVERAGE.yml — feature→spec map; CI fails on any feature without entry/test
Pre-commit lint: new component without matching e2e/<feature>/ warns
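The CI rule for e2e/COVERAGE.yml can be sketched as a check over already-parsed data. The sketch assumes the feature list and the feature→spec map have been loaded elsewhere (parsing e2e/FEATURES.md and the YAML is out of scope), and `findUncovered` and `specExists` are hypothetical names.

```typescript
// feature name -> spec file paths, as parsed from e2e/COVERAGE.yml
type CoverageMap = Map<string, string[]>;

// Returns every feature that has no entry, an empty entry, or only
// spec paths that do not exist on disk.
function findUncovered(
  features: string[],
  coverage: CoverageMap,
  specExists: (path: string) => boolean,
): string[] {
  return features.filter((feature) => {
    const specs = coverage.get(feature) ?? [];
    return !specs.some(specExists);
  });
}
```

In CI, a non-empty result would be printed and turned into a non-zero exit code; passing `fs.existsSync` as `specExists` is one way to wire the check to the real file system.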