spot-check - SKILL.md Agent Skill

name: spot-check description: > Ad hoc, AI-driven verification of a specific behavior — not a checked-in E2E test. Use after developing a feature, fixing a bug, validating a branch, or post-release. Drives Playwright MCP for Forge iframe UI, or CLI for backend/analytics signals. Triggers on "spot check", "run a spot check on X", "spot check this fix", "spot check on staging", "spot check zenuml-lite@stg", "verify on staging".

Spot Check

A spot check is an ad hoc, AI-driven, ephemeral verification of a specific behavior. It is not a pre-written test case and not meant for long-term use.

What it is NOT: a pre-written .spec.ts file, a comprehensive regression test, or a repeatable automated test.

Key principles

Lightweight — reuse what already exists. If a page with the relevant macro is already available, use it. Navigate directly to the macro when you know which one matters.
AI-driven — use Playwright MCP (mcp__playwright__*) to improvise steps. It is the only tool that reliably reaches inside Forge cross-origin iframes. Claude in Chrome / cursor-ide-browser can handle Confluence page chrome outside the iframe only. No script is checked in.
Ephemeral — test steps are not saved for future use.
Targeted — verify the specific behavior being checked, not a full regression.
Real world — verify the behaviour on a confluence site, not a local fixture or unit test.
Forge tunnel — use forge tunnel (separate skill) and lite-dev.atlassian.net by default.

Write the plan first

STOP. Do not open the browser or run queries until the plan is written.

Each planned check must name:

Behavior — what changed or what you are verifying
Observable signal — UI element, Mixpanel event + property, network response, D1 row, etc.
Method — Playwright step, request intercept, curl, wrangler, etc.

Each item must be independently pass/fail checkable before you run it.

Spot check plan: <short title>

Target: <site / page URL / API path>
  - [ ] <specific observable assertion>  [method]
  - [ ] <specific observable assertion>  [method]

Skipped: <anything out of scope> — <reason>

For post-release spot checks (release delta, commit triage, N/A rules), follow Step 2.6 in the release-app skill — it extends this plan format.

For branch validation before push (Forge tunnel vs dev site), follow Step 2 in the validate-branch skill after writing the plan here.

Choosing the environment

Situation	Target environment
New feature not yet deployed	Forge tunnel → `lite-dev.atlassian.net` (see validate-branch)
Deployed to staging / failing pipeline	Staging site (`lite-stg.atlassian.net`)
Reproducing a production issue	Production site directly (zenuml.atlassian.net)
Post-release validation	Production — same variant as the release
Backend-only (webhook, API, D1)	`curl` / wrangler against staging or prod
Validating the workflow itself	Any appropriate env

App profiles and credentials: tests/e2e-tests/config/apps.ts, .env.forge.local.

Verification methods

Use whichever signals the behavior requires. Mix freely — e.g. drive the browser, then query D1, then check Mixpanel.

Signal	How
UI behavior	Playwright MCP (`mcp__playwright__*`) — use `frameLocator()` inside Forge iframes
Analytics events	`mcp__mixpanel__Run-Query` with `project_id=3373228`
Forge logs	`forge logs --environment staging` / `forge logs --environment production`
Cloudflare Pages Functions logs	`wrangler pages deployment tail --project-name <project> --environment production`
Cloudflare Workers logs	`wrangler tail`
D1 database state	`wrangler d1 execute <db> --remote --command "SELECT ..."`
R2 object storage	`wrangler r2 object get <bucket>/<key>`

Analytics reference: docs/analytics-reference.md.

Workflow

Plan — behavior, target page/macro or data path, expected signal per assertion (see above).
Navigate — open the target Confluence site if UI is involved. Reuse the Chrome logged-in session when possible.
Reuse fixtures — prefer an existing page with the relevant macro. Create one only if none exists (create-test-page skill for API-only setup).
Execute — run each planned check. Screenshot or capture evidence after key steps. Report pass / fail / skipped per assertion.

Forge iframe tooling

Forge Custom UI renders in sandboxed cross-origin iframes (OOPIFs). Only Playwright reliably crosses that boundary.

Tool	Forge iframe access
Playwright MCP	✅ Yes — `frameLocator()`
chrome-devtools-mcp	❌ No
cursor-ide-browser	❌ No — page chrome only
claude-in-chrome	❌ No

Common gotchas:

Selectors from the top frame miss Forge UI — scope to the iframe.
Paywall state: use the macro toolbar Preset: dropdown (Bystander/Owner/etc.) for deterministic variants.
Version label in the macro toolbar confirms which build you hit (vYYYY.MM… = public deploy; branch SHA = tunnel/dev).

Before testing — mandatory pre-flight:

Enable zenumlDebug — open the browser console on the target Confluence page and run localStorage.setItem('zenumlDebug', 'true'), then reload. This unlocks the debug toolbar and version label inside the macro. Without it, key diagnostic signals are invisible.
Confirm the commit — read the version label in the macro toolbar and match it against the expected commit SHA or release tag. GitHub PR commits use a merge commit hash (e.g. abc1234 from the "Merge pull request" commit) that differs from the branch HEAD — always verify against the actual deployed SHA, not the PR's branch tip.

Related skills

Skill	When
repro	Confirm a bug exists before fixing
validate-branch	Pre-push branch smoke via tunnel or dev site
release-app	Step 2.6 — release-delta spot check after PVT
pvt / pvt-*	Reusable production recipes — methods, not substitutes for writing assertions
create-test-page	API-only page setup when you need specific macro content without the editor
graph-macro, copy-macro, etc.	Focused recipes for specific macro flows