canary - SKILL.md Agent Skill

name: canary description: > Post-deploy monitoring with browser or degraded HTTP mode. Checks console errors, performance, page load. Configurable duration (1m-30m) and interval. Reports HEALTHY/DEGRADED/BROKEN. Flags: --duration, --interval, --quick, --max-errors. codesift_tools: always: - analyze_project - index_status - plan_turn

by_stack: {} # browser monitoring — no code analysis

zuvo:canary

Monitor production after deployment. Browser-based or HTTP-only health checks with configurable duration.

Argument Parsing

Argument	Effect
`<url>`	Production URL to monitor (REQUIRED)
`--duration <time>`	Monitoring duration (default: 10m, range: 1m-30m)
`--interval <time>`	Check interval (default: 60s)
`--quick`	Single health check, no loop
`--max-errors <n>`	Error threshold for FAIL verdict (default: 3)

Mandatory File Loading

Before starting any phase, read these shared includes:

INCLUDES LOADED:
  1. ../../shared/includes/env-compat.md     — READ
  2. ../../shared/includes/run-logger.md     — READ
  3. ../../shared/includes/retrospective.md     — READ

Phase 0: Setup

Step 1: Validate URL

If no URL argument is provided, STOP immediately:

URL is required. Usage: zuvo:canary https://myapp.com

Step 2: Detect Browser Capability

Check whether the current environment exposes browser automation tools compatible with Playwright or Chrome DevTools.

If browser tooling is available: set MODE=full.

If not: set MODE=degraded and print:

[DEGRADED: no browser tools] — running HTTP-only checks. Console errors and screenshots unavailable.

Step 3: Check Environment (per env-compat.md)

If running in a non-interactive environment (Codex App, Cursor):

Default to --quick behavior (single check) only if no --duration was explicitly passed.
If the user explicitly passed --duration: honor it — run the full monitoring loop. Non-interactive environments can execute timed loops; the limitation is user interaction, not execution duration.

Print:

[AUTO-DECISION]: defaulting to one-shot mode. Pass --duration to enable monitoring loop.

Step 4: Parse Arguments

Parse --duration and --interval. Validate duration range (1m–30m inclusive). If duration is outside this range, stop with:

--duration must be between 1m and 30m. Got: <value>

Defaults:

--duration: 10m
--interval: 60s
--max-errors: 3

Phase 1: Platform Context (optional)

Read ../../shared/includes/platform-detection.md and follow the detection algorithm.

If a platform is detected, note its health check command. This is supplementary — the primary check is always the URL-based approach in Phase 2. Platform detection may provide additional commands (e.g., fly status) that can supplement the HTTP check.

Phase 2: Monitoring Loop

One-shot mode

If --quick is passed, or non-interactive environment was detected AND no --duration was explicitly provided: run exactly ONE check cycle, then proceed to Phase 3.

Loop mode

Otherwise: run one check cycle every --interval seconds, for the total --duration. Track cumulative error count across all checks. Store all individual response times in an array for p95 computation in Phase 3.

Check Cycle — Full Mode (Playwright or Chrome DevTools available)

Navigate to URL:
- Playwright: mcp__playwright__browser_navigate(url=<url>)
- Chrome DevTools: mcp__chrome-devtools__navigate_page(url=<url>)
Check HTTP status by reading the main document request from network logs:
- Playwright: inspect the main document request and read its status.
- Chrome DevTools: inspect the main document request and read its status.
- If browser network data is unavailable: run a degraded-mode curl request against the same URL for status verification.
- Do not assume navigation success means HTTP 200.
Expect 200. Non-200 = CRITICAL alert.
Capture console errors:
- Playwright: mcp__playwright__browser_console_messages()
- Chrome DevTools: mcp__chrome-devtools__list_console_messages()
Record error-level entries. Each new Error-level console message = HIGH alert. Accumulate total error count.
Measure page load time. Response time >10s = MEDIUM alert.
Take screenshot: save to zuvo/reports/canary-{ISO-timestamp}/check-{N}.png
- Playwright: mcp__playwright__browser_take_screenshot(path=...)
- Chrome DevTools: mcp__chrome-devtools__take_screenshot(path=...)
- ISO-timestamp format: YYYY-MM-DDTHHMM (e.g., 2026-03-28T1430)

Check Cycle — Degraded Mode (HTTP only)

Run:

curl -s -o /dev/null -w "%{http_code} %{time_total}" <url>

Check HTTP status: expect 200. Non-200 = CRITICAL alert.
Record response time. Response time >10s = MEDIUM alert.
No console error capture. No screenshots.

Alert Conditions (per check)

Condition	Severity
Page load failure (non-200 HTTP status)	CRITICAL
New console errors (Error level)	HIGH
Response time >10s	MEDIUM

Phase 3: Output

Step 1: Determine Verdict

After all check cycles complete, assess the cumulative results:

HEALTHY: All checks returned 200, console errors at or below --max-errors threshold, no critical failures.
DEGRADED: No critical failures (all 200s), but warnings present — slow response times, minor console warnings, or console error count approaching --max-errors threshold.
BROKEN: Any of the following: page load failure (non-200), critical console errors observed, or cumulative console error count exceeds --max-errors threshold.

Step 2: Rollback Suggestion (BROKEN verdict only)

If verdict is BROKEN:

If a platform was detected in Phase 1 and it has a rollbackCmd, print the platform-specific rollback command:
```
Rollback command (<platform>): <rollbackCmd>
```
If no platform was detected, attempt git-native rollback:
- Read memory/last-ship.json and extract releaseCommitSha. If available:
```
Rollback (git-native): git revert <releaseCommitSha> && git push
```
- If memory/last-ship.json is not available or releaseCommitSha is missing:
```
Rollback: identify the release commit SHA manually and run: git revert <sha> && git push
```
Do NOT reference zuvo:deploy rollback — that interface does not exist. Always print the concrete command with a real SHA, not a placeholder.

Step 3: Write Summary and Baseline Comparison

After all checks complete, write zuvo/reports/canary-<ISO-timestamp>/summary.json:

{
  "url": "<url>",
  "mode": "full" or "degraded",
  "checks": <N>,
  "avgResponseTime": <seconds>,
  "p95ResponseTime": <seconds>,  // computed from stored response times array: sort ascending, take value at index ceil(0.95 * N) - 1
  "consoleErrors": <N>,
  "consoleWarnings": <N>,
  "verdict": "HEALTHY" or "DEGRADED" or "BROKEN",
  "date": "<ISO-8601>"
}

Check if prior canary runs exist in zuvo/reports/canary-*/summary.json. If found, compare avgResponseTime and consoleErrors against the most recent prior run and compute percentage deltas for the Performance line.

If no prior run exists, omit the baseline comparison from the Performance line.

Step 4: Print CANARY COMPLETE Block

CANARY COMPLETE
  URL:         <url>
  Duration:    <duration> (<N> checks)
  Mode:        full (Playwright available) | degraded (HTTP-only)
  Console:     <N> errors, <N> warnings
  Performance: avg <N>s (baseline: <N>s, +<N>%) | avg <N>s (no baseline)
  Screenshots: zuvo/reports/canary-<ISO-timestamp>/  [full mode only — omit in degraded mode]
  Verdict:     HEALTHY | DEGRADED | BROKEN

Step 5: Append Run Log

Run: <ISO-8601-Z>\tcanary\t<project>\t-\t-\t<VERDICT>\t-\t<DURATION>\t<NOTES>\t<BRANCH>\t<SHA7>\t<INCLUDES>\t<TIER>


### Retrospective (REQUIRED)

Follow the retrospective protocol from `retrospective.md`.
Gate check → structured questions → TSV emit → markdown append.
If gate check skips: print "RETRO: skipped (trivial session)" and proceed.

After printing this block, append the `Run:` line value (without the `Run: ` prefix) to the log file path resolved per `run-logger.md`.

VERDICT mapping: HEALTHY → PASS, DEGRADED → WARN, BROKEN → FAIL.
DURATION: resolved monitoring duration (e.g., `one-shot`, `10m`, `30m`).

Edge Cases

#	Condition	Handling
E11	No browser MCP tools available	Degraded mode — HTTP-only. Annotate `[DEGRADED: no browser tools]`.
E12	Non-interactive environment (Codex, Cursor)	Default to one-shot mode unless `--duration` is explicitly passed. Annotate `[AUTO-DECISION]: defaulting to one-shot mode`.
—	URL is missing	STOP with "URL is required" message.
—	Duration outside 1m–30m	STOP with range validation error.
—	First canary run (no baseline)	Omit baseline comparison from Performance line.
—	Screenshot directory doesn't exist	Create `zuvo/reports/canary-{ISO-timestamp}/` before writing.