name: canary description: > Post-deploy monitoring with browser or degraded HTTP mode. Checks console errors, performance, page load. Configurable duration (1m-30m) and interval. Reports HEALTHY/DEGRADED/BROKEN. Flags: --duration, --interval, --quick, --max-errors. codesift_tools: always: - analyze_project - index_status - plan_turn
by_stack: {} # browser monitoring — no code analysis
zuvo:canary
Monitor production after deployment. Browser-based or HTTP-only health checks with configurable duration.
Argument Parsing
| Argument | Effect |
|---|---|
<url> |
Production URL to monitor (REQUIRED) |
--duration <time> |
Monitoring duration (default: 10m, range: 1m-30m) |
--interval <time> |
Check interval (default: 60s) |
--quick |
Single health check, no loop |
--max-errors <n> |
Error threshold for FAIL verdict (default: 3) |
Mandatory File Loading
Before starting any phase, read these shared includes:
INCLUDES LOADED:
1. ../../shared/includes/env-compat.md — READ
2. ../../shared/includes/run-logger.md — READ
3. ../../shared/includes/retrospective.md — READ
Phase 0: Setup
Step 1: Validate URL
If no URL argument is provided, STOP immediately:
URL is required. Usage: zuvo:canary https://myapp.com
Step 2: Detect Browser Capability
Check whether the current environment exposes browser automation tools compatible with Playwright or Chrome DevTools.
If browser tooling is available: set
MODE=full.If not: set
MODE=degradedand print:[DEGRADED: no browser tools] — running HTTP-only checks. Console errors and screenshots unavailable.
Step 3: Check Environment (per env-compat.md)
If running in a non-interactive environment (Codex App, Cursor):
Default to
--quickbehavior (single check) only if no--durationwas explicitly passed.If the user explicitly passed
--duration: honor it — run the full monitoring loop. Non-interactive environments can execute timed loops; the limitation is user interaction, not execution duration.Print:
[AUTO-DECISION]: defaulting to one-shot mode. Pass --duration to enable monitoring loop.
Step 4: Parse Arguments
Parse --duration and --interval. Validate duration range (1m–30m inclusive). If duration is outside this range, stop with:
--duration must be between 1m and 30m. Got: <value>
Defaults:
--duration: 10m--interval: 60s--max-errors: 3
Phase 1: Platform Context (optional)
Read ../../shared/includes/platform-detection.md and follow the detection algorithm.
If a platform is detected, note its health check command. This is supplementary — the primary check is always the URL-based approach in Phase 2. Platform detection may provide additional commands (e.g., fly status) that can supplement the HTTP check.
Phase 2: Monitoring Loop
One-shot mode
If --quick is passed, or non-interactive environment was detected AND no --duration was explicitly provided: run exactly ONE check cycle, then proceed to Phase 3.
Loop mode
Otherwise: run one check cycle every --interval seconds, for the total --duration. Track cumulative error count across all checks. Store all individual response times in an array for p95 computation in Phase 3.
Check Cycle — Full Mode (Playwright or Chrome DevTools available)
Navigate to URL:
- Playwright:
mcp__playwright__browser_navigate(url=<url>) - Chrome DevTools:
mcp__chrome-devtools__navigate_page(url=<url>)
- Playwright:
Check HTTP status by reading the main document request from network logs:
- Playwright: inspect the main document request and read its status.
- Chrome DevTools: inspect the main document request and read its status.
- If browser network data is unavailable: run a degraded-mode
curlrequest against the same URL for status verification. - Do not assume navigation success means HTTP 200.
Expect 200. Non-200 = CRITICAL alert.
Capture console errors:
- Playwright:
mcp__playwright__browser_console_messages() - Chrome DevTools:
mcp__chrome-devtools__list_console_messages()
Record error-level entries. Each new Error-level console message = HIGH alert. Accumulate total error count.
- Playwright:
Measure page load time. Response time >10s = MEDIUM alert.
Take screenshot: save to
zuvo/reports/canary-{ISO-timestamp}/check-{N}.png- Playwright:
mcp__playwright__browser_take_screenshot(path=...) - Chrome DevTools:
mcp__chrome-devtools__take_screenshot(path=...) - ISO-timestamp format:
YYYY-MM-DDTHHMM(e.g.,2026-03-28T1430)
- Playwright:
Check Cycle — Degraded Mode (HTTP only)
Run:
curl -s -o /dev/null -w "%{http_code} %{time_total}" <url>Check HTTP status: expect 200. Non-200 = CRITICAL alert.
Record response time. Response time >10s = MEDIUM alert.
No console error capture. No screenshots.
Alert Conditions (per check)
| Condition | Severity |
|---|---|
| Page load failure (non-200 HTTP status) | CRITICAL |
| New console errors (Error level) | HIGH |
| Response time >10s | MEDIUM |
Phase 3: Output
Step 1: Determine Verdict
After all check cycles complete, assess the cumulative results:
- HEALTHY: All checks returned 200, console errors at or below
--max-errorsthreshold, no critical failures. - DEGRADED: No critical failures (all 200s), but warnings present — slow response times, minor console warnings, or console error count approaching
--max-errorsthreshold. - BROKEN: Any of the following: page load failure (non-200), critical console errors observed, or cumulative console error count exceeds
--max-errorsthreshold.
Step 2: Rollback Suggestion (BROKEN verdict only)
If verdict is BROKEN:
- If a platform was detected in Phase 1 and it has a
rollbackCmd, print the platform-specific rollback command:Rollback command (<platform>): <rollbackCmd> - If no platform was detected, attempt git-native rollback:
- Read
memory/last-ship.jsonand extractreleaseCommitSha. If available:Rollback (git-native): git revert <releaseCommitSha> && git push - If
memory/last-ship.jsonis not available orreleaseCommitShais missing:Rollback: identify the release commit SHA manually and run: git revert <sha> && git push
- Read
- Do NOT reference
zuvo:deploy rollback— that interface does not exist. Always print the concrete command with a real SHA, not a placeholder.
Step 3: Write Summary and Baseline Comparison
After all checks complete, write zuvo/reports/canary-<ISO-timestamp>/summary.json:
{
"url": "<url>",
"mode": "full" or "degraded",
"checks": <N>,
"avgResponseTime": <seconds>,
"p95ResponseTime": <seconds>, // computed from stored response times array: sort ascending, take value at index ceil(0.95 * N) - 1
"consoleErrors": <N>,
"consoleWarnings": <N>,
"verdict": "HEALTHY" or "DEGRADED" or "BROKEN",
"date": "<ISO-8601>"
}
Check if prior canary runs exist in zuvo/reports/canary-*/summary.json. If found, compare avgResponseTime and consoleErrors against the most recent prior run and compute percentage deltas for the Performance line.
If no prior run exists, omit the baseline comparison from the Performance line.
Step 4: Print CANARY COMPLETE Block
CANARY COMPLETE
URL: <url>
Duration: <duration> (<N> checks)
Mode: full (Playwright available) | degraded (HTTP-only)
Console: <N> errors, <N> warnings
Performance: avg <N>s (baseline: <N>s, +<N>%) | avg <N>s (no baseline)
Screenshots: zuvo/reports/canary-<ISO-timestamp>/ [full mode only — omit in degraded mode]
Verdict: HEALTHY | DEGRADED | BROKEN
Step 5: Append Run Log
Run: <ISO-8601-Z>\tcanary\t<project>\t-\t-\t<VERDICT>\t-\t<DURATION>\t<NOTES>\t<BRANCH>\t<SHA7>\t<INCLUDES>\t<TIER>
### Retrospective (REQUIRED)
Follow the retrospective protocol from `retrospective.md`.
Gate check → structured questions → TSV emit → markdown append.
If gate check skips: print "RETRO: skipped (trivial session)" and proceed.
After printing this block, append the `Run:` line value (without the `Run: ` prefix) to the log file path resolved per `run-logger.md`.
VERDICT mapping: HEALTHY → PASS, DEGRADED → WARN, BROKEN → FAIL.
DURATION: resolved monitoring duration (e.g., `one-shot`, `10m`, `30m`).
Edge Cases
| # | Condition | Handling |
|---|---|---|
| E11 | No browser MCP tools available | Degraded mode — HTTP-only. Annotate [DEGRADED: no browser tools]. |
| E12 | Non-interactive environment (Codex, Cursor) | Default to one-shot mode unless --duration is explicitly passed. Annotate [AUTO-DECISION]: defaulting to one-shot mode. |
| — | URL is missing | STOP with "URL is required" message. |
| — | Duration outside 1m–30m | STOP with range validation error. |
| — | First canary run (no baseline) | Omit baseline comparison from Performance line. |
| — | Screenshot directory doesn't exist | Create zuvo/reports/canary-{ISO-timestamp}/ before writing. |