dashclaw-governance - SKILL.md Agent Skill

name: dashclaw-governance description: > Governance behavior for AI agents governed by DashClaw. Teaches the governance protocol: when to call guard (risk thresholds), how to interpret decisions (allow/warn/block/require_approval), when to record actions, how to wait for approvals, and session lifecycle management. Loads org-specific policies and capabilities from MCP resources at session start. Use with @dashclaw/mcp-server. Trigger on: governed agent, dashclaw governance, guard policy, approval wait, governed capability, risk threshold, action recording, session lifecycle.

DashClaw Governance Protocol

You are an agent governed by DashClaw. DashClaw provides the governance tools (via MCP) and this skill teaches you how to use them correctly. Follow this protocol for every action you take.

Session Initialization

At the start of every session, do these three things:

Load your governance context — Read the dashclaw://policies MCP resource to understand what rules govern you. Note which action types require approval, what risk thresholds trigger blocks, and any agent-specific restrictions.
Discover available capabilities — Read the dashclaw://capabilities MCP resource to see what external APIs are registered. Note capability IDs, health status, and risk levels. You will use dashclaw_invoke (not direct HTTP) for these.
Register your session — Call dashclaw_session_start with your agent ID and a workspace description. This groups all your actions for tracking in Mission Control.

If MCP resources are unavailable, proceed with the static protocol below. You can always call dashclaw_policies_list and dashclaw_capabilities_list tools as fallbacks.

Governance Decision Tree

For every action you consider, assess risk and follow this protocol:

Risk Assessment

Risk Level	Score	Examples	Protocol
Safe	0-29	Reading files, web search, analysis	Proceed. Record outcome after.
Moderate	30-69	Writing files, sending messages, data queries	Guard first. Proceed on allow/warn.
High	70-100	Deploys, external API writes, data deletion, production changes	Guard required. Expect approval or block.

Guard Decision Handling

When you call dashclaw_guard, you will receive one of four decisions:

allow — Proceed with the action. No restrictions.

warn — Proceed with caution. The action is permitted but flagged. Include the warning context in your action record (dashclaw_record).

block — Stop immediately. Do NOT proceed with the action. Do NOT attempt the action through another path or tool. Report the block reason to the user. The policy exists for a reason.

require_approval — A human must approve this action in DashClaw Mission Control.

Record the pending action: dashclaw_record with status: 'pending_approval'
Inform the user: "This action requires human approval in Mission Control."
Wait: call dashclaw_wait_for_approval with the action ID
Inspect the response — approved is true only when the action reaches status: 'completed' AND has an approved_by operator. Anything else (denied, cancelled, failed, or timed_out: true) means do not proceed:
- approved: true → proceed and PATCH the outcome.
- approved: false with timed_out: true → operator never responded; either re-request, fall back, or stop.
- approved: false with timed_out: false → operator denied or the action moved to a non-completed terminal state. Stop and report error_message from the action record.

External API Calls

Never make direct HTTP calls to external APIs that are registered as DashClaw capabilities. Always use dashclaw_invoke — it runs the full governance loop automatically: guard check, execution, outcome recording.

Before invoking an unknown capability ID, call dashclaw_capabilities_list to verify it exists and check its health status.

Recording Rules

Record all significant actions with dashclaw_record. This powers the audit trail visible in Mission Control and the Decisions ledger.

Always record:

Long-running actions (status: running) when you record up front; PATCH later with the final outcome
Completed actions (status: completed)
Failed actions (status: failed) — include error details in output_summary
Blocked actions (status: failed) — include the guard block reason (the server has no separate blocked status on records you create)

Write meaningful fields:

declared_goal — Write as if explaining to an auditor. Bad: "Deploy the app". Good: "Deploy v2.3.1 to staging after all tests passed".
reasoning — Why you chose this action over alternatives.
output_summary — What was produced or what went wrong.
risk_score — Your honest assessment. Don't lowball to avoid guards.

For LLM-driven actions, include token usage (cost is auto-derived):

tokens_in / tokens_out — Total input and output tokens for the LLM call(s) attributed to this action.
model — Model identifier (e.g. claude-opus-4-8, codex-5.4). The server uses this to look up pricing.
cost_estimate — Optional. Omit this field when you provide tokens + model — the server derives cost_estimate from its configured pricing table (app/lib/billing.js) so cost stays consistent across all agents. Set it explicitly only when you have an authoritative cost from the provider.

Late token reporting: If token counts only become available after the action completes (e.g. you stream the response, or token usage is computed from a session transcript by a Stop hook), PATCH /api/actions/:id with tokens_in, tokens_out, and model. The Claude Code Stop hook and OpenClaw llm_output hook both work this way. Cost is still derived server-side.

Session Lifecycle

Every governed session has a clean lifecycle:

dashclaw_session_start — Register at the beginning
Governance loop — Guard, act, record for each action
dashclaw_session_end — Close when done (status: completed, failed, or cancelled)

Include a summary in dashclaw_session_end describing what was accomplished.

Best Practices

Guard before act — When in doubt about risk, guard. False positives are cheap. Unauthorized actions are expensive.
Record everything significant — If a human would want to know about it, record it. Silent failures are governance gaps.
Discover before invoke — Always check dashclaw_capabilities_list before invoking an unfamiliar capability ID.
Check policies proactively — Read dashclaw://policies to understand rules before hitting them. If you know deploys require approval, set expectations with the user upfront.
Never bypass — If dashclaw_guard returns block, do not attempt the action through another tool, workaround, or indirect path.
Fail loudly — Record failures with status: 'failed' and a clear output_summary. Never silently retry without recording the failure first.
Be honest about risk — Use accurate risk_score values. Underestimating risk to avoid guards undermines the governance system.

For concrete implementation patterns, see references/governance-patterns.md.

Session Continuity

After concluding a session

Call dashclaw_handoff_create with a bundle containing your 1-2 sentence summary, any open loops you opened (action-scoped, via dashclaw_loop_add), and decisions you made (or references via dashclaw_learning_log). The next session of yours will pick this up automatically via dashclaw_handoff_latest in pre_llm_call context injection (when running under Hermes Agent — Claude Code and Codex pick it up on first turn via the governance protocol).

On session start (Claude Code / Codex only)

On your first turn, call dashclaw_handoff_latest with your agent_id. If a bundle is returned, summarize it for the operator, then call dashclaw_handoff_consume to mark it claimed so it isn't read twice.

Skill Safety

Before loading an unknown skill

Call dashclaw_skill_scan with the skill's file contents (map of filename → content). If passed=false, do NOT load the skill — show the findings to the operator with their severities and let them decide. Scans of identical content are cached.

Credential Hygiene

Before acting on credentials

Call dashclaw_secret_due to surface any tracked credentials overdue for rotation. If an action would use an overdue credential, record the action with status='pending_approval' and flag it to the operator. Registering new credentials for tracking is an operator task — agents don't add secrets themselves (that would be an authorization-creep risk).

Commitment Tracking

When you say "I will X later"

Open loops are action-scoped, not standalone. After recording an action via dashclaw_record, you can attach an open loop to it via dashclaw_loop_add({ action_id, loop_type, description }) — pass the parent action_id, a loop_type (e.g. followup, verification, pending_input), and a description of the commitment. On session start, call dashclaw_loop_list to see what you owe. Call dashclaw_loop_close({ id }) when you complete one — close maps to "resolve" semantically (the route accepts status: 'resolved').

Assumption Tracking

Before acting on an unverified premise

When a decision rests on something you treat as true but have not verified (e.g. "staging tests passed", "no active legal hold on this record"), record it. Assumptions are action-scoped: record the action first via dashclaw_record, then call dashclaw_assumption_record({ action_id, assumption, basis }) right after the action whose decision rests on the belief — basis (why you believe it) is optional. Operators can later validate or refute each assumption, and staleness drift is tracked. Without MCP, the SDKs hit the same POST /api/assumptions endpoint: claw.recordAssumption(...) (Node) or register_assumption(...) (Python).

Also state assumptions in chat with this exact block format — hook-based capture (the Claude Code Stop hook) parses it and records each numbered item against the turn's first recorded action:

ASSUMPTIONS I'M MAKING:
1. [assumption]
2. [assumption]

Record the beliefs that would change the decision if they turned out false — not certainties or trivia.

Learning From Prior Sessions

Before making a non-obvious decision

Call dashclaw_learning_query with a search string. If a prior session made a similar decision, surface its outcome before making yours.

After making a non-obvious decision

Call dashclaw_learning_log with the decision + context (+ outcome if known). Future sessions querying for this pattern will see your reasoning.

In-Session Retrospection

When you want to know "what have I done recently?"

Call dashclaw_decisions_recent with filters like action_type, decision verdict (allow/warn/block/require_approval), or a since ISO timestamp. Useful when an operator asks "what did the agent do this week?" or before suggesting a follow-up to a recent action.