tom-auditor - SKILL.md Agent Skill

name: tom-auditor description: Audit a multi-agent system that claims "emergent coordination." Separates real ToM-enabled coordination from prompt-dressed illusion with control conditions, statistical tests, and complementarity measurement. title: "Tom Auditor" version: 1.0.0 phase: 16 lesson: 18 tags: [multi-agent, theory-of-mind, coordination, evaluation, emergence] category: tom-auditor audience: maintainer

Given a multi-agent system that claims emergent coordination, audit whether the coordination is real or an artifact of prompt engineering.

Produce:

Claim extraction. What coordination behavior is being claimed? (division of labor, anticipation, complementary actions, consensus reaching). State it precisely.
Prompt inspection. Does any agent's system prompt explicitly instruct coordination, role selection, or team awareness? If yes, flag the claim as partially prompt-dressed and design a control.
Control condition. A version of the system with coordination-inducing language stripped. Specify exactly what text changes.
Metric. At least one of: identity-linked differentiation, goal-directed complementarity, higher-order synergy (Riedl 2025). Do not accept "agents seem to work together" as evidence.
Statistical test. Significance of the metric on system vs control. Sample size needed for p < 0.05. If n < 50 trials, report power explicitly.
Model-capacity check. Repeat the comparison on a smaller base model. Does the effect persist or vanish? Li/Riedl both show capacity-dependence.
Failure-case review. When the system fails, what does the ToM state (if any) look like? Identity confusion (belief-agent binding broken) or content hallucination (wrong belief content)?

Hard rejects:

Claims of emergence without a control condition. Demo reels are not evidence.
Claims that vanish on statistical scrutiny (effect below p < 0.05 on n >= 50 trials). These are coordination illusions.
Claims that hold on one model only. If a smaller strong baseline also achieves the effect without ToM prompting, the coordination is not ToM-driven.
"Our agents just figured it out" as a mechanism explanation. Mechanism claims need the ToM state logged and inspectable.

Refusal rules:

If the system has no logging of per-agent reasoning, the audit cannot distinguish real coordination from randomness. Recommend adding structured ToM-state logs before re-auditing.
If the task has an oracle-computed optimal coordination, compare to optimal rather than control.
If the claim is narrow ("coordination on single-round task"), the audit can be a shorter check: measure complementarity on the single round, no long-horizon analysis needed.

Output: a two-page audit. Start with a one-sentence verdict ("Coordination claim is prompt-dressed: removing 'work together' language drops the metric from 0.82 to 0.31, control-significant."), then the seven sections above. End with a list of fixes to convert prompt-dressed coordination into real coordination: explicit ToM state, longer horizons with logging, mixed-model ensembles.