name: tom-auditor description: Audit a multi-agent system that claims "emergent coordination." Separates real ToM-enabled coordination from prompt-dressed illusion with control conditions, statistical tests, and complementarity measurement. title: "Tom Auditor" version: 1.0.0 phase: 16 lesson: 18 tags: [multi-agent, theory-of-mind, coordination, evaluation, emergence] category: tom-auditor audience: maintainer
Given a multi-agent system that claims emergent coordination, audit whether the coordination is real or an artifact of prompt engineering.
Produce:
- Claim extraction. What coordination behavior is being claimed? (division of labor, anticipation, complementary actions, consensus reaching). State it precisely.
- Prompt inspection. Does any agent's system prompt explicitly instruct coordination, role selection, or team awareness? If yes, flag the claim as partially prompt-dressed and design a control.
- Control condition. A version of the system with coordination-inducing language stripped. Specify exactly what text changes.
- Metric. At least one of: identity-linked differentiation, goal-directed complementarity, higher-order synergy (Riedl 2025). Do not accept "agents seem to work together" as evidence.
- Statistical test. Significance of the metric on system vs control. Sample size needed for
p < 0.05. Ifn < 50trials, report power explicitly. - Model-capacity check. Repeat the comparison on a smaller base model. Does the effect persist or vanish? Li/Riedl both show capacity-dependence.
- Failure-case review. When the system fails, what does the ToM state (if any) look like? Identity confusion (belief-agent binding broken) or content hallucination (wrong belief content)?
Hard rejects:
- Claims of emergence without a control condition. Demo reels are not evidence.
- Claims that vanish on statistical scrutiny (effect below
p < 0.05onn >= 50trials). These are coordination illusions. - Claims that hold on one model only. If a smaller strong baseline also achieves the effect without ToM prompting, the coordination is not ToM-driven.
- "Our agents just figured it out" as a mechanism explanation. Mechanism claims need the ToM state logged and inspectable.
Refusal rules:
- If the system has no logging of per-agent reasoning, the audit cannot distinguish real coordination from randomness. Recommend adding structured ToM-state logs before re-auditing.
- If the task has an oracle-computed optimal coordination, compare to optimal rather than control.
- If the claim is narrow ("coordination on single-round task"), the audit can be a shorter check: measure complementarity on the single round, no long-horizon analysis needed.
Output: a two-page audit. Start with a one-sentence verdict ("Coordination claim is prompt-dressed: removing 'work together' language drops the metric from 0.82 to 0.31, control-significant."), then the seven sections above. End with a list of fixes to convert prompt-dressed coordination into real coordination: explicit ToM state, longer horizons with logging, mixed-model ensembles.