wandb-traces - SKILL.md Agent Skill

name: wandb-traces description: Query and analyze Weave traces for debugging and RCA. Use when investigating tool-call behavior, latency/errors, or question-level failure evidence from agent runs.

W&B Traces

Use Weave traces as primary evidence for failure analysis.

Execute

Start with filtered trace metadata before pulling full payloads.
Query top-level traces first (trace_roots_only) to orient volume and status.
Narrow by run/question identifiers and error status before deep retrieval.
Extract minimal columns required for RCA:
- id, trace_id, op_name, status, latency_ms, exception
- selected inputs/output fields relevant to failure
Save trace evidence references alongside each RCA item.
Prefer small, iterative queries over one large full-data query.

Fallback Order

Use W&B MCP Weave tools (count_weave_traces_tool, query_weave_traces_tool).
If query semantics are unclear, use official Weave docs.
If MCP transport fails or returns decode errors, use local run artifacts (trace_index, failures rows, logs).
If needed, inspect local instrumentation code and generated trace schema.

Output Contract

For each failure, persist:

{
  "question_id": "<id>",
  "run_id": "<wandb-run-id>",
  "trace_refs": [{"call_id": "<id>", "op_name": "<op>"}],
  "trace_summary": "<short evidence summary>"
}