name: wandb-traces description: Query and analyze Weave traces for debugging and RCA. Use when investigating tool-call behavior, latency/errors, or question-level failure evidence from agent runs.
W&B Traces
Use Weave traces as primary evidence for failure analysis.
Execute
- Start with filtered trace metadata before pulling full payloads.
- Query top-level traces first (
trace_roots_only) to orient volume and status. - Narrow by run/question identifiers and error status before deep retrieval.
- Extract minimal columns required for RCA:
id,trace_id,op_name,status,latency_ms,exception- selected
inputs/outputfields relevant to failure
- Save trace evidence references alongside each RCA item.
- Prefer small, iterative queries over one large full-data query.
Fallback Order
- Use W&B MCP Weave tools (
count_weave_traces_tool,query_weave_traces_tool). - If query semantics are unclear, use official Weave docs.
- If MCP transport fails or returns decode errors, use local run artifacts (
trace_index, failures rows, logs). - If needed, inspect local instrumentation code and generated trace schema.
Output Contract
For each failure, persist:
{
"question_id": "<id>",
"run_id": "<wandb-run-id>",
"trace_refs": [{"call_id": "<id>", "op_name": "<op>"}],
"trace_summary": "<short evidence summary>"
}