trial-costs

star 240

Sum agent-trial spend ($) from TB3 PR comments and Modal compute spend. Reports totals by kind (/run vs /cheat) and provider (anthropic, openai, ...), top-spender PRs, per-task latest trial cost, and daily Modal billing. Use when the user asks how much was spent on agent trials, cost breakdown by provider, PR-level spend, or Modal usage.

harbor-framework By harbor-framework schedule Updated 5/16/2026

name: trial-costs description: Sum agent-trial spend ($) from TB3 PR comments and Modal compute spend. Reports totals by kind (/run vs /cheat) and provider (anthropic, openai, ...), top-spender PRs, per-task latest trial cost, and daily Modal billing. Use when the user asks how much was spent on agent trials, cost breakdown by provider, PR-level spend, or Modal usage. allowed-tools: Bash

Run the trial-costs tool and summarize the output for the user.

Default invocation (last 7 days):

uv run tools/trial-costs/trial_costs.py

Flags:

  • --days N — change window (default 7)
  • --since YYYY-MM-DD — explicit start date
  • --top-threshold N — top-spender PR threshold in dollars (default 100)
  • --no-modal — skip the Modal billing report at the end

Pass through whatever window the user asked for. If they didn't specify, default to 7 days.

After running, summarize for the user:

  • Grand total and split by /run vs /cheat
  • Per-provider totals
  • Top-spender PRs above the threshold (with titles)
  • Per-task latest /run trial cost table (one row per PR)

Keep the summary tight — the script already prints formatted tables; don't re-format them all, just lift the key numbers and mention any notable patterns (e.g. one PR responsible for a large fraction).

Install via CLI
npx skills add https://github.com/harbor-framework/terminal-bench-3 --skill trial-costs
Repository Details
star Stars 240
call_split Forks 289
navigation Branch main
article Path SKILL.md
More from Creator
harbor-framework
harbor-framework Explore all skills →