inference-scaling

name: inference-scaling description: "Use when the user wants to run inference-time scaling on a prompt — detect environment, execute scaling, and present results. For algorithm selection, budget tuning, reward models, and troubleshooting, consult the inference-scaling-guide skill." allowed-tools: ["Bash(${CLAUDE_PLUGIN_ROOT}/scripts/its_scale.sh:)", "Bash(${CLAUDE_PLUGIN_ROOT}/scripts/its_detect.sh:)"]

Execute scaling on a prompt. For algorithm guidance, budget tuning, and troubleshooting, consult the inference-scaling-guide skill.

"${CLAUDE_PLUGIN_ROOT}/scripts/its_detect.sh"

Proceed to Step 2.

Run the scaling script with the user's prompt and any overrides:

"${CLAUDE_PLUGIN_ROOT}/scripts/its_scale.sh" --metadata $ARGUMENTS

If the user provides a file path (e.g., "scale all prompts in data/eval.jsonl"), invoke the batch-scaling skill instead.

Selected response — Show the winning response prominently
Metadata (if available):
- Self-consistency: show vote counts ("Selected by majority vote — 5/8 responses agreed")
- Best-of-N: show scores ("Selected as highest scoring — score: 0.92 out of 8 candidates")
Configuration used — algorithm, budget, model (briefly)

If the scaling failed, consult the inference-scaling-guide skill for troubleshooting.