speed-run

star 1

Token-efficient code generation pipeline using hosted LLM. Triggers on "speed-run", "fast build", "turbo build", "use hosted LLM", "use cerebras". Routes to turbo (direct codegen), showdown (parallel competition), or any% (parallel exploration).

2389-research By 2389-research schedule Updated 2/8/2026

name: speed-run description: Token-efficient code generation pipeline using hosted LLM. Triggers on "speed-run", "fast build", "turbo build", "use hosted LLM", "use cerebras". Routes to turbo (direct codegen), showdown (parallel competition), or any% (parallel exploration).

Speed-Run

Token-efficient code generation pipeline. Uses hosted LLM (Cerebras) for fast, cheap first-pass generation. Claude handles architecture and surgical fixes.

Announce: "I'm using speed-run for token-efficient code generation."

Step 1: Check API Key (MANDATORY)

ALWAYS run this first:

mcp__speed-run__check_status

If status is error / key not set:

Speed-run requires a Cerebras API key for hosted code generation.

Setup (pick one):

Option A (recommended): Add to ~/.claude/settings.json:
  { "env": { "CEREBRAS_API_KEY": "your-key" } }

Option B: Add to ~/.zshrc:
  export CEREBRAS_API_KEY="your-key"

Get a free key at: https://cloud.cerebras.ai

Then restart Claude Code.

STOP here. Do not fall back to Claude direct generation — the whole point of speed-run is hosted LLM. If the user wants Claude direct, they should use test-kitchen instead.

Step 2: Route

If check_status returned OK, present options:

Speed-run ready! Cerebras API connected.

How would you like to proceed?

1. Turbo - Direct code generation (single task, fast)
   → Best for: one feature, algorithmic code, boilerplate
2. Showdown - Parallel competition (same design, multiple runners)
   → Best for: medium-high complexity, want best implementation
3. Any% - Parallel exploration (different approaches)
   → Best for: unsure of architecture, want to compare designs

Routing:

  • Option 1: Invoke speed-run:turbo
  • Option 2: Invoke speed-run:showdown
  • Option 3: Invoke speed-run:any-percent

Skill Dependencies

Subskill Description
speed-run:turbo Direct hosted codegen via contract prompts
speed-run:showdown Parallel same-design competition via hosted LLM
speed-run:any-percent Parallel approach exploration via hosted LLM

When to Use Speed-Run vs Test Kitchen

Situation Use
Want token savings on code generation Speed-run
Generating algorithmic code (parsers, state machines) Speed-run:turbo
Want parallel competition with fast generation Speed-run:showdown
Want to explore approaches with fast generation Speed-run:any%
No API key / don't want external LLM Test Kitchen
CRUD / simple operations Test Kitchen (Claude direct is cheaper)
Need 100% first-pass accuracy Test Kitchen
Install via CLI
npx skills add https://github.com/2389-research/speed-run --skill speed-run
Repository Details
star Stars 1
call_split Forks 0
navigation Branch main
article Path SKILL.md
More from Creator
2389-research
2389-research Explore all skills →