cost - SKILL.md Agent Skill

name: cost description: > Estimate and analyze costs for AI/cloud services, infrastructure, or feature implementations. Use when user asks "how much will this cost", "cost estimate", "cost analysis", "what's the pricing", "budget for this", or wants to understand the financial impact of a technical decision. Covers LLM API costs, cloud hosting, and SaaS tools.

Cost Skill

LLM API Pricing (as of early 2026)

Always verify current prices at the provider's pricing page - these change frequently.

Anthropic Claude

Model	Input (per 1M tokens)	Output (per 1M tokens)
Claude Opus 4	~$15	~$75
Claude Sonnet 4	~$3	~$15
Claude Haiku 3.5	~$0.80	~$4

OpenAI

Model	Input (per 1M tokens)	Output (per 1M tokens)
GPT-4o	~$2.50	~$10
GPT-4o mini	~$0.15	~$0.60

Google Gemini

Model	Input (per 1M tokens)	Output (per 1M tokens)
Gemini 2.5 Pro	~$1.25	~$10
Gemini 2.5 Flash	~$0.15	~$0.60

Token Estimation

Rough approximations:

1 token ≈ 4 characters ≈ 0.75 words
1 page of text ≈ 750 tokens
Average email ≈ 200-400 tokens
Short code file ≈ 500-2000 tokens

Cost Calculation

def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_million: float,
    output_price_per_million: float
) -> float:
    input_cost = (input_tokens / 1_000_000) * input_price_per_million
    output_cost = (output_tokens / 1_000_000) * output_price_per_million
    return input_cost + output_cost

# Example: 1000 requests with 500 input tokens, 200 output tokens each
# Using Claude Sonnet ($3 in, $15 out per 1M)
total_input = 1000 * 500   # 500K tokens
total_output = 1000 * 200  # 200K tokens
cost = estimate_cost(total_input, total_output, 3, 15)
# = $0.15 + $3.00 = $3.15 for 1000 requests

Cloud Hosting (rough estimates)

Service	Small	Medium	Large
Vercel (hobby)	Free	$20/mo	$150+/mo
Render (web service)	$7/mo	$25/mo	$85+/mo
Hetzner VPS	€5/mo	€20/mo	€80+/mo
AWS EC2 (t3)	$15/mo	$60/mo	$250+/mo

Analysis Framework

When asked to estimate costs:

Identify usage patterns: How many requests/day? Batch or real-time?
Estimate token counts: Input context size + expected output size
Calculate monthly volume: requests/day × 30
Apply pricing: Use the table above or current pricing page
Add buffer: Add 20-30% for variance and growth
Compare alternatives: Show cost at different models/providers

Output Format

COST ESTIMATE: [Feature/Service]

Assumptions:
- [usage volume]
- [average token counts]
- [model/provider]

Monthly estimate:
- API calls: [N] × $[X] = $[Y]/month
- Hosting: $[Z]/month
- Total: ~$[TOTAL]/month

At scale (10x):
- Total: ~$[10x TOTAL]/month

Recommendation: [cheapest option that meets requirements]