agent-environment-optimizer

star 2

Audit an agent's execution environment for cold-start patterns, missing warm caches, stale dependencies, and session persistence gaps. Scores against best practices from OpenAI Hosted Shell and METR research showing unoptimized environments negate AI productivity gains. Use when the user says "optimize agent environment", "cold start audit", "agent environment check", "why is my agent slow to start", "warm cache audit", "session persistence check", or wants to speed up agent execution by fixing the environment layer.

m2ai-portfolio By m2ai-portfolio schedule Updated 6/4/2026

name: agent-environment-optimizer description: Audit an agent's execution environment for cold-start patterns, missing warm caches, stale dependencies, and session persistence gaps. Scores against best practices from OpenAI Hosted Shell and METR research showing unoptimized environments negate AI productivity gains. Use when the user says "optimize agent environment", "cold start audit", "agent environment check", "why is my agent slow to start", "warm cache audit", "session persistence check", or wants to speed up agent execution by fixing the environment layer.

Agent Environment Optimizer

Audit whether an agent's execution environment follows persistent-environment best practices. Flags cold-start patterns that silently eat productivity gains and suggests fixes.

Source

Nate's Newsletter, 2026-04-16. References OpenAI's Hosted Shell pattern and METR study showing unmodified environments negate AI gains.

Trigger

Use when the user asks about agent startup speed, execution environment optimization, cold-start problems, or why their agent takes so long before doing useful work.

Prerequisites

  • Target environment must be accessible (local machine, SSH target, or container)
  • Works for Claude Code, the agent platform agents, or any LLM agent setup

Phase 1: Environment Discovery

Identify the agent's execution context:

  1. Runtime type: local process, container, SSH remote, cloud VM
  2. Session model: ephemeral (fresh each run) vs. persistent (long-lived process)
  3. Entry point: how the agent starts (CLI, daemon, cron, webhook)

Gather data:

# Language runtimes and versions
python3 --version 2>/dev/null; node --version 2>/dev/null
# Package manager caches
ls -la ~/.cache/pip/ 2>/dev/null; ls -la ~/.npm/_cacache/ 2>/dev/null
# Build caches
ls -la ~/.cache/pre-commit/ 2>/dev/null
# Git state
git status 2>/dev/null | head -5

Phase 2: 6-Dimension Assessment

Score each dimension 1-5 (1 = fully cold, 5 = fully warm):

D1: Dependency Availability

  • All package dependencies pre-installed (not installed at runtime)
  • Lock files present and up to date
  • No pip install / npm install in startup path
  • Virtual environments pre-built and activated

D2: Compilation Cache

  • Build artifacts cached between runs (dist/, pycache, .next/)
  • TypeScript compilation cache warm
  • Pre-commit hook environments pre-built
  • No "first run" compilation penalty

D3: Context Pre-loading

  • CLAUDE.md / system prompts loaded without file I/O at query time
  • Memory/context databases pre-warmed
  • MCP server connections established before first tool call
  • Relevant project files indexed or cached

D4: Auth Persistence

  • API tokens loaded from environment, not fetched per-request
  • SSH keys pre-loaded in agent
  • OAuth tokens refreshed proactively, not on-demand
  • No interactive auth prompts in automated flows

D5: Session Continuity

  • Agent state survives process restarts
  • Conversation history persisted to disk/DB
  • Work-in-progress checkpointed (not lost on crash)
  • Session ID / context carried across invocations

D6: Tool Readiness

  • CLI tools on PATH without activation steps
  • MCP servers pre-started (not cold-started per query)
  • Database connections pooled
  • File watchers / indexes pre-built

Phase 3: Cold-Start Pattern Detection

Flag these specific anti-patterns:

Pattern Signal Impact
Install-on-boot pip install / npm install in entrypoint 30-120s added per start
Auth-on-first-call Token fetch in first tool invocation 2-10s + potential failure
Index-on-demand File indexing triggered by first search 5-60s depending on repo size
Cache-miss cascade No pycache, no .next/, no node_modules/.cache Cumulative 10-30s
Context-reload Full CLAUDE.md chain re-parsed every message Token waste per turn
Ephemeral workspace /tmp or container with no volume mount All state lost between runs

Phase 4: Optimization Report

Produce a scorecard:

Agent Environment Score: XX/30

D1 Dependency Availability:  X/5
D2 Compilation Cache:        X/5
D3 Context Pre-loading:      X/5
D4 Auth Persistence:         X/5
D5 Session Continuity:       X/5
D6 Tool Readiness:           X/5

Cold-Start Patterns Found: N
Estimated startup overhead: ~Xs

Then list fixes ranked by impact:

#1 FIX: [Pattern name]
   Current: [what happens now]
   Target:  [what should happen]
   How:     [specific command or config change]
   Saves:   ~Xs per agent start

Phase 5: Implementation Checklist

Generate a concrete checklist the user can execute:

  • Fix #1: [specific action]
  • Fix #2: [specific action]
  • Fix #3: [specific action]
  • Re-run this audit to verify score improvement

Verification

  • All 6 dimensions assessed with evidence (not assumed)
  • Cold-start patterns backed by actual file/config checks, not guesses
  • Fix recommendations are specific enough to execute without further research
  • Estimated time savings are conservative (under-promise)
Install via CLI
npx skills add https://github.com/m2ai-portfolio/m2ai-skills-pack --skill agent-environment-optimizer
Repository Details
star Stars 2
call_split Forks 1
navigation Branch main
article Path SKILL.md
More from Creator
m2ai-portfolio
m2ai-portfolio Explore all skills →