name: agent-environment-optimizer description: Audit an agent's execution environment for cold-start patterns, missing warm caches, stale dependencies, and session persistence gaps. Scores against best practices from OpenAI Hosted Shell and METR research showing unoptimized environments negate AI productivity gains. Use when the user says "optimize agent environment", "cold start audit", "agent environment check", "why is my agent slow to start", "warm cache audit", "session persistence check", or wants to speed up agent execution by fixing the environment layer.
Agent Environment Optimizer
Audit whether an agent's execution environment follows persistent-environment best practices. Flags cold-start patterns that silently eat productivity gains and suggests fixes.
Source
Nate's Newsletter, 2026-04-16. References OpenAI's Hosted Shell pattern and METR study showing unmodified environments negate AI gains.
Trigger
Use when the user asks about agent startup speed, execution environment optimization, cold-start problems, or why their agent takes so long before doing useful work.
Prerequisites
- Target environment must be accessible (local machine, SSH target, or container)
- Works for Claude Code, the agent platform agents, or any LLM agent setup
Phase 1: Environment Discovery
Identify the agent's execution context:
- Runtime type: local process, container, SSH remote, cloud VM
- Session model: ephemeral (fresh each run) vs. persistent (long-lived process)
- Entry point: how the agent starts (CLI, daemon, cron, webhook)
Gather data:
# Language runtimes and versions
python3 --version 2>/dev/null; node --version 2>/dev/null
# Package manager caches
ls -la ~/.cache/pip/ 2>/dev/null; ls -la ~/.npm/_cacache/ 2>/dev/null
# Build caches
ls -la ~/.cache/pre-commit/ 2>/dev/null
# Git state
git status 2>/dev/null | head -5
Phase 2: 6-Dimension Assessment
Score each dimension 1-5 (1 = fully cold, 5 = fully warm):
D1: Dependency Availability
- All package dependencies pre-installed (not installed at runtime)
- Lock files present and up to date
- No
pip install/npm installin startup path - Virtual environments pre-built and activated
D2: Compilation Cache
- Build artifacts cached between runs (dist/, pycache, .next/)
- TypeScript compilation cache warm
- Pre-commit hook environments pre-built
- No "first run" compilation penalty
D3: Context Pre-loading
- CLAUDE.md / system prompts loaded without file I/O at query time
- Memory/context databases pre-warmed
- MCP server connections established before first tool call
- Relevant project files indexed or cached
D4: Auth Persistence
- API tokens loaded from environment, not fetched per-request
- SSH keys pre-loaded in agent
- OAuth tokens refreshed proactively, not on-demand
- No interactive auth prompts in automated flows
D5: Session Continuity
- Agent state survives process restarts
- Conversation history persisted to disk/DB
- Work-in-progress checkpointed (not lost on crash)
- Session ID / context carried across invocations
D6: Tool Readiness
- CLI tools on PATH without activation steps
- MCP servers pre-started (not cold-started per query)
- Database connections pooled
- File watchers / indexes pre-built
Phase 3: Cold-Start Pattern Detection
Flag these specific anti-patterns:
| Pattern | Signal | Impact |
|---|---|---|
| Install-on-boot | pip install / npm install in entrypoint |
30-120s added per start |
| Auth-on-first-call | Token fetch in first tool invocation | 2-10s + potential failure |
| Index-on-demand | File indexing triggered by first search | 5-60s depending on repo size |
| Cache-miss cascade | No pycache, no .next/, no node_modules/.cache | Cumulative 10-30s |
| Context-reload | Full CLAUDE.md chain re-parsed every message | Token waste per turn |
| Ephemeral workspace | /tmp or container with no volume mount | All state lost between runs |
Phase 4: Optimization Report
Produce a scorecard:
Agent Environment Score: XX/30
D1 Dependency Availability: X/5
D2 Compilation Cache: X/5
D3 Context Pre-loading: X/5
D4 Auth Persistence: X/5
D5 Session Continuity: X/5
D6 Tool Readiness: X/5
Cold-Start Patterns Found: N
Estimated startup overhead: ~Xs
Then list fixes ranked by impact:
#1 FIX: [Pattern name]
Current: [what happens now]
Target: [what should happen]
How: [specific command or config change]
Saves: ~Xs per agent start
Phase 5: Implementation Checklist
Generate a concrete checklist the user can execute:
- Fix #1: [specific action]
- Fix #2: [specific action]
- Fix #3: [specific action]
- Re-run this audit to verify score improvement
Verification
- All 6 dimensions assessed with evidence (not assumed)
- Cold-start patterns backed by actual file/config checks, not guesses
- Fix recommendations are specific enough to execute without further research
- Estimated time savings are conservative (under-promise)