infrastructure


Infrastructure debugging for Kubernetes and AWS. Use when investigating pod crashes, deployment issues, resource problems, container failures, or cloud infrastructure issues.

By incidentfox. Updated 2/26/2026

name: infrastructure
description: Infrastructure debugging for Kubernetes and AWS. Use when investigating pod crashes, deployment issues, resource problems, container failures, or cloud infrastructure issues.

Infrastructure Debugging

Available Domains

Kubernetes

For pod crashes, deployment issues, resource problems, container failures. Use: /infrastructure-kubernetes

AWS

For EC2, ECS, Lambda, and CloudWatch issues. Use: /infrastructure-aws

Quick Reference

Kubernetes Issues

# List pods in namespace
python .claude/skills/infrastructure-kubernetes/scripts/list_pods.py -n otel-demo

# Get pod events (ALWAYS check first!)
python .claude/skills/infrastructure-kubernetes/scripts/get_events.py <pod-name> -n otel-demo

# Get pod logs
python .claude/skills/infrastructure-kubernetes/scripts/get_logs.py <pod-name> -n otel-demo --tail 100
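
The three commands above share a script directory and a namespace flag, so they can be assembled programmatically when the pod name or namespace changes often. The following is a minimal sketch, not part of the skill itself: the helper name `triage_commands` and its defaults are hypothetical, and only the script paths and flags shown above are used. It builds the command lines without running them.

```python
import shlex

# Script directory copied from the quick-reference commands above.
SCRIPTS_DIR = ".claude/skills/infrastructure-kubernetes/scripts"


def triage_commands(pod_name, namespace="otel-demo", tail=100):
    """Return the quick-reference commands as argv lists, in the order shown above.

    Hypothetical helper for illustration; it only assembles the commands.
    """
    return [
        # 1. List pods in the namespace.
        ["python", f"{SCRIPTS_DIR}/list_pods.py", "-n", namespace],
        # 2. Get events for the pod.
        ["python", f"{SCRIPTS_DIR}/get_events.py", pod_name, "-n", namespace],
        # 3. Get the last `tail` lines of the pod's logs.
        ["python", f"{SCRIPTS_DIR}/get_logs.py", pod_name, "-n", namespace,
         "--tail", str(tail)],
    ]


if __name__ == "__main__":
    # "example-pod" is a placeholder pod name.
    for argv in triage_commands("example-pod"):
        print(shlex.join(argv))
```

Returning argv lists rather than strings keeps pod names with unusual characters safe if the commands are later passed to `subprocess.run`.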

Common Patterns

| Symptom | First Action | Script |
|---|---|---|
| Pod CrashLoopBackOff | Check events | get_events.py |
| Pod OOMKilled | Check resources | get_resources.py |
| Pod Pending | Check events + nodes | get_events.py |
| Deployment stuck | Check rollout history | get_history.py |
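
The symptom table above is effectively a lookup from symptom to first action and script. As a rough sketch under that reading, it can be expressed as a dictionary; the names `FIRST_ACTIONS` and `first_action` are hypothetical, and the arguments for `get_resources.py` and `get_history.py` are not shown in this document, so the sketch returns script names only.

```python
# Symptom -> (first action, script), copied from the Common Patterns table.
FIRST_ACTIONS = {
    "Pod CrashLoopBackOff": ("Check events", "get_events.py"),
    "Pod OOMKilled": ("Check resources", "get_resources.py"),
    "Pod Pending": ("Check events + nodes", "get_events.py"),
    "Deployment stuck": ("Check rollout history", "get_history.py"),
}


def first_action(symptom):
    """Look up a symptom case-insensitively.

    Returns (action, script), or None if the symptom is not in the table.
    Hypothetical helper for illustration only.
    """
    wanted = symptom.strip().lower()
    for known, entry in FIRST_ACTIONS.items():
        if known.lower() == wanted:
            return entry
    return None


if __name__ == "__main__":
    print(first_action("pod oomkilled"))  # ('Check resources', 'get_resources.py')
```
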
Install via CLI
npx skills add https://github.com/incidentfox/incidentfox --skill infrastructure
Repository Details
Stars: 629
Forks: 73
Branch: main
Path: SKILL.md