incident-response

star 0

How Apollo.io handles incidents — detection, response, escalation, and postmortem

pbsull By pbsull schedule Updated 3/1/2026

name: incident-response description: How Apollo.io handles incidents — detection, response, escalation, and postmortem

Apollo Incident Response

Severity Levels

Severity Definition Response Time
SEV-1 Customer-facing outage or data loss Immediate
SEV-2 Significant degradation, limited blast radius 15 min
SEV-3 Minor issue, workaround available 1 hour
SEV-4 Low impact, no customer effect Next business day

How to Declare an Incident

  1. Post in #incidents Slack channel
  2. Page on-call via PagerDuty (or however you do it)
  3. Join the incident bridge: [link or instructions]
  4. Assign an Incident Commander (IC)

On-Call Rotation

  • Rotations managed via [PagerDuty / OpsGenie / other]
  • Each team has its own on-call schedule
  • Escalation path: on-call engineer → team lead → engineering manager

During an Incident

  • IC coordinates communication, not necessarily the fix
  • Update #incidents every 15 min for SEV-1/2
  • Post customer-facing status updates to [status page]
  • Keep a running timeline in the incident doc

Postmortem Process

  1. Postmortem doc created within 48 hours of resolution
  2. Blameless — focus on systems and processes, not people
  3. Reviewed in weekly incident review meeting
  4. Action items tracked to completion

Contacts

  • #incidents — active incident coordination
  • #postmortems — postmortem docs and reviews
  • On-call escalation: [PagerDuty link or escalation path]
Install via CLI
npx skills add https://github.com/pbsull/apollo-claude-plugin --skill incident-response
Repository Details
star Stars 0
call_split Forks 0
navigation Branch main
article Path SKILL.md
More from Creator