Skip to content

Runbooks

Runbooks should be short, explicit, and time-aware.

POS unable to fetch catalog

Check:

  • API gateway health
  • catalog service health
  • cache and projection freshness
  • store and tenant scoping

Dark-store order cannot be fulfilled

Check:

  • store eligibility
  • orderability projection
  • stock truth for the specific location
  • picker availability

Inventory mismatch

Check:

  • latest source events
  • canonical stock positions
  • reservation state
  • projection freshness

Documentation site unavailable

Check:

  • Cloudflare Pages deployment status
  • custom domain routing
  • build output
  • static asset paths

Runbook Format Template

Title
Severity
Symptoms
Likely causes
Immediate checks
Recovery steps
Escalation path
Post-incident follow-up