Skip to main content

Runbook: Incidents

Document type: DevOps runbook
Doc ID: DEVOPS-RUNBOOK-INCIDENTS
Status: Final v0.1
Release date: December 21, 2025
Author: Nicolas Turcotte, Founder
Source repo: dcorps-docs-public (docs/devops/RUNBOOK-INCIDENTS.md)
Last updated: 2026-01-25

Scope: Operational incident steps aligned to docs/security/INCIDENT-RESPONSE.md.


1. Triage

  • Classify severity (low, medium, high, critical).
  • Confirm scope: chain, indexer, wallet UX, or external dependency.
  • Capture minimum incident facts:
    • start time (UTC) and detection source
    • impacted components and users
    • suspected root cause (hypothesis)
    • current mitigations and next steps

2. Containment

  • Isolate impacted services.
  • Notify operators and critical service owners.
  • Document immediate mitigations.
  • Avoid publishing attacker-enabling details until remediation is deployed (see docs/policy/POL-DOCS-PUBLICATION.md).

3. Recovery

  • Deploy fixes or configuration changes.
  • Validate block production and API health.
  • Monitor for recurrence.
  • When applicable, rehearse fixes on staging before production rollout.

4. Post-incident

  • Publish a summary when safe.
  • Log the incident and follow-up actions.
  • Update threat model and runbooks as needed.