Content
View differences
Updated by Kabiru Mwenja 6 months ago
* How can we see live performance?
* How can we stress test?
* Create an incidence response playbook for DevOps- have them review before holidays
* A runbook typically contains:
* **Common runtime issues/errors** - Specific error messages, symptoms, or failure modes
* **Severity levels** - Impact classification (critical, high, medium, low)
* **Diagnostic steps** - How to identify and confirm the issue
* **Resolution procedures** - Step-by-step remediation instructions
* **Escalation paths** - When and how to escalate to developers or other teams
* **Monitoring/alerting details** - What metrics or logs to check
* How can we stress test?
* Create an incidence response playbook for DevOps- have them review before holidays
* A runbook typically contains:
* **Common runtime issues/errors** - Specific error messages, symptoms, or failure modes
* **Severity levels** - Impact classification (critical, high, medium, low)
* **Diagnostic steps** - How to identify and confirm the issue
* **Resolution procedures** - Step-by-step remediation instructions
* **Escalation paths** - When and how to escalate to developers or other teams
* **Monitoring/alerting details** - What metrics or logs to check