SECTION D: Playbook
P1 Incident Response
- Acknowledge
- Assess impact
- Communicate
- Triage
- Fix
- Validate
- Post-incident RCA
Common KQL Queries
// Failed logins
SigninLogs
| where ResultType != 0
// High CPU VMs
Perf
| summarize avg(CounterValue) by Computer
// NSG denied traffic
AzureDiagnostics
| where action_s == "Deny"
// HTTP 5xx errors
AppServiceHTTPLogs
| where ScStatus >= 500
Alert Tuning
- Review last 30 days alerts
- Identify false positives
- Adjust thresholds
- Group alerts
- Review monthly