[Google - Site Reliability Engineering](https://sre.google/workbook/postmortem-analysis/)
#### top 8 outage triggers, 2010-2017
- Binary push: 37%
- Configuration push: 31%
- User behavior change: 9%
- Processing pipeline: 6%
- Service provider change: 5%
- Performance decay: 5%
- Capacity management: 5%
- Hardware: 2%
#### top 5 root-cause categories
- Software: 41.35%
- Development process failure: 20.23%
- Complex system behaviors: 16.90%
- Deployment planning: 6.74%
- Network failure: 2.75%
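
A quick sanity check on the two lists above, as a minimal Python sketch. The figures are copied from the lists; the dictionary names and the `combined_share` helper are hypothetical and exist only for this illustration. It adds up how much of the trigger list comes from pushes (binary plus configuration) and how much of the root-cause total the top five categories leave uncovered.

```python
# Percentages copied from the two lists above.
# All names below are illustrative, not taken from the SRE Workbook.
triggers = {
    "Binary push": 37,
    "Configuration push": 31,
    "User behavior change": 9,
    "Processing pipeline": 6,
    "Service provider change": 5,
    "Performance decay": 5,
    "Capacity management": 5,
    "Hardware": 2,
}

root_causes = {
    "Software": 41.35,
    "Development process failure": 20.23,
    "Complex system behaviors": 16.90,
    "Deployment planning": 6.74,
    "Network failure": 2.75,
}


def combined_share(table, names):
    """Sum the percentages of the named categories, rounded to 2 decimals."""
    return round(sum(table[name] for name in names), 2)


# Share of outages triggered by a push of either kind.
push_share = combined_share(triggers, ["Binary push", "Configuration push"])

# Sum of all eight listed triggers.
trigger_total = combined_share(triggers, triggers.keys())

# The top five root-cause categories do not add up to 100%;
# the remainder belongs to categories not listed here.
top5_total = combined_share(root_causes, root_causes.keys())
unlisted_share = round(100 - top5_total, 2)

print(f"Pushes (binary + configuration): {push_share}%")
print(f"All listed triggers: {trigger_total}%")
print(f"Top 5 root causes: {top5_total}%")
print(f"Root causes outside the top 5: {unlisted_share}%")
```

On these numbers, pushes account for roughly two thirds of the listed triggers, and about 12% of root causes fall outside the top five categories.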
[[📘Site Reliability Workbook]]