[Google - Site Reliability Engineering](https://sre.google/workbook/postmortem-analysis/)

#### top 8 outage triggers, 2010-2017

- Binary push: 37%
- Configuration push: 31%
- User behavior change: 9%
- Processing pipeline: 6%
- Service provider change: 5%
- Performance decay: 5%
- Capacity management: 5%
- Hardware: 2%

#### top 5 root-cause categories

- Software: 41.35%
- Development process failure: 20.23%
- Complex system behaviors: 16.90%
- Deployment planning: 6.74%
- Network failure: 2.75%

[[📘Site Reliability Workbook]]