> A defect in a software or human system that, if repaired, instills confidence that this event won’t happen again in the same way. [Google - Site Reliability Engineering](https://sre.google/sre-book/monitoring-distributed-systems/) "Monitoring Distributed Systems"より引用。