The biggest cause of downtime was deployment: either deploying code with a bug that wasn't caught in testing, or changing a real-time configuration parameter that wasn't properly scoped.
That was the main source, as opposed to a scaling issue that showed up later, a node failure, or any of the other things that could cause downtime.