Downtime is expensive. You could just bypass your infra and manually get it work...

crehn · on Oct 31, 2021

That's in fact how most high-impact events should be handled: mitigate the issue with a potentially short-term solution; then, once things are back up, find the root cause, fix it, and perform a thorough analysis of events to ensure it won't happen again.

dilyevsky · on Oct 31, 2021

Depending on the level of automation, that may not be possible. That’s like saying that if a factory line robot fails, “you just bypass the line and manually weld those car bodies.”