The main one would probably be automatic pod rollback
Shahar Glazner
08/23/2023, 3:29 PM
Yeah, but it shouldn't be related to k8s
Andrew Fong
08/23/2023, 5:01 PM
@Shahar Glazner we have built that with Prodvana: universal orchestration for infra, based on convergence. You can plug anything in at the top level for what we call protections.
I would build reports and insights. Has someone received too many alerts and needs a break? Does the same alert keep firing? Then triage: find, create, and assign the right people and teams for the incident.
Some tools around this:
https://www.shoreline.io/ - has incident recovery automation around k8s
https://www.transposit.com/ - flexible but easily automatable incident management, with reports and insights
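To make the reports-and-insights idea concrete, here is a rough sketch of the two questions above (who is getting too many alerts, and which alert keeps firing). Everything in it is hypothetical: the `Alert` record, the `alert_insights` function, and the thresholds are assumptions for illustration, not the API of any of the tools mentioned.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Alert:
    # Hypothetical alert record; real tools will have richer schemas.
    name: str
    recipient: str
    fired_at: datetime


def alert_insights(alerts, now, window=timedelta(hours=24),
                   max_per_person=3, max_repeats=2):
    """Flag overloaded recipients and noisy alerts within a time window.

    The thresholds are illustrative assumptions, not recommendations.
    """
    # Only consider alerts that fired inside the lookback window.
    recent = [a for a in alerts if now - a.fired_at <= window]

    per_person = Counter(a.recipient for a in recent)
    per_alert = Counter(a.name for a in recent)

    return {
        # People who received more alerts than the threshold.
        "overloaded": sorted(p for p, n in per_person.items()
                             if n > max_per_person),
        # Alerts that fired more often than the threshold.
        "noisy_alerts": sorted(name for name, n in per_alert.items()
                               if n > max_repeats),
    }
```

Feeding this a day of alert history would give a short list of people who may need a break and alerts worth tuning or silencing; the triage step (assigning people and teams) would sit on top of that output.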