Imagine you had a tool that gets alerts from all o...
# observability
s
Imagine you had a tool that gets alerts from all of your tools (datadog, grafana, etc) and runs worksflows based on these alerts (one or more). What workflows would you build?
h
Main one would probably be automatic pod roll back
s
yea but shouldn’t be related to k8s
a
@Shahar Glazner we have built that with Prodvana - universal orchestration based on convergence for infra - you can plug anything in at the top level for what we call protections
https://docs.prodvana.io/docs/built-in-protections this is control flow for anything in your convergence loop
m
I would build reports and insights. Has someone received too many alerts and needs a break? Does the same alert keep firing off? Triage to find, create and assign the right people and teams needed with the incident. Some tools around this: https://www.shoreline.io/ - has Incident recovery automation around k8s https://www.transposit.com/ - flexible, but easily automatable incident management, reports and insights.