Anomaly detection: business metrics vs. system metrics?
3 points| chipfixer | 10 months ago
What may have been helpful is anomaly detection directly on their business metrics, with system metrics helping explain root cause only once real customer/business impact is detected.
Curious to hear: how much does your org prioritize monitoring business metrics (not just system metrics)? If you do, what tools do you use?
nchinmay|10 months ago
Larger, incident-worthy changes in metrics are also easier to set static thresholds around, and they ring more than one bell when they occur. I'd be more concerned about small to mid-sized deviations from the trend, say a sudden ±10% change in my business metrics over X minutes. Can I reliably set a static threshold that will be appropriate everywhere? A good anomaly detector would ideally bring something like this to attention without hard-coded alert configs.
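To make the static-threshold concern concrete, here is a minimal sketch of one common approach: compare each new point to a trailing window using the median and the median absolute deviation (MAD). This is an illustration only, not how any particular tool works. The function name, the window size, the 3.5 cutoff, and the synthetic "orders per minute" series are all assumptions made up for the example.

```python
from collections import deque
from statistics import median


def detect_anomalies(values, window=30, threshold=3.5):
    """Return indices whose value deviates strongly from the trailing window.

    Hypothetical helper: scores each point by a robust z-score
    (distance from the window median, scaled by the window MAD).
    """
    history = deque(maxlen=window)
    anomalies = []
    for i, x in enumerate(values):
        if len(history) == window:  # only score once the baseline is full
            med = median(history)
            mad = median(abs(v - med) for v in history)
            # 1.4826 makes MAD comparable to a standard deviation for
            # normally distributed data; the tiny floor avoids dividing by 0.
            scale = 1.4826 * mad or 1e-9
            if abs(x - med) / scale > threshold:
                anomalies.append(i)
        history.append(x)
    return anomalies


# Synthetic series (made up): about 1000 orders/minute with a small
# repeating wiggle, then a sustained 10% drop starting at minute 45.
values = [1000 + (i % 5) * 4 - 8 for i in range(60)]
values = [v * 0.9 if i >= 45 else v for i, v in enumerate(values)]

anomalies = detect_anomalies(values)
print(anomalies[0])  # 45: the first minute of the drop

# A static "alert below 800" rule never fires on this series.
static_alerts = [i for i, v in enumerate(values) if v < 800]
print(static_alerts)  # []
```

The baseline adapts to whatever level the metric normally sits at, so the same config covers metrics of different magnitudes. The trade-off is that a sustained shift eventually becomes the new baseline and stops being flagged, and seasonality (daily or weekly cycles) would need handling that this sketch does not attempt.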
poobear22|10 months ago
chipfixer|10 months ago