Alerts and Dashboards
Grafana
https://grafana.il.unibeam.com/
Severity of Alerts
High: may affect system performance and needs immediate attention (phone call).
Medium: may be caused by incorrect business usage (e.g. 400 Bad Request).
Dashboards
AWS ALB Cloudwatch Metrics
Load balancer metrics. Select the sia-service load balancer (LB).
Alerts:
| Name | Data source / panel | Description | Severity | Threshold | Recipients |
|---|---|---|---|---|---|
| ELB 500 Errors | ELB 500 Errors | alerts on the HTTP 500 error rate | High | 20 in 5m | slack/emails [roi,alex...] |
| ELB 400 Errors | ELB 400 Errors | alerts on the HTTP 400 error rate | Medium | 20 in 5m | slack/emails [roi,alex...] |
| Latency | HTTP latency | alerts when latency exceeds 200 ms | High | 20 in 5m | slack/emails [roi,alex...] |
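
To make the "20 in 5m" threshold concrete, here is a minimal Python sketch of a sliding-window count check. It is only an illustration of the threshold semantics, not how Grafana evaluates these alerts. The class name `WindowedCountAlert`, its fields, and the reading of "20 in 5m" as "at least 20 matching events within any 5-minute window" are all assumptions made for this example.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class WindowedCountAlert:
    """Hypothetical helper: fires when `threshold` events fall inside the window."""

    name: str
    severity: str
    threshold: int = 20            # assumed meaning of "20" in "20 in 5m"
    window_seconds: float = 300.0  # assumed meaning of "5m"
    _events: deque = field(default_factory=deque, repr=False)

    def record(self, timestamp: float) -> bool:
        """Record one matching event and return True if the alert is firing."""
        self._events.append(timestamp)
        # Drop events that are no longer inside the sliding window.
        cutoff = timestamp - self.window_seconds
        while self._events and self._events[0] <= cutoff:
            self._events.popleft()
        return len(self._events) >= self.threshold


# Example: one HTTP 500 response every 10 seconds.
alert = WindowedCountAlert(name="ELB 500 Errors", severity="High")
firing = [alert.record(float(t)) for t in range(0, 200, 10)]
```

With these assumed semantics the alert stays quiet for the first 19 events and fires on the 20th. For the Latency row, an "event" would be a single request slower than 200 ms.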