Outages like: |
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 17:37 (18 minutes)
|
Found similar: |
[backend1] cpu usage is over 99%
2021-05-07 14:41 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 14:25 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 14:16 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 12:41 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 12:22 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 10:41 (17 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 10:27 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 09:21 (3 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 08:41 (19 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 08:34 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 08:26 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 07:59 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 07:36 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 07:26 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 06:41 (19 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 06:02 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 05:53 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 05:43 (3 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 05:31 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 05:20 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 05:10 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 05:04 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 04:41 (18 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 04:26 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 04:21 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 04:12 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 04:06 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 03:45 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 03:33 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 03:27 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 03:20 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 03:13 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 02:41 (18 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 02:27 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 02:02 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 01:07 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-07 00:41 (19 minutes)
|
[backend1] cpu usage is over 99%
2021-05-07 00:08 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 23:59 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 23:42 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 22:40 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 22:25 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 22:16 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 22:05 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 21:45 (8 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 21:37 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 21:31 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 21:22 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 21:07 (10 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 20:41 (22 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 20:23 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 20:11 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 19:59 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 19:46 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 19:31 (7 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 19:20 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 18:41 (21 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 18:25 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 18:13 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 16:41 (13 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 15:31 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 14:41 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 13:30 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-06 13:11 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 12:41 (13 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 12:02 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 11:31 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 10:41 (14 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 10:26 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 08:41 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 08:28 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 08:08 (4 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 07:26 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 06:41 (20 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 06:12 (4 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 05:40 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 05:35 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 05:16 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 04:56 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 04:41 (10 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 04:30 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 04:18 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 04:10 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 03:25 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 02:41 (10 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 01:20 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 00:59 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-06 00:41 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-06 00:33 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 23:25 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 22:58 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 22:41 (13 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 22:34 (3 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 22:02 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 21:46 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 21:27 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 21:06 (13 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 20:40 (14 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 20:35 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 20:02 (1 minute)
|