Outages like: |
[nosql2] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 18:01 (1 minute)
|
Found similar: |
[backend1] cpu usage is over 99%
2021-05-05 20:02 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 19:48 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 19:33 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 19:19 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 18:41 (19 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 16:41 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 15:53 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 15:39 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 14:42 (17 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 14:19 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 13:38 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 12:41 (12 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 10:41 (13 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 10:26 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 08:41 (10 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 06:35 (19 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 06:20 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 05:59 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 05:08 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 04:37 (18 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 04:26 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 04:13 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 03:58 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 03:46 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 03:40 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 03:31 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 03:06 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 02:41 (17 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 02:30 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 02:01 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 01:52 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-05 00:41 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-05 00:03 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 22:41 (22 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 22:31 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 22:03 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 21:38 (4 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 21:21 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 21:07 (5 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 20:41 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 20:35 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 20:25 (3 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 20:19 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 19:51 (2 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 19:38 (1 minute)
|
[backend1] cpu usage is over 99%
2021-05-04 18:41 (18 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 16:41 (12 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 14:42 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 12:42 (14 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 10:41 (16 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 08:41 (12 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 06:42 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 04:41 (13 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 02:41 (16 minutes)
|
[backend1] cpu usage is over 99%
2021-05-04 00:41 (14 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 22:41 (15 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 20:42 (12 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 18:42 (16 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 16:41 (16 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 14:41 (17 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 12:42 (18 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 10:42 (14 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 08:42 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 06:42 (10 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 04:42 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 02:41 (17 minutes)
|
[backend1] cpu usage is over 99%
2021-05-03 00:42 (11 minutes)
|
[backend1] cpu usage is over 99%
2021-05-02 22:41 (11 minutes)
|