Outages like: |
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 17:26 (4 minutes)
|
Found similar: |
[db-postgresql1] cpu usage is over 99%
2021-05-28 17:11 (14 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 16:41 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 16:21 (5 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 16:06 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 15:56 (3 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 15:41 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 15:22 (7 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 15:06 (4 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 15:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 14:52 (3 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 14:47 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 14:31 (7 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-28 14:01 (2 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 13:31 (17 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 13:10 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-28 13:01 (1 minute)
|
[nosql1] cpu usage is over 99%
2021-05-28 13:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 12:46 (13 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 12:34 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 12:16 (17 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 12:11 (3 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 12:06 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-28 12:01 (1 minute)
|
[nosql1] cpu usage is over 99%
2021-05-28 12:01 (10 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 11:36 (20 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 11:26 (5 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 11:09 (2 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 11:06 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-28 11:01 (2 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 11:01 (2 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 10:26 (7 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 10:06 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 10:06 (2 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 10:01 (1 minute)
|
[nosql1] cpu usage is over 99%
2021-05-28 10:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 09:41 (14 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 09:34 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-28 09:17 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 09:12 (9 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 09:06 (4 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 09:01 (1 minute)
|
[nosql1] cpu usage is over 99%
2021-05-28 09:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 08:46 (2 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 08:31 (11 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 08:11 (1 minute)
|
[nosql1] cpu usage is over 99%
2021-05-28 08:07 (4 minutes)
|
[nosql2] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-28 08:01 (1 minute)
|
[nosql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-28 08:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 08:01 (24 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-28 07:31 (22 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 07:16 (7 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 07:01 (2 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 06:46 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 06:16 (17 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 06:12 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 06:06 (4 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 06:01 (2 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 06:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 06:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 05:46 (6 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-28 05:16 (3 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 05:01 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-28 04:01 (2 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 04:01 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-28 03:12 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 02:53 (19 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-28 02:37 (7 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 02:26 (3 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 02:16 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 01:46 (12 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 01:36 (5 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 01:20 (6 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 01:01 (2 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-28 01:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 00:36 (18 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-28 00:11 (8 minutes)
|
[nosql2] cpu usage is over 99%
2021-05-28 00:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 23:51 (10 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 23:41 (3 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 23:22 (11 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 23:11 (4 minutes)
|
[nosql1] cpu usage is over 99%
2021-05-27 23:06 (1 minute)
|
[nosql2] cpu usage is over 99%
2021-05-27 23:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 23:01 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 22:26 (16 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 21:51 (28 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-27 21:31 (10 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-27 21:16 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-27 20:51 (6 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
okmeter outage
2021-05-27 20:26 (3 minutes)
|
[db-postgresql1] cpu usage is over 99%
2021-05-27 19:51 (13 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 19:36 (4 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 19:26 (5 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 19:16 (6 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 19:06 (5 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 18:56 (5 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 18:36 (16 minutes)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 18:12 (17 minutes)
|
[nosql2] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 18:01 (1 minute)
|
[db-postgresql1] cpu usage is over 99%
possible false positive due to
На ноде cl1ojovqhk3qm0ungvip-ujoh были ошибки udp пакетов.
Помог перезапуск collectorov на этой node.
Полезные график;
https://alfa.okmetric.com/okmeter/graph?duration=1h&graph_config=group_by%3A%20source_hostname%0Aoptions%3A%0A%20%20y_title%3A%20connections%0Alines%3A%0A%20%20-%20expression%3A%20%22sum_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.count%27))%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27)))%22%0A%20%20%20%20type%3A%20area%0A%20%20-%20expression%3A%20%22sum(max_by(instance%2C%20metric(name%3D%27collector.tcp_collector.connections.threshold%27))%20%20*%20max_by(instance%2C%20defined(metric(name%3D%27collector.tcp_collector.connections.gc.time.sum%27))))%22%0A%20%20%20%20legend%3A%20configured%20threshold%0Aevents%3A%0A%20%20-%20collector-errors%0A%20%20-%20tcp_collector_critical%0Atitle%3A%20Active%20connections%0A
https://alfa.okmetric.com/okmeter/hosts/cl1ojovqhk3qm0ungvip-ujoh/netstat?&duration=1h
2021-05-27 17:37 (18 minutes)
|