Pingdom customer impact
External signalNo Pingdom checks were available in this window.
No active issue listed in this category.
Generated 2026-06-21 22:54 for 2026-06-14 07:00 to 2026-06-21 07:00 from Pingdom checks, Slack #_alerts_prod, and AWS SNS alerts.
Some enrichments were unavailable in this run; drilldowns below stay focused on captured evidence.
Bottom line: application-level critical paths are present and catalog database alarms still look active.
No Pingdom checks were available in this window.
No active issue listed in this category.
5 critical, 30 non-critical active item(s).
2 critical, 0 non-critical active item(s).
| Pingdom Check | Status | Events | Downtime | Last Seen | Likely Services | Correlated Evidence |
|---|
Pingdom rows show externally visible signal first. The correlated evidence column helps tie the failing check back to services, Slack alert families, or AWS alarms when those links exist.
This view attributes alerts to the workload or resource named in the alert text. Grafana, Loki, and Tempo are treated as observability components and are excluded when a more specific impacted target is also present.
| Impacted Service / Resource | Highest Severity | Count | Last Seen | Status | Top Alert Types | Discussion Signal | Latest Thread Note |
|---|---|---|---|---|---|---|---|
| grafana | Critical | 67 | 2026-06-19 14:18 | Recent (72h) | PlatformLatencyP95Critical1s (7)Platform5xxLowVolumeCritical100 (4)KubeContainerMemoryHigh (15)NodeHighNumberConntrackEntriesUsed (7)KubePersistentVolumeFillingUp (6) | Observability storage | *Alert:* `Platform5xxRatioCritical3pct` (critical) | *When:* 2026-06-16 06:53 Europe/Bucharest | | *Service:* `grafana` | *TL;DR:* `grafana… |
| web-80 | Critical | 31 | 2026-06-18 16:09 | Recent (72h) | TraefikServiceHighErrorRate (14)TraefikServiceHighLatency (17) | Observability storage | *Alert:* `TraefikServiceHighErrorRate` (critical) | *When:* 2026-06-16 05:55 Europe/Bucharest | | *Service:* `web-80` | *TL;DR:* `web-80` c… |
| docgen2-api | Critical | 28 | 2026-06-19 09:05 | Recent (72h) | TraefikServiceHighErrorRate (4)CPUThrottlingHigh (13)KubeHpaMaxedOut (11) | None | No thread note |
| core-grafana-80 | Critical | 3 | 2026-06-19 10:43 | Recent (72h) | TraefikServiceHighErrorRate (3) | None | No thread note |
| uni-api Grouped 2 variantsVariant mentions 13Active variants 2 | Critical | 14 | 2026-06-17 18:34 | Seen this week | TraefikServiceHighErrorRate (4)TraefikServiceHighLatency (8)KubePodCrashLooping (1)KubeDeploymentReplicasMismatch (1) | Release / migration issue | *Alert:* `TraefikServiceHighErrorRate` (critical) | *When:* 2026-06-16 16:38 Europe/Bucharest | | *Service:* `uni-api` | *TL;DR:* `uni-api`… | *Fix now:* Check the latest deployed revision for `/api/v2/uni/student-studies/students/1882/disciplines?semester=1` and rollback or hotfix… |
| download-album Grouped 3 variantsVariant mentions 20Active variants 3 | Warning | 20 | 2026-06-21 05:43 | Seen today | KubeJobFailed (20) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| ai-api | Warning | 17 | 2026-06-19 17:15 | Recent (72h) | TraefikServiceHighLatency (17) | None | No thread note |
| loki | Warning | 6 | 2026-06-19 14:18 | Recent (72h) | KubePersistentVolumeFillingUp (6) | None | No thread note |
| admission-api | Warning | 3 | 2026-06-18 11:26 | Recent (72h) | TraefikServiceHighLatency (3) | None | No thread note |
| subscriptions-api Grouped 2 variantsVariant mentions 14Active variants 2 | Warning | 21 | 2026-06-16 01:43 | Seen this week | TraefikServiceHighLatency (13)KubeDeploymentReplicasMismatch (7)KubePodCrashLooping (1) | None | No thread note |
| social-api | Warning | 17 | 2026-06-15 14:11 | Seen this week | TraefikServiceHighLatency (17) | None | No thread note |
| core-getresponse-events-worker | Warning | 13 | 2026-06-16 07:03 | Seen this week | KubeDeploymentReplicasMismatch (9)KubePodCrashLooping (4) | None | No thread note |
| notifications-event-manager | Warning | 12 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (6)KubeHpaMaxedOut (5)KubePodCrashLooping (1) | None | No thread note |
| colecteaza-sms-note-abs | Warning | 11 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (11) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| service-websocket | Warning | 11 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (8)KubePodCrashLooping (3) | None | No thread note |
| rezumat | Warning | 10 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (10) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| publish-results | Warning | 9 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (9) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| update-recurenta | Warning | 9 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (9) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| attendance-register-missed-attendance | Warning | 8 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (8) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| stats-active-users | Warning | 8 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (8) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| stats-school-students | Warning | 7 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (7) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| library-api | Warning | 7 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (6)KubePodCrashLooping (1) | None | No thread note |
| notifications-push-sender-worker | Warning | 7 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (6)KubePodCrashLooping (1) | None | No thread note |
| service-av | Warning | 7 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (6)KubePodCrashLooping (1) | None | No thread note |
| send-codes | Warning | 6 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (6) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| notifications-events-worker | Warning | 6 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (5)KubePodCrashLooping (1) | None | No thread note |
| subscriptions-school-stats-worker | Warning | 6 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (5)KubePodCrashLooping (1) | None | No thread note |
| generate-invoices | Warning | 5 | 2026-06-16 15:36 | Seen this week | KubeJobFailed (5) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| service-fetgenerator Grouped 2 variantsVariant mentions 2Active variants 2 | Warning | 5 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (4)KubePodCrashLooping (1) | None | No thread note |
| subscriptions-assign-worker | Warning | 5 | 2026-06-16 01:43 | Seen this week | KubeDeploymentReplicasMismatch (4)KubePodCrashLooping (1) | None | No thread note |
| rooms-api | Warning | 5 | 2026-06-15 11:29 | Seen this week | TraefikServiceHighLatency (5) | None | No thread note |
| web | Warning | 3 | 2026-06-15 11:36 | Seen this week | KubeDeploymentReplicasMismatch (3) | None | No thread note |
| social-local-cache | Warning | 1 | 2026-06-16 15:30 | Seen this week | KubeJobNotCompleted (1) | None | No thread note |
| metrics-server | Warning | 1 | 2026-06-15 21:00 | Seen this week | KubeAggregatedAPIDown (1) | None | No thread note |
| billing-api | Warning | 1 | 2026-06-15 14:11 | Seen this week | TraefikServiceHighLatency (1) | None | No thread note |
| Alert | Severity | Count | Last Seen | Status | Threads | Top Impacted Services | Discussion Signal | Latest Thread Note |
|---|---|---|---|---|---|---|---|---|
| TraefikServiceHighErrorRate | Critical | 25 | 2026-06-19 10:43 | Recent (72h) | 2 | web-80 (14)docgen2-api (4)uni-api (4)core-grafana-80 (3) | Observability storageRelease / migration issue | *Alert:* `TraefikServiceHighErrorRate` (critical) | *When:* 2026-06-16 16:38 Europe/Bucharest | | *Service:* `uni-api` | *TL;DR:* `uni-api`… | *Fix now:* Check the latest deployed revision for `/api/v2/uni/student-studies/students/1882/disciplines?semester=1` and rollback or hotfix… |
| Platform5xxRatioCritical3pct | Critical | 2 | 2026-06-18 16:18 | Recent (72h) | 1 | grafana (2) | Observability storage | *Alert:* `Platform5xxRatioCritical3pct` (critical) | *When:* 2026-06-16 06:53 Europe/Bucharest | | *Service:* `grafana` | *TL;DR:* `grafana… |
| PlatformLatencyP95Critical1s | Critical | 7 | 2026-06-15 14:01 | Seen this week | 0 | grafana (7) | None | |
| Platform5xxLowVolumeCritical100 | Critical | 4 | 2026-06-16 05:48 | Seen this week | 1 | grafana (4) | Observability storage | *Alert:* `Platform5xxLowVolumeCritical100` (critical) | *When:* 2026-06-16 05:48 Europe/Bucharest | | *Service:* `grafana` | *TL;DR:* `graf… |
| KubeJobFailed | Warning | 27 | 2026-06-21 05:43 | Seen today | 1 | download-album (20)colecteaza-sms-note-abs (11)rezumat (10)publish-results (9)update-recurenta (9) | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| TraefikServiceHighLatency | Warning | 26 | 2026-06-19 17:15 | Recent (72h) | 0 | social-api (17)web-80 (17)ai-api (17)subscriptions-api (13)uni-api (8) | None | |
| KubeHpaMaxedOut | Warning | 22 | 2026-06-18 13:21 | Recent (72h) | 0 | docgen2-api (11)grafana (6)notifications-event-manager (5) | None | |
| KubeContainerMemoryHigh | Warning | 15 | 2026-06-18 17:22 | Recent (72h) | 0 | grafana (15) | None | |
| CPUThrottlingHigh | Warning | 13 | 2026-06-19 09:05 | Recent (72h) | 0 | docgen2-api (13) | None | |
| NodeHighNumberConntrackEntriesUsed | Warning | 7 | 2026-06-19 09:32 | Recent (72h) | 0 | grafana (7) | None | |
| KubePersistentVolumeFillingUp | Warning | 6 | 2026-06-19 14:18 | Recent (72h) | 0 | grafana (6)loki (6) | None | |
| NodeSystemSaturation | Warning | 3 | 2026-06-19 01:24 | Recent (72h) | 0 | grafana (3) | None | |
| Platform5xxRatioWarning1pct | Warning | 2 | 2026-06-18 16:18 | Recent (72h) | 0 | grafana (2) | None | |
| NodeDiskIOSaturation | Warning | 1 | 2026-06-19 01:40 | Recent (72h) | 0 | grafana (1) | None | |
| KubeDeploymentReplicasMismatch | Warning | 14 | 2026-06-17 18:28 | Seen this week | 0 | core-getresponse-events-worker (9)service-websocket (8)subscriptions-api (7)library-api (6)notifications-event-manager (6) | None | |
| KubePodCrashLooping | Warning | 6 | 2026-06-17 18:29 | Seen this week | 0 | core-getresponse-events-worker (4)service-websocket (3)library-api (1)notifications-event-manager (1)notifications-events-worker (1) | None | |
| PlatformLatencyP95Warning400ms | Warning | 6 | 2026-06-15 14:01 | Seen this week | 0 | grafana (6) | None | |
| PlatformLatencyP99Warning5s | Warning | 5 | 2026-06-15 14:07 | Seen this week | 0 | grafana (5) | None | |
| TargetDown | Warning | 2 | 2026-06-15 09:31 | Seen this week | 0 | grafana (2) | None | |
| KubeJobNotCompleted | Warning | 1 | 2026-06-16 15:30 | Seen this week | 0 | social-local-cache (1) | None | |
| Platform5xxLowVolumeWarning20 | Warning | 1 | 2026-06-16 04:11 | Seen this week | 0 | grafana (1) | None | |
| KubeAggregatedAPIDown | Warning | 1 | 2026-06-15 21:00 | Seen this week | 0 | metrics-server (1) | None |
Status is heuristic. Slack rarely posts explicit resolutions, so “Seen today” or “Recent” means the alert family still appeared in production recently, not that it is definitely unresolved.
| AWS Alarm | Emails | ALARM | OK | State Flips | First Seen | Last Seen | Latest State | Status |
|---|---|---|---|---|---|---|---|---|
| adservio-rds-mysql-catalog2-memory-low | 32 | 17 | 15 | 30 | 2026-06-15 06:22 | 2026-06-21 03:59 | ALARM | Still alarming |
| adservio-rds-mysql-catalog2-swap-high | 3 | 2 | 1 | 2 | 2026-06-15 06:33 | 2026-06-17 06:42 | ALARM | Still alarming |
| adservio-root-account-usage | 4 | 2 | 2 | 3 | 2026-06-16 02:47 | 2026-06-16 15:07 | OK | Latest OK |
| adservio-rds-mysql-catalog2-storage-low | 4 | 2 | 2 | 3 | 2026-06-15 23:55 | 2026-06-16 06:00 | OK | Latest OK |
| adservio-rds-mysql-catalog3-cpu-high | 1 | 0 | 1 | 0 | 2026-06-15 20:05 | 2026-06-15 20:05 | OK | Latest OK |
| adservio-rds-mysql-catalog3-read-latency-high | 1 | 0 | 1 | 0 | 2026-06-15 20:05 | 2026-06-15 20:05 | OK | Latest OK |
| adservio-rds-mysql-catalog3-memory-low | 1 | 0 | 1 | 0 | 2026-06-15 20:05 | 2026-06-15 20:05 | OK | Latest OK |
| adservio-rds-mysql-catalog3-storage-low | 1 | 0 | 1 | 0 | 2026-06-15 20:05 | 2026-06-15 20:05 | OK | Latest OK |
| adservio-rds-mysql-catalog3-write-latency-high | 1 | 0 | 1 | 0 | 2026-06-15 20:05 | 2026-06-15 20:05 | OK | Latest OK |
| adservio-rds-mysql-catalog3-connections-high | 1 | 0 | 1 | 0 | 2026-06-15 20:04 | 2026-06-15 20:04 | OK | Latest OK |
| adservio-rds-mysql-catalog3-disk-queue-high | 1 | 0 | 1 | 0 | 2026-06-15 20:04 | 2026-06-15 20:04 | OK | Latest OK |
| adservio-rds-mysql-catalog3-swap-high | 1 | 0 | 1 | 0 | 2026-06-15 20:04 | 2026-06-15 20:04 | OK | Latest OK |
“Flapping, latest OK” means the most recent email was an OK, but the alarm toggled repeatedly and is still a reliability concern.
| Thread Date | Alert | Severity | Services | Signal | Key Notes |
|---|---|---|---|---|---|
| 2026-06-16 16:38 | TraefikServiceHighErrorRate | Critical | uni-api | Release / migration issue | *Alert:* `TraefikServiceHighErrorRate` (critical) | *When:* 2026-06-16 16:38 Europe/Bucharest | | *Service:* `uni-api` | *TL;DR:* `uni-api`… | *Fix now:* Check the latest deployed revision for `/api/v2/uni/student-studies/students/1882/disciplines?semester=1` and rollback or hotfix… |
| 2026-06-16 11:31 | KubeJobFailed | Warning | attendance-register-missed-attendance, colecteaza-sms-note-abs, download-album, generate-invoices | General investigation | These are failed jobs which can be cleaned up from cluster , removed these failed jobs . |
| 2026-06-16 06:53 | Platform5xxRatioCritical3pct | Critical | grafana | Observability storage | *Alert:* `Platform5xxRatioCritical3pct` (critical) | *When:* 2026-06-16 06:53 Europe/Bucharest | | *Service:* `grafana` | *TL;DR:* `grafana… |
| 2026-06-16 05:55 | TraefikServiceHighErrorRate | Critical | web-80 | Observability storage | *Alert:* `TraefikServiceHighErrorRate` (critical) | *When:* 2026-06-16 05:55 Europe/Bucharest | | *Service:* `web-80` | *TL;DR:* `web-80` c… |
| 2026-06-16 05:48 | Platform5xxLowVolumeCritical100 | Critical | grafana | Observability storage | *Alert:* `Platform5xxLowVolumeCritical100` (critical) | *When:* 2026-06-16 05:48 Europe/Bucharest | | *Service:* `grafana` | *TL;DR:* `graf… |