Monitoring cookbook

Hand-written recipes for the monitoring problems we see most often. Each recipe shows a minimal DIY script and the one-click Enterno.io monitor that covers the same concern without extra infrastructure.

Anonymous image pulls hit Docker Hub limits (100/6h per IP) — CI starts failing with TooManyRequests. Usually visible only after you are already over.
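A minimal DIY check (sketch): Docker Hub returns its rate-limit counters as response headers, and a HEAD request does not spend a pull. The alert threshold of 20 is an arbitrary assumption.

```python
import requests

# Anonymous token for the rate-limit preview repo (limits are per requesting IP).
token = requests.get(
    "https://auth.docker.io/token"
    "?service=registry.docker.io&scope=repository:ratelimitpreview/test:pull",
    timeout=10,
).json()["token"]

# HEAD returns the counters without counting as a pull.
resp = requests.head(
    "https://registry-1.docker.io/v2/ratelimitpreview/test/manifests/latest",
    headers={"Authorization": f"Bearer {token}"},
    timeout=10,
)
# Header value looks like "87;w=21600": 87 pulls left in a 21600 s (6 h) window.
remaining = int(resp.headers["ratelimit-remaining"].split(";")[0])
if remaining < 20:  # threshold is an assumption; tune to your CI volume
    print(f"ALERT: only {remaining} anonymous Docker Hub pulls left this window")
```

Run it from the same egress IP your CI uses, otherwise you are measuring someone else's quota.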
Falco logs suspicious actions (write to /etc, shell in container, unexpected network connect) — but logs sit locally and nobody looks. An in-container attack develops silently.
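A minimal DIY forwarder (sketch; assumes Falco runs with `json_output: true` and `file_output` to the path below; both the path and the webhook URL are assumptions):

```python
import json, time, requests

FALCO_LOG = "/var/log/falco/events.json"     # assumes file_output is enabled
WEBHOOK = "https://hooks.example.com/falco"  # hypothetical alert endpoint
PAGE = {"Emergency", "Alert", "Critical", "Error", "Warning"}

with open(FALCO_LOG) as f:
    f.seek(0, 2)  # start at the end of the file, like tail -f
    while True:
        line = f.readline()
        if not line:
            time.sleep(1)
            continue
        event = json.loads(line)
        if event.get("priority") in PAGE:
            requests.post(WEBHOOK, json={
                "rule": event.get("rule"),
                "output": event.get("output"),
            }, timeout=5)
```

In production you would likely run Falcosidekick instead; the point is that the events must leave the host.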
A Prometheus alert sits in state=pending past its `for` window — it should have flipped to firing but has not (group_wait too big? notifier broken? misconfigured route?). Nobody gets paged.
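A minimal DIY sweep over the Prometheus rules API (sketch; the Prometheus URL and the 60 s grace period are assumptions):

```python
import requests
from datetime import datetime, timezone

PROM = "http://prometheus:9090"  # hypothetical in-cluster URL

groups = requests.get(f"{PROM}/api/v1/rules", timeout=10).json()["data"]["groups"]
now = datetime.now(timezone.utc)

for group in groups:
    for rule in group["rules"]:
        if rule.get("type") != "alerting":
            continue
        for alert in rule.get("alerts", []):
            if alert["state"] != "pending":
                continue
            # activeAt is RFC3339; rule["duration"] is the `for:` clause in seconds
            active_at = datetime.strptime(
                alert["activeAt"][:19], "%Y-%m-%dT%H:%M:%S"
            ).replace(tzinfo=timezone.utc)
            overdue = (now - active_at).total_seconds() - rule["duration"]
            if overdue > 60:  # grace period is an assumption
                print(f"STUCK: {rule['name']} pending {overdue:.0f}s past its for-window")
```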
An Airflow DAG finished past its SLA (it did not fail, it just succeeded late). By default an SLA miss only triggers an email callback that is rarely configured. The pipeline shows a "red flag" an hour after the fact.
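A minimal DIY check against the Airflow REST API (sketch; the URL, the read-only user, and the per-DAG lateness budgets are all assumptions):

```python
import requests
from datetime import datetime, timezone, timedelta

AIRFLOW = "http://airflow:8080/api/v1"               # hypothetical URL
AUTH = ("monitor", "changeme")                        # hypothetical read-only user
SLAS = {"daily_revenue_report": timedelta(hours=2)}   # DAG id -> allowed lateness

def ts(s):
    return datetime.strptime(s[:19], "%Y-%m-%dT%H:%M:%S").replace(tzinfo=timezone.utc)

for dag_id, budget in SLAS.items():
    runs = requests.get(
        f"{AIRFLOW}/dags/{dag_id}/dagRuns",
        params={"order_by": "-end_date", "limit": 1, "state": "success"},
        auth=AUTH, timeout=10,
    ).json()["dag_runs"]
    if not runs:
        continue
    lateness = ts(runs[0]["end_date"]) - ts(runs[0]["data_interval_end"])
    if lateness > budget:
        print(f"SLA MISS: {dag_id} finished {lateness} after its interval ended")
```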
A table takes 200 GB, of which 150 GB is bloat (dead tuples). VACUUM FULL needs an exclusive lock, and autovacuum cannot keep up. You notice when an index scan turns into a seq scan.
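A minimal DIY query over the built-in statistics (sketch; the DSN and both thresholds are assumptions; a precise bloat estimate needs pgstattuple, but dead-tuple counts catch the worst offenders):

```python
import psycopg2

conn = psycopg2.connect("host=db.internal dbname=app user=monitor")  # hypothetical DSN
with conn.cursor() as cur:
    cur.execute("""
        SELECT relname, n_live_tup, n_dead_tup
        FROM pg_stat_user_tables
        WHERE n_dead_tup > 100000                  -- absolute floor: assumption
          AND n_dead_tup > n_live_tup * 0.5        -- >50% dead: assumption
        ORDER BY n_dead_tup DESC
    """)
    for relname, live, dead in cur.fetchall():
        print(f"BLOAT: {relname}: {dead} dead vs {live} live tuples")
```

pg_repack can then compact the table without the exclusive lock VACUUM FULL takes.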
Someone set `spec.suspend: true` on a CronJob (debug or rushed release) and forgot to revert. The daily task does not run, reports are not generated — you only learn when finance asks.
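A minimal DIY sweep (sketch; assumes `kubectl` is configured for the cluster):

```python
import json, subprocess

out = subprocess.run(
    ["kubectl", "get", "cronjobs", "-A", "-o", "json"],
    capture_output=True, check=True, text=True,
).stdout

for cj in json.loads(out)["items"]:
    if cj["spec"].get("suspend"):
        ns, name = cj["metadata"]["namespace"], cj["metadata"]["name"]
        print(f"SUSPENDED: {ns}/{name}: did someone forget to re-enable it?")
```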
An Azure subscription approaches a quota (vCPUs per region, public IPs, storage accounts) — the next terraform apply fails with 429/ItemNotFound. Quota raises go via a support ticket, so you need a head start.
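A minimal DIY check via the Azure CLI (sketch; this covers compute quotas only, network and storage need their own `list-usages` calls; region and threshold are assumptions):

```python
import json, subprocess

REGION = "westeurope"  # assumption
THRESHOLD = 0.8        # alert at 80% of quota: assumption

usage = json.loads(subprocess.run(
    ["az", "vm", "list-usage", "--location", REGION, "-o", "json"],
    capture_output=True, check=True, text=True,
).stdout)

for item in usage:
    limit, current = int(item["limit"]), int(item["currentValue"])
    if limit > 0 and current / limit >= THRESHOLD:
        print(f"QUOTA: {item['name']['localizedValue']}: {current}/{limit} in {REGION}")
```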
The Datadog agent dies (OOM, a broken apt update, cert expiry on dd-staging.com) — the host disappears from the dashboard after 10 min (the default mute window), but nothing alerts you that monitoring has gone blind.
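A minimal DIY watchdog over the Datadog hosts API (sketch; run it anywhere except the hosts it watches; the 15 min threshold is an assumption, and large fleets need the API's pagination parameters):

```python
import os, time, requests

resp = requests.get(
    "https://api.datadoghq.com/api/v1/hosts",
    headers={
        "DD-API-KEY": os.environ["DD_API_KEY"],
        "DD-APPLICATION-KEY": os.environ["DD_APP_KEY"],
    },
    timeout=10,
).json()

now = time.time()
for host in resp["host_list"]:
    silent = now - host["last_reported_time"]
    if silent > 900:  # 15 min of silence: threshold is an assumption
        print(f"BLIND SPOT: {host['name']} silent for {silent / 60:.0f} min")
```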
Compliance mandates rotating DB credentials every 90 days. Vault static-creds engine should do it, but someone set max_ttl=0 — the secret lives forever. The auditor finds it first.
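A minimal DIY audit of the database static roles (sketch; assumes the engine is mounted at `database/` and that 90 days is the compliance bar):

```python
import os, requests

VAULT = os.environ.get("VAULT_ADDR", "https://vault.internal:8200")  # assumption
HEADERS = {"X-Vault-Token": os.environ["VAULT_TOKEN"]}
MAX_PERIOD = 90 * 24 * 3600  # 90 days, in seconds

roles = requests.get(f"{VAULT}/v1/database/static-roles?list=true",
                     headers=HEADERS, timeout=10).json()["data"]["keys"]
for role in roles:
    cfg = requests.get(f"{VAULT}/v1/database/static-roles/{role}",
                       headers=HEADERS, timeout=10).json()["data"]
    period = cfg.get("rotation_period") or 0
    if not period or period > MAX_PERIOD:
        print(f"ROTATION GAP: {role}: rotation_period={period}")
```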
MongoDB writes on the primary grow faster than oplog retention. If a secondary falls behind by more than the oplog window, it needs a full initial sync (hours of downtime). Usually noticed too late.
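A minimal DIY check with pymongo (sketch; the connection string and the half-window threshold are assumptions):

```python
from pymongo import MongoClient

client = MongoClient("mongodb://monitor@mongo.internal/?replicaSet=rs0")  # hypothetical

# Oplog window: newest minus oldest entry in local.oplog.rs.
oplog = client.local.oplog.rs
first = oplog.find().sort("$natural", 1).limit(1).next()["ts"].time
last = oplog.find().sort("$natural", -1).limit(1).next()["ts"].time
window = last - first

# Worst secondary lag from replSetGetStatus.
status = client.admin.command("replSetGetStatus")
newest = max(m["optimeDate"] for m in status["members"])
lag = max((newest - m["optimeDate"]).total_seconds()
          for m in status["members"] if m["stateStr"] == "SECONDARY")

if lag > window * 0.5:  # half the window consumed: threshold is an assumption
    print(f"OPLOG RISK: worst lag {lag:.0f}s against a {window}s oplog window")
```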
Cassandra needs a full repair within `gc_grace_seconds` (default 10 days) — otherwise deletes resurrect as zombies on failover. Easy to miss without a scheduler.
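A minimal DIY check against `system_distributed.repair_history` (sketch; the contact point and table are hypothetical, and the status strings should be verified against your Cassandra version):

```python
from datetime import datetime, timedelta, timezone
from cassandra.cluster import Cluster

GC_GRACE = timedelta(days=10)       # default gc_grace_seconds; check your schema
KEYSPACE, TABLE = "app", "events"   # hypothetical table to watch

session = Cluster(["cassandra.internal"]).connect()  # hypothetical contact point
rows = session.execute(
    "SELECT finished_at, status FROM system_distributed.repair_history "
    "WHERE keyspace_name=%s AND columnfamily_name=%s",
    (KEYSPACE, TABLE),
)
done = [r.finished_at for r in rows if r.status == "SUCCESS" and r.finished_at]
newest = max(done, default=None)
if newest is None or datetime.now(timezone.utc) - newest.replace(tzinfo=timezone.utc) > GC_GRACE * 0.8:
    print(f"REPAIR OVERDUE: last successful repair of {KEYSPACE}.{TABLE}: {newest}")
```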
Someone ran `kubectl edit` directly on the cluster — the manifest diverges from git. ArgoCD shows OutOfSync, but auto-sync is off, so the divergence keeps accumulating.
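A minimal DIY sweep (sketch; assumes a logged-in `argocd` CLI):

```python
import json, subprocess

out = subprocess.run(
    ["argocd", "app", "list", "-o", "json"],
    capture_output=True, check=True, text=True,
).stdout

for app in json.loads(out):
    sync = app["status"]["sync"]["status"]
    if sync != "Synced":
        print(f"DRIFT: {app['metadata']['name']} is {sync}")
```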
The Jenkins queue grows — an agent went away, label mismatch, or executors are saturated. PR checks hang, devs start chat-pinging "what is up with CI?".
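A minimal DIY probe of the queue API (sketch; the URL, credentials, and the 10 min threshold are assumptions):

```python
import time, requests

JENKINS = "https://jenkins.internal"  # hypothetical URL
AUTH = ("monitor", "api-token")       # hypothetical user + API token

queue = requests.get(f"{JENKINS}/queue/api/json", auth=AUTH, timeout=10).json()
now_ms = time.time() * 1000
stuck = [i for i in queue["items"] if now_ms - i["inQueueSince"] > 10 * 60 * 1000]
if stuck:
    oldest = (now_ms - min(i["inQueueSince"] for i in stuck)) / 60000
    print(f"QUEUE: {len(stuck)} items waiting >10 min (oldest {oldest:.0f} min); "
          f"first reason: {stuck[0].get('why', 'unknown')}")
```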
ECR pulls start failing consistently (expired IRSA credentials, a network ACL, a repo policy mismatch) — pods cannot start and sit in ImagePullBackOff. The kubelet records an event, but events page nobody.
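A minimal DIY sweep (sketch; assumes `kubectl` access; the same pattern works with the kubernetes Python client):

```python
import json, subprocess

out = subprocess.run(
    ["kubectl", "get", "pods", "-A", "-o", "json"],
    capture_output=True, check=True, text=True,
).stdout

bad = set()
for pod in json.loads(out)["items"]:
    for cs in pod.get("status", {}).get("containerStatuses", []):
        reason = cs.get("state", {}).get("waiting", {}).get("reason")
        if reason in ("ImagePullBackOff", "ErrImagePull"):
            bad.add(f'{pod["metadata"]["namespace"]}/{pod["metadata"]["name"]}')
if bad:
    print(f"PULL FAILURES: {len(bad)} pods: {', '.join(sorted(bad)[:5])}")
```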
After a release the Lighthouse performance score drops from 90 to 65 (a new library without code-splitting, or an un-minified bundle). You only learn when RUM starts showing LCP > 4 s.
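A minimal DIY gate via the PageSpeed Insights API, which runs Lighthouse server-side (sketch; the page URL and the 0.85 floor are assumptions, and any real volume needs an API key):

```python
import requests

URL = "https://example.com"  # hypothetical page to audit
FLOOR = 0.85                 # fail below 85: threshold is an assumption

resp = requests.get(
    "https://www.googleapis.com/pagespeedonline/v5/runPagespeed",
    params={"url": URL, "category": "performance", "strategy": "mobile"},
    timeout=120,
).json()
score = resp["lighthouseResult"]["categories"]["performance"]["score"]
if score < FLOOR:
    raise SystemExit(f"PERF REGRESSION: Lighthouse score {score:.2f} < {FLOOR}")
```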
Someone added `import * as _ from 'lodash'` — the whole-library import grew the bundle by 70 KB. CI passed (tests OK), but first user load got 300 ms slower. Catch it in CI before merge.
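A minimal DIY budget check for CI (sketch; the `dist/` path and the 250 KB budget are assumptions; in a JS repo a tool like size-limit does this properly):

```python
import gzip, sys
from pathlib import Path

BUDGET_KB = 250       # gzipped JS budget: assumption, pick one and defend it
DIST = Path("dist")   # assumes the bundler emits here

total = sum(len(gzip.compress(p.read_bytes())) for p in DIST.glob("**/*.js"))
kb = total / 1024
print(f"bundle: {kb:.0f} KB gzipped (budget {BUDGET_KB} KB)")
if kb > BUDGET_KB:
    sys.exit(1)  # fail the PR check, not the postmortem
```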
Compliance mandates rotating k8s Secrets (DB passwords, API tokens) every 90 days. Nobody auto-rotates, Secrets live since cluster creation. The auditor finds it first.
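A minimal DIY audit (sketch; note that `creationTimestamp` survives in-place updates, so this flags Secrets that were never recreated; if you rotate by patching, track a rotated-at annotation instead, and that annotation name is hypothetical):

```python
import json, subprocess
from datetime import datetime, timezone

MAX_AGE_DAYS = 90
out = subprocess.run(
    ["kubectl", "get", "secrets", "-A", "-o", "json"],
    capture_output=True, check=True, text=True,
).stdout

now = datetime.now(timezone.utc)
for s in json.loads(out)["items"]:
    if s["type"] == "kubernetes.io/service-account-token":
        continue  # managed by the control plane
    created = datetime.strptime(
        s["metadata"]["creationTimestamp"], "%Y-%m-%dT%H:%M:%SZ"
    ).replace(tzinfo=timezone.utc)
    age = (now - created).days
    if age > MAX_AGE_DAYS:
        ns, name = s["metadata"]["namespace"], s["metadata"]["name"]
        print(f"STALE SECRET: {ns}/{name} is {age} days old")
```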
Someone ran `vault secrets disable` (debugging, or drift) — the pipeline reaches for DB creds and gets a 404. Vault does not warn: to Vault this is a normal admin action.
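A minimal DIY canary (sketch; the Vault address and the expected mount set are assumptions):

```python
import os, requests

VAULT = os.environ.get("VAULT_ADDR", "https://vault.internal:8200")  # assumption
EXPECTED = {"database/", "kv/", "pki/"}  # mounts your pipelines depend on

resp = requests.get(
    f"{VAULT}/v1/sys/mounts",
    headers={"X-Vault-Token": os.environ["VAULT_TOKEN"]},
    timeout=10,
).json()
mounts = resp.get("data", resp)  # newer Vault wraps the listing in "data"

missing = EXPECTED - set(mounts.keys())
if missing:
    print(f"MOUNT GONE: {sorted(missing)}; check the audit log for who disabled it")
```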
Fastly soft-purge is typically sub-second, but sometimes hangs 30+ s (overload, key collision). After a release, new assets do not appear, users see the old version.
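A minimal DIY post-deploy probe (sketch; assumes the deploy bakes a build id into the bundle; the asset URL and the 120 s deadline are assumptions):

```python
import sys, time, requests

ASSET = "https://www.example.com/app.js"  # hypothetical URL served via Fastly
MARKER = sys.argv[1]                      # e.g. the git SHA embedded at build time

deadline = time.time() + 120
while time.time() < deadline:
    resp = requests.get(ASSET, headers={"Fastly-Debug": "1"}, timeout=10)
    if MARKER in resp.text:
        print(f"OK: edge serves build {MARKER} "
              f"(X-Served-By: {resp.headers.get('x-served-by')})")
        break
    time.sleep(5)
else:
    raise SystemExit("STALE: edge still serving the old build after 120 s")
```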
A GCP project quota (CPUs, IPs, persistent disks) creeps toward its limit. The next terraform apply fails with RESOURCE_EXHAUSTED. Quota requests take 1–2 days, so you need a head start.
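A minimal DIY check via gcloud (sketch; region and threshold are assumptions; project-wide quotas live under `gcloud compute project-info describe`):

```python
import json, subprocess

REGION = "europe-west1"  # assumption
THRESHOLD = 0.8          # alert at 80% of quota: assumption

region = json.loads(subprocess.run(
    ["gcloud", "compute", "regions", "describe", REGION, "--format", "json"],
    capture_output=True, check=True, text=True,
).stdout)

for q in region["quotas"]:
    if q["limit"] and q["usage"] / q["limit"] >= THRESHOLD:
        print(f"QUOTA: {q['metric']}: {q['usage']:.0f}/{q['limit']:.0f} in {REGION}")
```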
Have a recipe we missed?
Tell us which stack to cover next — drop a line to support@enterno.io and we'll add the recipe (and credit you on the page).
Start monitoring — free →