SLI — a measured metric (e.g. "response time p99"). SLO — a target for the SLI (e.g. "p99 < 200ms"). SLA — a contractual commitment to customers (e.g. "99.9% uptime, otherwise refund"). Google's SRE book popularized this hierarchy. Typical: SLO = 99.9% monthly → error budget = 43 min/month. When budget is spent — pause feature work and focus on reliability.
Below: details, example, related terms, FAQ.
SLI: % requests with status 2xx\/3xx
SLO: 99.9% of requests succeed (monthly)
SLA: refund 10% if <99.9% in a monthSynthetic probes (Enterno monitors) every minute. 30-day window. Success = HTTP 2xx/3xx + response time < threshold.
Internal SLO — always. SLA — only if a customer requires it (enterprise, compliance).
Feature-freeze, postmortem, reliability work until budget recovers. This is the main value of the error-budget approach.