Перейти к содержимому
Skip to content
← All articles

MTTR, MTTF, MTBF: Reliability Metrics Explained for Web Operations

Reliability metrics are the language of uptime. When someone asks "how reliable is your service?", metrics like MTTR, MTTF, and MTBF provide objective answers. Understanding these metrics helps you set meaningful SLAs, prioritize improvements, and communicate with stakeholders.

MTTR — Mean Time to Repair

MTTR measures the average time from when a failure is detected to when the service is restored. It's the most actionable reliability metric because it directly measures your team's ability to respond to and fix issues.

MTTR Formula

MTTR = Total repair time / Number of repairs
Example: 3 incidents took 30min + 120min + 15min = 165min total
MTTR = 165 / 3 = 55 minutes

MTTR Components

Reducing MTTR

MTTF — Mean Time to Failure

MTTF measures the average time a system operates before its first failure. It's primarily used for non-repairable systems or new deployments. For web services, MTTF answers: "How long after deployment until something breaks?"

MTTF Formula

MTTF = Total uptime before failures / Number of failures
Example: 3 deployments ran for 72h, 168h, 48h before failing
MTTF = (72 + 168 + 48) / 3 = 96 hours

Improving MTTF

MTBF — Mean Time Between Failures

MTBF measures the average time between consecutive failures for repairable systems. It includes both uptime and repair time: MTBF = MTTF + MTTR. This is the most commonly cited reliability metric for ongoing services.

MTBF Formula

MTBF = Total operational time / Number of failures
Example: Service ran 720 hours in a month with 3 failures
MTBF = 720 / 3 = 240 hours between failures

How They Relate

MTBF = MTTF + MTTR

|←—— MTBF ——→|←—— MTBF ——→|
|← MTTF →|←MTTR→|← MTTF →|←MTTR→|
[  uptime  ][down ][  uptime  ][down ]

Availability from Metrics

These metrics directly calculate service availability:

Availability = MTTF / MTBF = MTTF / (MTTF + MTTR)

Example: MTTF = 237h, MTTR = 3h
Availability = 237 / (237 + 3) = 237 / 240 = 98.75%

This shows that reducing MTTR has a disproportionate impact on availability compared to increasing MTTF. Going from 3h to 1h MTTR improves availability more than doubling MTTF.

Setting Targets

AvailabilityAnnual DowntimeExample MTBF/MTTR
99%3.65 daysMTBF 100h, MTTR 1h
99.9%8.76 hoursMTBF 1000h, MTTR 1h
99.95%4.38 hoursMTBF 2000h, MTTR 1h
99.99%52.6 minutesMTBF 10000h, MTTR 1h

Practical Tips

Conclusion

MTTR, MTTF, and MTBF are complementary metrics that together paint a complete picture of service reliability. Start by measuring MTTR — it's the most actionable. Then track MTBF to understand your overall reliability trend. Use these numbers to set realistic SLAs, justify infrastructure investments, and demonstrate improvement over time.

Check your website right now

Check now →
More articles: Monitoring
Monitoring
Incident Response Plan: A Step-by-Step Guide for Web Teams
16.03.2026 · 10 views
Monitoring
Uptime Monitoring: Why and How to Set It Up
14.03.2026 · 10 views
Monitoring
Designing Effective Health Check Endpoints for Web Services
16.03.2026 · 11 views
Monitoring
Alerting Best Practices for Website Monitoring
14.03.2026 · 20 views