Key takeaways:
Availability is the percentage of time a service or product is accessible and performing intended operations under normal conditions.
Five 9s represents a 99.999% uptime, meaning less than 5.26 minutes of downtime per year.
To achieve high availability in system design, use load balancing, redundancy, rate limiting, circuit breakers, and failover mechanisms.
Availability is measured using metrics like mean time to failure (MTTF), mean time to repair (MTTR), and mean time between failures (MTBF).
Availability in system design is the percentage of time a service or product is accessible and performing its intended operations under normal conditions. For example, if a service’s availability is 50%, it is accessible and operational for half of the year; 100% availability means the service is available all the time, which is impossible in practice.
When deploying a service, clients are often assured that the application will be available 99.999% (five 9s) of the time. This allows for only 0.001% of downtime. Let’s take a look at some calculations to see exactly what that entails:
A downtime of 0.001% equals roughly 5.26 minutes––our service will be down for less than 6 minutes a year.
Now, 6 minutes in a year doesn’t sound like much, but there’s something we haven’t considered yet. Assume an application runs on 200 microservices to handle the load, and each of these 200 services can fail at a different time. If the services depend on each other, the failure of any one of them means failure of the overall service. In the worst case, the overall downtime adds up to 200 × 5.26 minutes ≈ 1,052 minutes, or roughly 17.5 hours.
So even though each service is down for merely 0.001% of the time, the overall service can be unavailable for approximately 18 hours a year. Such high numbers are unacceptable, and planned downtimes are not even included in this figure.
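The arithmetic above can be sketched in a few lines of Python (the 200-service count is the example figure from the scenario above):

```python
# Minutes in an average year (365.25 days).
MINUTES_PER_YEAR = 365.25 * 24 * 60

def downtime_minutes_per_year(availability_pct):
    """Minutes of downtime per year implied by an availability percentage."""
    return MINUTES_PER_YEAR * (1 - availability_pct / 100)

# Five 9s: about 5.26 minutes of downtime per year for a single service.
print(round(downtime_minutes_per_year(99.999), 2))

# Worst case for 200 interdependent services failing at different times:
# the per-service downtimes add up to roughly 17.5 hours.
print(round(200 * downtime_minutes_per_year(99.999) / 60, 1))
```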
The following table shows the effect of different time percentages on availability with approximate numbers:
| Availability | Downtime per year | Downtime per month | Downtime per day |
| --- | --- | --- | --- |
| 1 nine –– 90% | 36.5 days | 72 hours | 2.4 hours |
| 2 nines –– 99.0% | 3.65 days | 7.20 hours | 14.4 minutes |
| 3 nines –– 99.9% | 8.76 hours | 43.8 minutes | 1.44 minutes |
| 4 nines –– 99.99% | 52.56 minutes | 4.32 minutes | 8.64 seconds |
| 5 nines –– 99.999% | 5.26 minutes | 25.9 seconds | 0.86 seconds |
Availability is crucial for the success of any service and for delivering a seamless user experience.
It ensures that users can access the service and perform their intended tasks at any time. When availability is low and downtime is frequent, it can lead to financial losses and increased customer dissatisfaction—both of which can seriously impact the reputation and reliability of the service.
Usually, a system’s availability is defined through different metrics, such as the percentage of total operational time, mean time to failure (MTTF), mean time to repair (MTTR), and mean time between failures (MTBF).
In system design, minimizing these times and achieving five 9s of availability is the ultimate goal for an optimal and efficient system, but it is also challenging. To achieve high availability, we must carefully follow some design principles that reduce the probability of failure, as discussed in the following sections.
For a service with millions of users, numerous requests can arrive every second.
Even with thousands of servers available to handle these requests, if they are all redirected to a single server, they can overload and crash it. A load balancer is a component that avoids overloading any one server by fairly dividing incoming requests among the pool of available servers.
This helps improve availability, scalability, and performance.
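As a minimal sketch, a round-robin policy (one of several common strategies) spreads requests evenly across a pool of servers; the server names here are hypothetical:

```python
import itertools

class RoundRobinBalancer:
    """Minimal round-robin load balancer: hands each incoming request
    to the next server in the pool, cycling back to the first."""
    def __init__(self, servers):
        self._cycle = itertools.cycle(servers)

    def route(self, request):
        server = next(self._cycle)
        return server, request

lb = RoundRobinBalancer(["server-a", "server-b", "server-c"])
for i in range(4):
    server, _ = lb.route(f"req-{i}")
    print(server)  # cycles server-a, server-b, server-c, server-a
```

Real load balancers also weigh servers by capacity and skip unhealthy ones, but the fair-division idea is the same.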
We can duplicate important or critical components, including servers and data.
For example, balancing load in a multiple-server environment ensures that the load is shifted to other available servers if a server fails. Similarly, data replication ensures the availability of data in case a storage device fails by continuously duplicating data from a primary location to a separate secondary location.
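The data-replication idea can be sketched as a toy key-value store that synchronously copies every write to a secondary location (a simplified illustration, not a real replication protocol):

```python
class ReplicatedStore:
    """Sketch of primary/secondary replication: every write is copied
    to the secondary, so reads can fail over if the primary dies."""
    def __init__(self):
        self.primary = {}
        self.secondary = {}
        self.primary_up = True

    def write(self, key, value):
        self.primary[key] = value
        self.secondary[key] = value  # replicate to the secondary location

    def read(self, key):
        # Serve from the secondary if the primary storage has failed.
        store = self.primary if self.primary_up else self.secondary
        return store[key]

db = ReplicatedStore()
db.write("user:1", "alice")
db.primary_up = False       # simulate a primary storage failure
print(db.read("user:1"))    # still served from the secondary: alice
```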
Rate limiting is the practice of restricting the number of requests a user can make to a service within a defined time period.
It helps throttle requests that exceed a predefined threshold. Typically used as a protective mechanism, a rate limiter prevents excessive or abusive usage, whether intentional or accidental, ensuring fair resource allocation and consistent availability for all users.
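A fixed-window counter is one simple way to implement this; the sketch below allows at most `limit` requests per user in each time window (other schemes, such as token buckets or sliding windows, refine the same idea):

```python
import time
from collections import defaultdict

class FixedWindowRateLimiter:
    """Allow at most `limit` requests per user per `window` seconds,
    using fixed-window counting (a sketch, not production code)."""
    def __init__(self, limit, window):
        self.limit = limit
        self.window = window
        self.counts = defaultdict(int)

    def allow(self, user, now=None):
        now = time.monotonic() if now is None else now
        # Requests in the same window share one counter per user.
        key = (user, int(now // self.window))
        self.counts[key] += 1
        return self.counts[key] <= self.limit

limiter = FixedWindowRateLimiter(limit=3, window=60)
results = [limiter.allow("alice", now=10.0) for _ in range(5)]
print(results)  # [True, True, True, False, False]
```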
When a server fails, the circuit breaker temporarily stops sending requests to it, allowing it to recover and preventing cascading failures. This helps keep the service available by isolating failed servers. If redundant servers are present, requests are redirected to them.
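A minimal circuit breaker can be sketched as follows (the thresholds and error types are illustrative assumptions): after a few consecutive failures the circuit "opens" and calls are rejected immediately, giving the failed server time to recover.

```python
import time

class CircuitBreaker:
    """Minimal circuit breaker: after `max_failures` consecutive failures
    the circuit opens and calls are rejected until `reset_timeout` passes."""
    def __init__(self, max_failures=3, reset_timeout=30.0):
        self.max_failures = max_failures
        self.reset_timeout = reset_timeout
        self.failures = 0
        self.opened_at = None

    def call(self, fn, *args, now=None):
        now = time.monotonic() if now is None else now
        if self.opened_at is not None:
            if now - self.opened_at < self.reset_timeout:
                raise RuntimeError("circuit open: server isolated")
            self.opened_at = None   # half-open: allow one trial call through
            self.failures = 0
        try:
            result = fn(*args)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = now   # trip the breaker
            raise
        self.failures = 0              # a success resets the failure count
        return result

breaker = CircuitBreaker(max_failures=2, reset_timeout=30.0)

def flaky():
    raise IOError("server down")

for _ in range(2):                 # two failures trip the breaker
    try:
        breaker.call(flaky, now=0.0)
    except IOError:
        pass
try:
    breaker.call(flaky, now=1.0)   # rejected without touching the server
except RuntimeError as e:
    print(e)                       # circuit open: server isolated
```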
A failover mechanism refers to a feature that provides fault tolerance by automatically redirecting traffic from a failed component to a standby or redundant component. This ensures continuous service with minimum disruption during a failure. A failover system detects failures, switches to backup resources, and tries to recover automatically.
This is possible by continuously monitoring the health of services and components.
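The failover idea can be sketched as a router that consults a health check and sends traffic to the first healthy server, with standbys ordered after the primary (server names and the health-check function are hypothetical):

```python
class FailoverRouter:
    """Sketch of automatic failover: a health check picks the first
    healthy server, so traffic moves off a failed primary automatically."""
    def __init__(self, servers, health_check):
        self.servers = servers          # ordered: primary first, then standbys
        self.health_check = health_check

    def route(self, request):
        for server in self.servers:
            if self.health_check(server):   # continuous monitoring, simplified
                return server
        raise RuntimeError("all servers down")

healthy = {"primary": False, "standby": True}   # the primary has just failed
router = FailoverRouter(["primary", "standby"], lambda s: healthy[s])
print(router.route("GET /"))   # standby
```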
Quiz
You might think circuit breakers have a similar function to rate limiters. Why should we use one over the other?
To conclude, achieving the highest possible availability is challenging, but it is possible with the right techniques chosen during system design. Balancing load between servers, keeping backups for redundancy, limiting access to resources for fair usage, and implementing a failover mechanism backed by monitoring all help achieve this goal.