What is reliability. Reliability is the confidence we have on a component or a system that it will not fail. How to improve reliability. How to assess reliability.
What is Availability? Availability is the percentage of time a service was available. How to Measure Availability. How to Improve Availability
To build a non-stop IT infrastructure, capable of supporting demanding IT services in a dynamic DevOps environment, 4 self-contained principles are needed. These principles are: Abstraction, Redundacy, Automation, and Proactiveness.
Machines, People and Processes - When we refer to IT infrastructure, we tend to forget that it is much more than hardware. In fact, hardware you use are the least of your concerns. You buy the best hardware of the most shinny technology, plug it to power, connect to network and it is up and running. Is this enough? Definitely Not. If you look into the history of your problems ...