I'm trying to get a better understanding of what using RHEV/oVirt (or other OSS solutions) in an HA cluster actually achieves. I'd like to know how long a failover takes and what exactly happens during it, so I can judge whether this is an acceptable solution for different types of situations.
For example, what's the state of the system when it comes back: is it exactly where it left off, or is it as if the power had been pulled, so that it restarts as it would after a power outage (and thus possibly with inconsistent disk state)?
I know this is a bit of a vague ask, but are there best practices for running VMs in an HA configuration like this, given the above considerations? To a layperson coming in with little to no experience, it seems like any application could just be put on a VM and it would magically keep working if the primary VM host crashes and another VM host takes over. But it seems that's not really the case, and maybe there are some fundamental considerations that apply to most solutions.