After a bit of a scare with a server that wouldn't come up one morning, the higher-ups have decided that the business needs a high-availability / failover setup.
We have 5 main servers (4x Linux, 1x OpenBSD), all of which need to be running for the company to operate. Three of the servers are fairly standard (Files/Web/Database), the fourth handles most network routing and web proxies, while the fifth supports our phone system and has non-standard hardware.
My boss has stated that the turnaround time for recovering from a server failure should be under 30 minutes.
My experience in this field is non-existent (I'm just a programmer who was 'promoted'), so I guess my question really boils down to:
- Is this something that should even be attempted by someone with average server-admin skills? If so, what should I read, and who should I talk to?
Thanks.