Ping a Specific Port

Question

organicveggie

Asked: 2011-06-02 19:16:00 +0800 CST2011-06-02 19:16:00 +0800 CST 2011-06-02 19:16:00 +0800 CST

Alternatives to Heartbeat, Pacemaker and CoroSync?

772

Are there any major alternatives for automatic failover on Linux besides the typical Heartbeat/Pacemaker/CoroSync combinations? In particular, I'm setting up failover on EC2 instances, which only supports unicast - no multicast or broadcast. I'm specifically trying to handle the few pieces of software we have which don't already have automatic failover and don't support multi-master environments. This includes tools like HAProxy and Solr.

I have Heartbeat+Pacemaker working, but I'm not thrilled with it. Here are some of my issues:

Heartbeat - By itself, limited to two nodes. I'd like to have 3+.
Pacemaker - Impossible to configure automatically. Cluster has to be running with a quorum and then it still requires manual configuration.
CoroSync - Does not support unicast.

Pacemaker works very well, although it's power makes it difficult to setup. The real problem with Pacemaker is that there is no easy way to automate the configuration. I really want to launch an EC2 instance, install Chef/Puppet and have the entire cluster launch without my intervention.

9 Answers

Voted

JimB · Answer 1 · 2011-06-03T12:50:09+08:00

Best Answer

JimB

2011-06-03T12:50:09+08:002011-06-03T12:50:09+08:00

I prefer to use keepalived for high-availability. I find it simpler to setup (one daemon and config) than heartbeat and company. The only drawback I run into, is that keepalived doesn't have a unicast option by default, and only uses VRRP for communication (The author of HAProxy has written a unicast patch for keepalived however)

17

cyberx86 · Answer 2 · 2011-07-03T23:39:43+08:00

I am actually working on something very similar to what you described (a fail-over cluster on EC2), and after trying out Heartbeat, settled on Corosync as my messaging layer. Corosync will run on multiple servers and it does support Unicast (UDPU) as of version 1.3.0 (from Nov, 2010). I have setup and tested Corosync on Amazon's EC2 cloud (using Amazon's Linux AMI) and can confirm it works without issue.

A sample udpu file is installed to /etc/corosync.

Add one member block to the interface section for each node, and specify the transport as updu. (I have used the same port as heartbeat in the example below, but you can change it as desired).

e.g.:

totem {
        version: 2
        secauth: off
        interface {
                member {
                        memberaddr: 10.xxx.xxx.xxx
                }
                member {
                        memberaddr: 10.xxx.xxx.xxx
                }
                ringnumber: 0
                bindnetaddr: 10.xxx.xxx.xxx
                mcastport: 694
        }
        transport: udpu
}

(Heartbeat is supposed to support 3+ node clusters in versions 1.2.3+, although, I have never tried it personally, and don't know if it would work with Unicast).

Andrew Beekhof · Answer 3 · 2011-06-17T23:25:24+08:00

Andrew Beekhof

2011-06-17T23:25:24+08:002011-06-17T23:25:24+08:00

Sorry, but the part about Pacemaker is not true. The Pacemaker regression and release tests make extensive use of automation.

To configure without an active cluster, prefix all commands with CIB_file=/var/lib/heartbeat/crm/cib.xml or set it in your environment. Just be sure you remove the .sig file before starting the cluster.

For clusters without quorum, most if not all tools should support -f or --force which will instruct the cluster to accept the change anyway. If you find a tool that does not - please file a bug.

11

rthomson · Answer 4 · 2011-06-02T19:58:21+08:00

rthomson

2011-06-02T19:58:21+08:002011-06-02T19:58:21+08:00

In the open source world, there's RedHat Cluster Suite. It's been several years since I've implemented RHCS so I don't have many relevant things to say about it today.

Commercially, there is Veritas Cluster Server. No experience with it.

A much simpler and open source HA tool is UCARP. UCARP doesn't provide nearly the same kind of "infrastructure" that Heartbeat/Pacemaker/CoroSync does but you can build HA solutions around it.

You can also build highly available infrastructure with virtualization technologies but these solutions tend to focus on host-level availability as opposed to application level availability.

3

Kendall · Answer 5 · 2011-06-03T14:39:03+08:00

Kendall

2011-06-03T14:39:03+08:002011-06-03T14:39:03+08:00

There is Oracle Clusterware for Oracle Unbreakable Linux, though I've not used it.

1

manku · Answer 6 · 2011-06-26T09:14:57+08:00

manku

2011-06-26T09:14:57+08:002011-06-26T09:14:57+08:00

If you are already using EC2, why not use Elastic Load Balancing ? It will let you achieve application level availability without having to configure failover yourself.

1

Nils · Answer 7 · 2011-09-25T12:10:53+08:00

Nils

2011-09-25T12:10:53+08:002011-09-25T12:10:53+08:00

Veritas Cluster is great (compared to Linux-Heartbeat, AIX-hacmp, HP-Serviceguard and Sun cluster), but it costs lots of money. The last time I did look at it its price was based on cpu-cores of the cluster. Current Vendor ist Symantec...

1

Chaoxiang N · Answer 8 · 2018-11-07T12:46:38+08:00

Chaoxiang N

2018-11-07T12:46:38+08:002018-11-07T12:46:38+08:00

opensvc (https://www.opensvc.com) support multiple heartbeat drivers :

unicast
multicast
shared disk
3rd site relay

and also have quorum mecanisms in case of split brain.

I managed to automatically setup a 4 nodes cluster made of 2 google cloud instances + 2 amazon instances with terraform + ansible.

0

Luigi · Answer 9 · 2016-07-26T02:50:17+08:00

Luigi

2016-07-26T02:50:17+08:002016-07-26T02:50:17+08:00

I wrote a failover cluster manager in posix shell: https://github.com/nackstein/back-to-work

take a look at it, I'm looking for someone that want to try it and help in development.

-1

Alternatives to Heartbeat, Pacemaker and CoroSync?

Ping a Specific Port

Check if port is open or closed on a Linux server?

How to automate SSH login with password?

How do I tell Git for Windows where to find my private RSA key?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Resolve host name from IP address

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?