We have set up a 2-node cluster with a SAN. Our configuration is IBM HS22 blades in a BladeCenter, with a T3400 SAN box behind a SAN switch. I have tried RHEL 5.2, 5.3, and 5.4 with Cluster Suite. I can reboot the nodes using luci, I can fence both servers, and I can even relocate the services from node 1 to node 2.

The issue is this: if I run `clustat` on node 1, it shows all the services with node 1 as the owner. If I stop the network service on node 1, all services relocate to node 2 and node 1 powers off. When I reboot node 1, it rejoins the cluster, and at that point node 2 owns all the services. But if I then stop the network service on node 2, the services do not relocate back to node 1, and in /var/log I see "52 failed to changed RG status". Has anyone come across an issue like this? If so, what is the workaround?
Thank you so much, people, I got this working!
I don't have any direct experience with RH clustering but, from your description, it sounds like node 1 isn't re-joining the cluster correctly after you reboot it.
As a starting point, I'd check that all the appropriate services are set to start automatically on node 1, but before I do that, I'd clean up your question, as it's almost unreadable in its current form.
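As a concrete starting point, something like the following could be run on node 1 after a reboot. This is only a sketch: it assumes the standard RHEL 5 Cluster Suite service names (`cman`, `rgmanager`, `clvmd`, `ricci`), which may differ in your setup.

```
# Check whether the cluster daemons are enabled at boot
# (standard RHEL 5 Cluster Suite service names assumed)
chkconfig --list cman rgmanager clvmd ricci

# Enable the core daemons for the default runlevels if they are off
chkconfig cman on
chkconfig rgmanager on

# After the next reboot, confirm the node actually rejoined the cluster
clustat
cman_tool status
```

If `clustat` on node 1 still shows the node as offline after these services are enabled, the problem is likely elsewhere (fencing or quorum), but this at least rules out the obvious.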
There appears to be a bug (sort of) related to this over at RedHat's Bugzilla, too.
I bet I'll receive some downvotes for this, but my experience with RHCS is that it basically doesn't work at all. I tried and tried and tried to make a simple 3-node cluster work with ricci and luci and ended up just giving up. My searches turned up similar experiences and a common theme that RHCS is not ready for production deployments. I was sometimes able to join a couple of servers to the cluster, but as soon as I tried to join another node, it just failed with very little information in the logs.
I ended up moving to Pacemaker backed by a DRBD filesystem and found it more flexible; it just works. My advice is to use Pacemaker.
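For reference, a two-node Pacemaker/DRBD setup along those lines can be sketched in the `crm configure` shell. Everything here is hypothetical (the DRBD resource name `r0`, the device, the mount point, and the resource IDs are placeholders, not from my actual config):

```
# Minimal sketch for "crm configure", assuming a DRBD resource named r0
# and the ocf:linbit:drbd / ocf:heartbeat:Filesystem resource agents.
primitive drbd_r0 ocf:linbit:drbd \
    params drbd_resource=r0 \
    op monitor interval=30s
ms ms_drbd_r0 drbd_r0 \
    meta master-max=1 clone-max=2 notify=true
primitive fs_data ocf:heartbeat:Filesystem \
    params device=/dev/drbd0 directory=/data fstype=ext3
# The filesystem must run where DRBD is primary, and only after promotion.
colocation fs_on_master inf: fs_data ms_drbd_r0:Master
order fs_after_drbd inf: ms_drbd_r0:promote fs_data:start
```

The colocation and order constraints are the important part: they keep the filesystem mounted only on the DRBD primary, which is what gives you clean failover without split-brain mounts.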
If the network service goes down, the cluster node goes into an "unknown" state. The CS has no idea whether the host actually died or just became temporarily unresponsive. If you have a fence mechanism in place, you can fence the host, which also informs RHCS that the node really is down, so the services can be taken over by another node. If the services simply restarted elsewhere and the host then got its network back, you would have the same service running on both nodes, accessing the same files on the SAN and thus corrupting them.
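In cluster.conf terms, the fencing setup that makes this work looks roughly like the fragment below. This is only an illustration: the fence agent (`fence_ipmilan`), device name, IP, and credentials are assumptions, and your blades may use a different agent (e.g. one for the BladeCenter management module):

```
<!-- Sketch of a cluster.conf fencing section; agent, names,
     address, and credentials are placeholder assumptions -->
<clusternodes>
  <clusternode name="node1" nodeid="1">
    <fence>
      <method name="1">
        <device name="ipmi-node1"/>
      </method>
    </fence>
  </clusternode>
</clusternodes>
<fencedevices>
  <fencedevice agent="fence_ipmilan" name="ipmi-node1"
               ipaddr="10.0.0.11" login="admin" passwd="secret"/>
</fencedevices>
```

Without a working fence device per node, rgmanager will refuse to relocate services from a node it cannot confirm is dead, which matches the "failed to change RG status" symptom described in the question.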