Something broke and I lost the connection to the storage on the first server. The second server still had access to that FS. I tried to restart GFS by stopping and starting the lock_gulmd, gfs, pool and ccsd services (in various orders) but had no luck. On the master server (the third one), "gulm_tool nodelist localhost" says:
Name: srv1
state = Expired
mode = Slave
missed beats = 0
last beat = 0
delay avg = 0
max delay = 0
I found that it needs to be fenced. Automatically or manually? Can anyone help? At the moment none of the hosts is writing anything to the FS, so I presume no harm can be done. The second host is also expired at the moment and can't start lock_gulmd.
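In case it helps, this is roughly what I have been checking on the expired hosts (just the lock manager status and the syslog, nothing fancier than that):

gulm_tool nodelist localhost          (fails while lock_gulmd is down)
grep -i gulm /var/log/messages | tail -20

srv1 in the output above is the first server; the second one shows the same Expired state.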
If it hasn't already been automatically fenced, I would assume your fencing mechanism is not exactly working perfectly.
I suppose what one could do is reboot the expired hosts (either one by one or both at the same time) and inform the cluster that fencing has been successful with the fence_ack_manual tool. Doesn't this show up in your logs?
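From memory (the exact flag depends on your GFS/fence version, so check the fence_ack_manual man page and the message the fence daemon left in your logs), acknowledging the fence looks roughly like this, run on the node that requested the fence once you have power-cycled the expired host:

fence_ack_manual -n srv1             (newer versions take the node name)
fence_ack_manual -s <srv1's IP>      (older versions wanted the failed node's IP, iirc)

The IP form is just how I remember it, so trust the logged message over me.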
Running this tool (on the node that requested it, which is not the node that needed rebooting) allows the GFS filesystem and the faulty node to be recovered. The recovery mainly consists of the node becoming a proper cluster member again and the GFS journal being replayed if necessary, iirc.
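Afterwards you can check from the master, with the same command you used above, whether the node has rejoined; the state should no longer read Expired. Roughly (going from memory on the exact wording of the state field):

gulm_tool nodelist localhost
Name: srv1
state = Logged in
mode = Slave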
Honestly, the best way to clear GFS problems like this, especially when you're locked out of the filesystem anyway, is just to shut all the machines down and then start the cluster back up again. It was the most reliable and usually the quickest way of fixing these problems when I was wrangling lots of GFS filesystems.
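If you go that route, the order I remember for a GULM setup like yours (treat this as a sketch and defer to your own init scripts' chkconfig ordering) was to stop everything on every node first, then bring things back up starting with the lock server nodes so a master can be elected:

stop, on each node:
service gfs stop              (unmounts the GFS filesystems)
service lock_gulmd stop
service ccsd stop
service pool stop

start, lock server nodes first:
service pool start
service ccsd start
service lock_gulmd start
service gfs start

Whether ccsd really has to come after pool depends on where your CCS archive lives, so again, go by what the init scripts on your boxes say.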