Question

Roy

Asked: 2009-11-13 08:54:05 +0800 CST

Oracle VM 2.2 nodes rebooting for no obvious reason

I have a simple four-node Oracle VM environment: a management server running in VMware, an NFS server for shared storage, and two Oracle VM servers running the actual hypervisor.

The node running the pool master service will suddenly reboot for no obvious reason. I'm fairly sure it's a software issue, possibly a cluster watchdog of some sort. Just to be clear, it's the VM server/hypervisor that reboots, not the guest machines.

Has anyone seen similar issues, or does anyone have suggestions as to where I should start looking for the root cause?

I don't see anything suspicious in the /var/log/ovs*/ logs. Is there any other place I should look?

The documentation from Oracle leaves a little something to be desired.

3 Answers

lilott8 · Answer 1 · 2009-11-13T10:11:21+08:00

I'm not sure whether you have the graphs that come with the VM Management. If you do, they provide a decent amount of insight into what the memory, CPU, and disks are doing. Perhaps there is some correlation? From there you can start looking at top and ps to see exactly what is running, and in use, when the server bounces.

Also, can you put the servers into debug mode? Do they support that?

I hope this helps get you started at the very least.

1

Roy · Answer 2 · 2009-11-22T04:51:58+08:00

Best Answer

Turns out the nodes were not communicating correctly, due to the node hostname being listed on the loopback address in /etc/hosts. The cluster services would silently force a reboot to protect shared storage.
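
For anyone who wants to check their own nodes for the same misconfiguration, here is a minimal Python sketch. It is an illustration only, not an Oracle VM tool: the function names `loopback_names` and `hostname_on_loopback` are made up for this example, and it assumes the usual hosts-file layout of an address followed by one or more names.

```python
import ipaddress


def loopback_names(hosts_text):
    """Return the set of names that a hosts file maps to a loopback address."""
    names = set()
    for raw in hosts_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        fields = line.split()
        try:
            addr = ipaddress.ip_address(fields[0])
        except ValueError:
            continue  # first field is not an IP address; skip the line
        if addr.is_loopback:  # 127.0.0.0/8 or ::1
            names.update(name.lower() for name in fields[1:])
    return names


def hostname_on_loopback(hosts_text, hostname):
    """True if hostname, or its short form, is listed on a loopback address."""
    host = hostname.lower()
    candidates = {host, host.split(".", 1)[0]}
    return bool(candidates & loopback_names(hosts_text))
```

On each node, pass in the contents of /etc/hosts and the node's own hostname (for example from `socket.gethostname()`). A result of `True` means the hostname is listed on a loopback address, which is the situation described above.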

1

Ronald · Answer 3 · 2010-08-13T09:17:08+08:00

Are you using OCFS2? If so, increase the OCFS2 timeout in /etc/sysconfig/o2cb.conf.

0
