I've got server A and server B. B acts as an NFS server; A mounts from B.
Both are running on EC2.
Sometimes I have to shut down B and start a new, identical instance. After B is back up, trying to do anything inside the mounted directory on A (ls, for example) just hangs.
I'm trying to set up a cron job that checks the status of the mount and remounts if anything is wrong.
Is there any way to check the status of a mount?
You can fork, have the child enter the directory, and then exit the child. Have the parent monitor the existence of the child process with a timeout. If you've got a stale mount, the child won't be able to exit and will stick around for a long time, so the timeout will occur in the parent. Have the parent kill -9 the child and try an unmount.
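A minimal sketch of that check in shell, using GNU coreutils `timeout` in place of the hand-rolled fork/wait/kill logic (`/mnt/data` is a placeholder for your mount point):

```shell
#!/bin/sh
# Probe a mount point with a hard deadline: `timeout` forks a child
# (`stat`) that tries to enter the mount; on a stale NFS mount the stat
# blocks and is sent SIGKILL after 5 seconds.

mount_alive() {
    # Returns 0 if the path answers within 5 seconds, non-zero otherwise.
    timeout -s KILL 5 stat -t "$1" > /dev/null 2>&1
}

# /mnt/data is a placeholder for your actual mount point.
if mount_alive /mnt/data; then
    echo "mount OK"
else
    echo "mount is stale or missing; try a lazy unmount (umount -l) and remount"
fi
```

A non-zero result from `mount_alive` means either the path is simply gone or the stat hit the deadline; either way the parent (cron, in your case) knows not to trust the mount.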
The problem you may experience, though, is that if any other process is using a file that's on the broken mount, then you won't be able to unmount it without first killing those processes. You can (often) discover whether any processes are using unavailable resources on a stale mount with lsof or fuser.
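For example (the mount point is a placeholder, and the probes themselves run under `timeout`, since fuser and lsof can block on a dead mount too):

```shell
#!/bin/sh
# Show which processes still hold files on the filesystem containing the
# given path, i.e. what would block an unmount. Uses fuser when available,
# falling back to lsof. /mnt/data below is a placeholder.

list_mount_users() {
    if command -v fuser > /dev/null 2>&1; then
        # -m: report every process using any file on the filesystem of "$1"
        timeout 5 fuser -vm "$1"
    elif command -v lsof > /dev/null 2>&1; then
        # +D: walk the directory tree looking for open files under "$1"
        timeout 5 lsof +D "$1"
    fi
}

# Example: list_mount_users /mnt/data
```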
I'd avoid auto-magically killing arbitrary processes though; send yourself a notification to investigate further manually.
To reduce the likelihood of this occurring, you may want to look into an automounter (autofs, for example), which mounts the volume only when a resource on the server is actually requested, and automatically unmounts it when it's no longer needed.
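A minimal autofs sketch of that setup, assuming the stock auto.master/map-file layout; all paths and the server name are placeholders:

```
# /etc/auto.master -- mount points under /mnt are managed on demand and
# unmounted after 60 idle seconds.
/mnt  /etc/auto.nfs  --timeout=60

# /etc/auto.nfs -- the key "data" becomes /mnt/data on first access.
data  -fstype=nfs,soft,intr  nfs-server:/export/data
```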
-- by the way, to make this more searchable, you may want to tag this with the words stale, stuck, nfs, and mount. This phenomenon is not specific to your usage of EC2.
I realised that when the NFS server reboots, it changes its IP address, so the existing mount no longer works.
Wrote this script, which checks whether the NFS host's current IP matches the IP used in the mount; if not, it unmounts and remounts. Might help someone in the future.
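The script itself isn't shown above, but a sketch of the described logic might look like the following, under the assumption that the active server address can be read from the addr= mount option in /proc/mounts; NFSHOST and MOUNTPOINT are placeholders:

```shell
#!/bin/sh
# Sketch: compare the IP the NFS hostname currently resolves to against
# the IP recorded for the active mount; remount on mismatch.

resolve_ip() {
    # Current IP the hostname resolves to (first answer only).
    getent hosts "$1" | awk '{print $1; exit}'
}

mounted_ip() {
    # IP recorded in the mount options; NFS entries in the mount table
    # carry it as "addr=x.x.x.x" in the fourth field.
    # $1: mount table file (normally /proc/mounts), $2: mount point.
    awk -v mp="$2" '$2 == mp {print $4}' "$1" | tr ',' '\n' | sed -n 's/^addr=//p'
}

NFSHOST="nfs-server.example.com"   # placeholder
MOUNTPOINT="/mnt/data"             # placeholder

current=$(resolve_ip "$NFSHOST")
active=$(mounted_ip /proc/mounts "$MOUNTPOINT")

if [ -n "$current" ] && [ "$current" != "$active" ]; then
    echo "server IP changed ($active -> $current), remounting $MOUNTPOINT"
    umount -l "$MOUNTPOINT" && mount "$MOUNTPOINT"
fi
```

A crontab entry to run such a check every few minutes could look like `*/5 * * * * /usr/local/bin/check_nfs_ip.sh` (the path is a placeholder).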