Ping a Specific Port

Question

Silvio Donnini

Asked: 2010-03-19 06:25:50 +0800 CST2010-03-19 06:25:50 +0800 CST 2010-03-19 06:25:50 +0800 CST

Lingering tcp connection in LISTEN state

772

My java application can sometimes be killed by an external script. This can be done either with SIGTERM or with SIGKILL.

The application is a server which receives many connections per second, and it can be killed while trying to serve them.

I would like to restart the application whenever it's killed, so I have prepared a script for that purpose.

The problem is that, once the app has been killed, the new application instance can't bind to the port used by the previous instance, because the "Address is already in use". The previous instance's process has been definitely terminated, anyway the offending listening port is still there, but it is assigned to bash (or sh on other machines).

Obviouly, my goal is to restart the application and let it bind successfully to the previous address.

I've tried waiting more than 200 seconds before restarting to no avail, anyway I can't afford to wait that much.

I've encountered this problem on all the machines I've ran the application (which is a jetty server with java 1.6).

Any suggestion is appreciated, thanks,

Silvio

EDIT Killing the jvm process is not the normal way I exit my application, this is used in case of problems (OutOfMemoryErrors) only. And I never need to kill it with SIGKILL, because SIGTERM always suffices, I resort to SIGKILL only in case SIGTERM fails, which has never happened. I'm working on a long term solution, meanwhile I have to keep my app running by applying stitches here and there.

EDIT To be more clear: this is the netstat -tunap | grep line I see before killing the process:

tcp6       0      0 :::8898       :::*        LISTEN      22709/java

and this is after killing the process

tcp6       0      0 :::8898       :::*        LISTEN      23665/sh

notice that the process with PID 22709 is killed and gone, but the port is still there (but locked by sh)

UPDATE after I kill my application, with netstat I can see a long list of pending connections in CLOSE_WAIT state, with my ip as destination. Also, I can see a sh process in state LISTEN listening on my port: when I kill it, a sleep process replaces it and listens on the same port: When I finally kill this sleep process, the port is released and I can restart successfully my server. That could be a solution to get my port released, but I fear that automatically killing processes in order to release a port is a bit risky

5 Answers

Voted

Jacek Konieczny · Answer 1 · 2010-03-19T06:55:47+08:00

Jacek Konieczny

2010-03-19T06:55:47+08:002010-03-19T06:55:47+08:00

The server still expects some packets from the clients after the listening sockets are closed and keeps the port assigned. The application may use SO_REUSEADDR socket option to allow immediate socket address reuse.

Here is an excerpt from my Linux ip(7) manual page:

A TCP local socket address that has been bound is unavailable for some time after closing, unless the SO_REUSEADDR flag has been set. Care should be taken when using this flag as it makes TCP less reliable.

The application or application server might have a configuration setting for using this socket option.

3

The Unix Janitor · Answer 2 · 2010-03-19T06:54:39+08:00

The Unix Janitor

2010-03-19T06:54:39+08:002010-03-19T06:54:39+08:00

Your not actually killing your java application, your actually killing your java virtual machine (jvm) instance which is in turn running your java application.

This is not the idea way of terminating your java process.

if your having to kill your jvm with kill -9 , the jvm wont be able clear up after itself thus leaving operating resources in limbo. :-(

Add some functionality to your app to make it exit gracefully. If you have no choice, then try to kill you jvm with -15 , it may help it clear up after itself.

If your java program really is hanging the jvm, then you need to get a debugger and squash those pests.

Killing a process and restarting it is a hack, but's not fix. You should only use SIGKILL if a process is not responding any other method.

I usually try

kill -15

then only kill -9 as a last resort.

and for fun...

http://www.youtube.com/watch?v=Fow7iUaKrq4

1

Scott Lundberg · Answer 3 · 2010-03-19T07:50:06+08:00

Best Answer

Scott Lundberg

2010-03-19T07:50:06+08:002010-03-19T07:50:06+08:00

Since you only do this manually, you may have to add another check.

netstat -p

and kill the pid associated with your open socket, even if it is bash or sh.

Also, you mentioned that most of the time SIGTERM works. If that's the case, your app should catch the SIGTERM and jump into some graceful exit code that RSTs all open connections and then closes the socket.

HTH

1

Insyte · Answer 4 · 2010-03-19T07:51:00+08:00

Insyte

2010-03-19T07:51:00+08:002010-03-19T07:51:00+08:00

If you have access to the source code, you need to create the socket with the SO_REUSEADDR option mentioned by Jacek. Also of interest are the tcp_tw_recycle and tcp_tw_reuse kernel flags (on Linux).

The real problem is in the protocol design, which you may or may not be able to change. Interesting threads on the topic:

0

Jacek Konieczny · Answer 5 · 2010-03-19T12:12:15+08:00

Jacek Konieczny

2010-03-19T12:12:15+08:002010-03-19T12:12:15+08:00

With your update I have another explanation. The sh process keeping the socket open must be a child of your application, forked after the listening socket has been opened. It didn't die with its parent and was adopted by the init process.

You should try to find out what is that shell process for (probably some script started by your application) and why it is not terminating. Maybe it will be enough to fix the script so it terminates after finishing it job will be enough? Or there is a way to make it not detach from the parent (it should die with the parent if is a part of the same process group) or make it to close all the unneeded file descriptors inherited from the parent.

You may try:

fuser -p $pid_of_the_sh_process

to see what other files it keeps opened. One of these will be, most probably, the shell script. Knowing what it is we may find a way to fix the problem.

0

Lingering tcp connection in LISTEN state

Ping a Specific Port

How do I tell Git for Windows where to find my private RSA key?

How do you restart php-fpm?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Resolve host name from IP address

How can I sort du -h output by size

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?