
Question

King David

Asked: 2023-04-11 15:18:49 +0800 CST

RHEL: how to capture fresh kernel messages without rebooting the machine


Here is an example of dmesg output from an important production server (RHEL 7.2, Dell hardware). As we can see, the sde disk in the server is dying:

[Wed Jun 30 11:24:58 2021] sd 0:2:4:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[Wed Jun 30 11:26:18 2021] sd 0:2:4:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[Wed Jun 30 11:26:18 2021] sd 0:2:4:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[Wed Jun 30 11:27:28 2021] sd 0:2:4:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
[Wed Jun 30 11:27:46 2021] sd 0:2:4:0: [sde] tag#0 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE

What is interesting is that these messages are old, from 2021, and we have not seen any such messages in 2022 or 2023.

Based on these facts, I want to ask whether disk replacement should be considered on the strength of faulty-disk messages from 2021.

The second important question is how to capture new, fresh kernel messages with dmesg.

Is it possible to regenerate fresh kernel messages?

As far as I know, rebooting the machine might help with this, but I want to avoid a reboot.

1 Answer


HBruijn · Answer 1 · 2023-04-11T15:52:12+08:00

dmesg by default prints the messages from the kernel ring buffer.

A ring buffer is a special kind of buffer with a constant size: when new messages arrive, the oldest ones are removed. It is freshly instantiated at system boot, so what you're seeing are already the most recent kernel messages available.
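Because the buffer already holds the newest messages, a quick way to confirm that nothing recent has been logged for the disk is to filter the current output by device. The following is only a sketch: `count_disk_failures` is a hypothetical helper name, and it assumes the failure lines look like the ones quoted in the question (device name in square brackets, the word FAILED on the same line).

```shell
# Hypothetical helper: count FAILED lines for one disk in kernel log text
# read from stdin. The device name (e.g. sde) is the first argument.
count_disk_failures() {
    dev="$1"
    # -F matches the bracketed device name literally;
    # -c prints the number of matching lines instead of the lines themselves.
    grep -F "[$dev]" | grep -c "FAILED"
}

# Intended use on a live system (dmesg -T prints human-readable timestamps):
#   dmesg -T | count_disk_failures sde
# To inspect the newest matching lines and their dates instead of a count:
#   dmesg -T | grep -F "[sde]" | tail -n 5
```

If the newest matching lines still carry 2021 timestamps, the kernel has not logged this error for sde since then.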

When you see messages today that are almost two years old, on a legacy release such as RHEL 7.2, the first thing that comes to mind is: you haven't rebooted for close to two years and seemingly haven't done any maintenance on that server for even longer!

If your server is indeed from late 2015 or early 2016 (which is what the RHEL version suggests), then before anything else I would start by checking the integrity of your backups, your restore procedure and your disaster recovery plan, and possibly start planning for a replacement and upgrade.

If you want to check the disk health on a live system, you can try to read the S.M.A.R.T. data and/or initiate a S.M.A.R.T. self-test with smartctl:

sudo smartctl -i /dev/sde

To see an estimate of how long the various supported self tests will take:

sudo smartctl -c /dev/sde

And then, for example, start a short test:

sudo smartctl -t short /dev/sde
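Once the short test has had time to finish, its result can be read back with smartctl as well. A minimal sketch follows; `show_smart_results` is a hypothetical wrapper name, and it assumes the disk is directly visible to smartctl as /dev/sde (behind a hardware RAID controller an additional `-d` device-type option is usually needed, which is not shown here).

```shell
# Hypothetical wrapper: print the health verdict and the self-test log
# for the device path given as the first argument, e.g. /dev/sde.
show_smart_results() {
    dev="$1"
    # Overall S.M.A.R.T. health self-assessment reported by the drive
    sudo smartctl -H "$dev"
    # Log of past self-tests, including the short test once it has completed
    sudo smartctl -l selftest "$dev"
}

# Example call:
#   show_smart_results /dev/sde
```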
