Aaron Digulla's questions -server

Aaron Digulla

Asked: 2020-10-18 03:39:17 +0800 CST

How do I remove a dedicated (DHS) or global hot spare (GHS)?

0

I want to replace a hot spare with a new disk, for example to swap a smaller disk for a bigger one or because the manufacturer has recalled a series of disks.

How do I do that?

Aaron Digulla

Asked: 2019-02-17 12:17:09 +0800 CST

How do I remove a failing disk from an LSI MegaRAID disk group?

3

One of the disks in group 0 (EID:Slot 252:4, DiskID 12) is starting to fail its SMART tests:

  1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       1837
200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       57

but I can't find any documentation on how to remove disks from a disk group.

Do I have to

storcli /c0/e252/s4 set offline

or rather

storcli /c0/e252/s4 spindown

or both? What's the difference between "spindown" and "offline"? What about

storcli /c0/s4 set missing

What does that do? What does "missing" mean?

And how about the rebuild? Does that start automatically?

If not, then I guess the "start rebuild" command is my friend but why do I have to specify a single disk for that? It would make much more sense to specify the disk group or volume to rebuild, no?

Aaron Digulla

Asked: 2016-12-30 07:48:57 +0800 CST

LSI MegaRAID: How to convert between Dedicated Hot Spare (DHS) and Global Hot Spare (GHS)?

1

To see your drives, use storcli /c0 show all or storcli /c0/eXXX/sALL show (replace XXX with the enclosure ID or EID). In my case, the output looks like this:

Drive Information :
=================

--------------------------------------------------------------------------------
EID:Slt DID State DG     Size Intf Med SED PI SeSz Model                Sp Type 
--------------------------------------------------------------------------------
252:0    10 Onln   0 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68AX9N0 U  -    
252:1     9 Onln   0 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68AX9N0 U  -    
252:2    11 Onln   0 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -    
252:3     8 Onln   0 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -    
252:4    12 Onln   - 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -    
252:6    14 GHS    - 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -    
252:7    13 GHS    - 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -    
--------------------------------------------------------------------------------
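To pick the hot spares out of a listing like the one above from a script, a minimal parsing sketch could look like this. It assumes the column layout shown here (EID:Slt, DID, State, DG as the first four whitespace-separated fields); the function names and the `SAMPLE` constant are hypothetical, and the sample rows are copied from the table above.

```python
# Rows copied from the storcli listing above; header and rule lines are
# included to show that the parser skips them.
SAMPLE = """\
EID:Slt DID State DG     Size Intf Med SED PI SeSz Model                Sp Type
--------------------------------------------------------------------------------
252:0    10 Onln   0 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68AX9N0 U  -
252:4    12 Onln   - 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -
252:6    14 GHS    - 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -
252:7    13 GHS    - 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -
"""


def parse_drives(text):
    """Return one dict per drive row; assumes the column order shown above."""
    drives = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # blank lines, rules, section titles
        eid, sep, slot = fields[0].partition(":")
        if not (sep and eid.isdigit() and slot.isdigit()):
            continue  # the "EID:Slt" header row and anything else non-numeric
        drives.append({
            "eid": int(eid),
            "slot": int(slot),
            "did": int(fields[1]),
            "state": fields[2],
            "dg": fields[3],  # "-" when the drive is in no disk group
        })
    return drives


def hot_spares(drives):
    """Drives whose state is global (GHS) or dedicated (DHS) hot spare."""
    return [d for d in drives if d["state"] in ("GHS", "DHS")]


if __name__ == "__main__":
    for d in hot_spares(parse_drives(SAMPLE)):
        print(f"/c0/e{d['eid']}/s{d['slot']} is a {d['state']}")
```

The controller index `/c0` in the printed path is taken from the commands in the question; a different controller would need a different index.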

How do I convert GHS to DHS or the other way around?

Aaron Digulla

Asked: 2016-12-27 14:48:04 +0800 CST

How to fix LSI MegaRaid RAID5 after 1 disk failed

1

My LSI MegaRaid just told me one disk is "UBad" which I assume means it failed:

EID:Slt DID State DG     Size Intf Med SED PI SeSz Model                Sp Type 
--------------------------------------------------------------------------------
252:7    13 UBad   F 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 U  -

I have a hot spare installed:

EID:Slt DID State DG     Size Intf Med SED PI SeSz Model                Sp Type 
--------------------------------------------------------------------------------
252:6    14 DHS    0 2.728 TB SATA HDD N   N  512B WDC WD30EFRX-68EUZN0 D -

but the status of the hot spare didn't change. Is it being used to save my RAID array?

If not, how do I tell the controller to add the hot spare to the disk group 0?

Aaron Digulla

Asked: 2010-04-01 03:55:20 +0800 CST

What do I have to do to see the generated SQL with JBoss 6?

0

How can I make JBoss 6.0M2 show me the SQL that it generates and sends to the database?

Aaron Digulla

Asked: 2009-11-18 01:37:29 +0800 CST

How do you use Oracle's tools from scripts?

2

Most Oracle tools and scripts request that you pass them the password via the command line - where everyone on the same machine can see them. Example:

exp <user>/<password> ...

Is there a way to invoke those commands (like sqlplus, imp, exp) from a script without compromising security?
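One approach that is sometimes used for sqlplus, sketched here under assumptions: start SQL*Plus with `/nolog` so that no credentials appear in the argument list, and send the `connect` line over standard input instead. The function names are hypothetical, the password is assumed to contain no characters that need quoting, and this sketch covers sqlplus only, not imp or exp.

```python
import subprocess


def build_sqlplus_input(user, password, statements):
    """Build the text fed to SQL*Plus on stdin: connect, statements, exit."""
    lines = [f"connect {user}/{password}"]
    lines.extend(statements)
    lines.append("exit")
    return "\n".join(lines) + "\n"


def run_sqlplus(user, password, statements):
    """Run SQL*Plus without putting the password on the command line.

    "-S" suppresses the banner, "/nolog" starts without logging in; the
    credentials travel through the stdin pipe only.
    """
    return subprocess.run(
        ["sqlplus", "-S", "/nolog"],
        input=build_sqlplus_input(user, password, statements),
        capture_output=True,
        text=True,
        check=False,
    )
```

Where the script obtains the password (a file readable only by its owner, a prompt, and so on) is left open here; the sketch only shows keeping it out of the process arguments.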

Aaron Digulla

Asked: 2009-08-20 01:55:28 +0800 CST

Oracle: TRUNCATE sometimes takes ages

1

Usually, truncating a table takes 5-10 seconds. But when several people work on the same DB instance (but different tables), the operation can take more than an hour. How do I debug this?

Aaron Digulla

Asked: 2009-08-11 01:29:44 +0800 CST

poolmon: Which driver uses "Thre"?

0

One step further: On my machine, the pool tagged "Thre" grows by about 1 MB/day. Searching for "Thre" with findstr matches almost every *.sys file on my hard disk. Any ideas how I could reduce the number of possible culprits?

Aaron Digulla

Asked: 2009-08-10 23:13:29 +0800 CST

How to analyze the output of poolmon

1

I've read the KB articles about poolmon but they don't tell me how to analyze the numbers. My first guess is to look for drivers where the value in the column "Diff" is very high. Is that correct?

In my case, that would be these entries:

 Tag  Type    Allocs     Frees    Diff     Bytes  Per Alloc
 Ntfr Nonp   2690737   2528557  162180  10379976         64
 Ntfn Nonp   1397933   1304230   93703   3750928         40
 NtFs Nonp   2385330   2291634   93696   3749056         40
 File Nonp  13789939  13704656   85283  13203912        154

So that would mean the Ntfs driver has a memory leak which I doubt :) So what should I look for?
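To make the columns easier to compare, here is a minimal sketch that parses rows like the ones above, checks that Diff equals Allocs minus Frees, and ranks tags by bytes held. The function names are hypothetical, and the parser assumes seven whitespace-separated fields per row. A single snapshot only shows current usage; the `growth` helper assumes you have two snapshots taken some time apart and reports how much each tag changed between them.

```python
# Rows copied from the poolmon output above.
ROWS = """\
 Ntfr Nonp    2690737   2528557    162180 10379976        64
 Ntfn Nonp    1397933   1304230     93703 3750928         40
 NtFs Nonp    2385330   2291634     93696 3749056         40
 File Nonp   13789939  13704656     85283 13203912       154
"""


def parse_poolmon(text):
    """Parse rows of: Tag Type Allocs Frees Diff Bytes PerAlloc."""
    entries = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 7 or not all(x.isdigit() for x in f[2:]):
            continue  # header row, blank lines, anything unexpected
        allocs, frees, diff, nbytes, per_alloc = map(int, f[2:])
        entries.append({
            "tag": f[0], "pool": f[1], "allocs": allocs, "frees": frees,
            "diff": diff, "bytes": nbytes, "per_alloc": per_alloc,
        })
    return entries


def rank_by_bytes(entries):
    """Largest current memory holders first."""
    return sorted(entries, key=lambda e: e["bytes"], reverse=True)


def growth(before, after):
    """Change in bytes per (tag, pool) between two snapshots."""
    old = {(e["tag"], e["pool"]): e["bytes"] for e in before}
    return {
        (e["tag"], e["pool"]): e["bytes"] - old.get((e["tag"], e["pool"]), 0)
        for e in after
    }


if __name__ == "__main__":
    entries = parse_poolmon(ROWS)
    for e in rank_by_bytes(entries):
        # Diff is the number of outstanding allocations.
        assert e["diff"] == e["allocs"] - e["frees"]
        print(e["tag"], e["pool"], e["diff"], e["bytes"])
```

In the sample, "Diff" is simply Allocs minus Frees, and "Per Alloc" matches Bytes divided by Diff, so a large Diff on its own says how many allocations are outstanding, not whether that number keeps growing.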

Aaron Digulla

Asked: 2009-07-27 23:44:16 +0800 CST

Automatically capture the output of poolmon

1

I'm hunting for a memory pool leak using poolmon. The KB article explains how to capture the output manually using cut and paste. Isn't there a way to automate this?

Since the tool doesn't seem to support it, my idea was to run two command prompts (one for paged and one for nonpaged pools), and use a tool to make an automatic screenshot. If this was possible, which tool would you suggest? Is there a tool that can cut the text out of a command prompt without manual intervention?

Aaron Digulla

Asked: 2009-07-22 05:18:38 +0800 CST

Reading the output of perfmon

2

As per this answer, I've configured perfmon to show

Memory / Pages Input/sec
CPU / CPU Time (%)
Physical Disk / Average Queue length

(Names might be slightly different on an English version of Windows). Now I see these average values:

Memory: 74.613 (1.000)
CPU: 16.642 (1.000)
Disk: 0.160 (100.000)

How do I interpret these values? CPU is simple (16.6% usage).

But how about disk? Is that 16 requests every second? Or 0.16? Or 0.0016? That doesn't seem right; the LED is flashing madly.

And page faults: Is that 74 page loads/sec?

For the fun of it, I've added "Physical Disk / Bytes read/sec" and "Physical Disk / Bytes written/sec". Here I get 235478.228 and 30568.626 respectively with a factor of 0.0001. Does that translate to 235 MB/s read (implausible with a desktop hard disk) or 235 bytes/s? Again, the LED on the case indicates it must be much more.

Thanks a lot for clearing this up.

[EDIT] One thing which I figured out: The "factor" is to scale the value to be able to display it in the graph. The values below the graph (current, average, min, max) are absolute (or unscaled).
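As a small worked example of that first edit: if the numbers below the graph are unscaled and the factor only positions the line on the chart, the two byte counters can be read as follows. This is a sketch under the assumption that the period in the values is a decimal separator; the function name is hypothetical.

```python
def graph_value(raw, factor):
    """Height at which a counter is plotted: unscaled value times its factor."""
    return raw * factor


# Unscaled averages and display factors as quoted in the question.
counters = {
    "Bytes read/sec": (235478.228, 0.0001),
    "Bytes written/sec": (30568.626, 0.0001),
}

if __name__ == "__main__":
    for name, (raw, factor) in counters.items():
        plotted = graph_value(raw, factor)
        print(f"{name}: {raw:.3f} B/s ({raw / 1024:.1f} KiB/s), "
              f"plotted at {plotted:.2f}")
```

Read this way, the factor never changes the measured value; it only changes where the line is drawn on the graph.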

[EDIT2] Sorry, I mixed up the factors for memory and queue length.

[EDIT3] I'm on Windows XP/SP3.

And for those people who have been looking for the "Explain" button: click on "Add" (new indicator). In the dialog, there is an "Explain" button which tells you something about the currently selected indicator.

And a message to Microsoft: If you supply a list box to select one option out of a whole lot, make that widget a bit bigger, okay? Scrolling wastes valuable human CPU power.

Aaron Digulla

Asked: 2009-06-27 01:08:27 +0800 CST

Linux Gigabit driver that survives resume

2

Currently, I'm using the r8169 driver (Realtek 8169 gigabit ethernet). I tried both the chip on the mainboard and an external network card. When I boot my PC, the interface comes up with speed = 1000 and throughput is as expected.

When I resume after suspend to disk, the speed drops to 100. The driver doesn't support renegotiation or setting the speed with ethtool. Sometimes I can fix the issue by unloading the driver (rmmod r8169) and loading it again. But lately the chip doesn't come up completely: either the speed is 10 or "up" is false.

I'm sick of this issue. Can someone recommend a network driver (and a gigabit network card) that survives suspend/resume?
