I want to replace a hot spare with a new disk, for example to replace a smaller disk with a bigger one or when the manufacturer does a callback of a series of disks.
How do I do that?
I want to replace a hot spare with a new disk, for example to replace a smaller disk with a bigger one or when the manufacturer does a callback of a series of disks.
How do I do that?
One of the disks in group 0 (EID:Slot 252:4, DiskID 12) is starting to fail it's smart tests:
1 Raw_Read_Error_Rate 0x002f 200 200 051 Pre-fail Always - 1837
200 Multi_Zone_Error_Rate 0x0008 200 200 000 Old_age Offline - 57
but I can't find any documentation how to remove disks from a disk group.
Do I have to
storcli /c0/e252/s4 set offline
or rather
storcli /c0/e252/s4 spindown
or both? What's the difference between "spindown" and "offline"? What about
storcli /c0/s4 set missing
What does that do? What does "missing" mean?
And how about the rebuild? Does that start automatically?
If not, then I guess the "start rebuild" command is my friend but why do I have to specify a single disk for that? It would make much more sense to specify the disk group or volume to rebuild, no?
To see your drives, use storcli /c0 show all
or storcli /c0/eXXX/sALL show
(replace XXX
with the enclosure ID or EID). In my case, the output looks like this:
Drive Information :
=================
--------------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
--------------------------------------------------------------------------------
252:0 10 Onln 0 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68AX9N0 U -
252:1 9 Onln 0 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68AX9N0 U -
252:2 11 Onln 0 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
252:3 8 Onln 0 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
252:4 12 Onln - 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
252:6 14 GHS - 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
252:7 13 GHS - 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
--------------------------------------------------------------------------------
How do I convert GHS
to DHS
or the other way around?
My LSI MegaRaid just told me one disk is "UBad" which I assume means it failed:
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
--------------------------------------------------------------------------------
252:7 13 UBad F 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
I have a hot spare installed:
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
--------------------------------------------------------------------------------
252:6 14 DHS 0 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 D -
but the status of the hot spare didn't change. Is it being used to save my RAID array?
If not, how do I tell the controller to add the hot spare to the disk group 0?
Most Oracle tools and scripts request that you pass them the password via the command line - where everyone on the same machine can see them. Example:
exp <user>/<password> ...
Is there a way to invoke those commands (like sqlplus
, imp
, exp
) from a script without compromising security?
Usually, truncating a table takes 5-10 seconds. But when several people work on the same DB instance (but different tables), the operation can take more than an hour. How do I debug this?
One step further: On my machine, the pool tagged "Thre" grows about 1MB/day. Searching for "Thre" with findstr
returns about every *.sys
file on my harddisk. Any ideas how I could reduce the number of possible culprits?
I've read the KB articles about poolmon but they don't tell me how to analyze the numbers. My first guess is to look for drivers where the value in the column "Diff" is very high. Is that correct?
In my case, that would be these processes:
Tag Type Allocs Frees Diff Bytes Per Alloc
Ntfr Nonp 2690737 2528557 162180 10379976 64
Ntfn Nonp 1397933 1304230 93703 3750928 40
NtFs Nonp 2385330 2291634 93696 3749056 40
File Nonp 13789939 13704656 85283 13203912 154
So that would mean the Ntfs driver has a memory leak which I doubt :) So what should I look for?
I'm hunting of a memory pool leak using poolmon. In the KB article, they explain how to capture the output manually using cut&paste. Isn't there a way to automate this?
Since the tool doesn't seem to support it, my idea was to run two command prompts (one for paged and one for nonpaged pools), and use a tool to make an automatic screenshot. If this was possible, which tool would you suggest? Is there a tool that can cut the text out of a command prompt without manual intervention?
As per this answer, I've configured perfmon to show
(Names might be slightly different on an English version of Windows). Now I see these average values:
How do I interpret these values? CPU is simple (16.6% usage).
But how about disk? Is that 16 requests every second? Or 0.16? Or 0.0016? That doesn't seem right; the LED is flashing madly.
And page faults: Is that 74 page loads/sec?
For the fun of it, I've added "Physical Disk / Bytes read/sec" and "Physical Disk / Bytes written/sec". Here I get 235478.228 and 30568.626 respectively with a factor of 0.0001. Does that translate to 235MB/s read (implausible with a desktop harddisk) or 235 Bytes/s? Again the LED on the case indicates it must be much more.
Thanks a lot for clearing this up.
[EDIT] One thing which I figured out: The "factor" is to scale the value to be able to display it in the graph. The values below the graph (current, average, min, max) are absolute (or unscaled).
[EDIT2] Sorry, I mixed up the factors for memory and queue length.
[EDIT3] I'm on Windows XP/SP3.
And for those people who have been looking for the "Explain" button: 1. Click on "Add" (new indicator). In the dialog, there is an "Explain" button which tells you something about the currently selected indicator.
And a message to MicroSoft: If you supply a list box to select one option out of a whole lot, make that widget a bit bigger, okay? Scrolling wastes valuable human CPU power.
Currently, I'm using the r8169 driver (Realtek 8169 gigabit ethernet). I tried both the chips on the mainboard and with an external network card. When I boot my PC, the machine comes up with speed = 1000 and speed is as expected.
When I resume after suspend to disk, the speed drops to 100. The driver doesn't support renegotiation or setting the speed with ethtool. Sometimes, I can fix the issue by rmmod r8169
the driver and loading it again. But lately, the chip doesn't come up completely, either the speed is 10 or "up" is false.
I'm sick of this issue. Can someone recommend a network driver (and a gigabit network card) that survives suspend/resume?