I had Ubuntu 10.04 Server running on a software RAID 0. Yesterday I left it running continuously for 10 hours, and when I came back the computer was acting strange. I could not shut it down; it kept saying "Bus error" or something similar, so I forced a shutdown by holding the power button for 4 seconds. Then I turned it back on, and here came the disaster: the RAID was broken. The system kept dumping out "Failed command: READ DMA EXT".

I tried to run fsck.ext4 /dev/md0 from the Alternate CD rescue mode, but fsck.ext4 said: "Attempt to read block from filesystem resulted in short read". So I booted Hiren's BootCD, ran its hard drive scanner, and found 12 bad sectors on the second hard drive (near the very end of the drive, more than 80% in from the beginning, as I recall). I told the software to fix the 12 bad sectors, but I doubt Ubuntu understands the fix.
I ran the Alternate CD rescue mode again and did e2fsck /dev/sda, but it said the device or resource was busy.
God and geeks, how can 12 bad sectors mess up my whole RAID? What should I do to get my RAID and Ubuntu working again?
P.S. Once I get things working again, I'll switch to RAID 5. I swear.
RAID 0 has no redundancy, so errors will break the entire array. Are you confusing it with RAID 1 (mirrored)?
Can you tell us how your RAID 0 array was set up? I had the impression that it consists of 2 physical drives:
/dev/sda + /dev/sdb
and that the resulting device is /dev/md0. Now you are talking about /dev/md1. Does
/dev/md0 = /dev/sda1 + /dev/sdb1
and
/dev/md1 = /dev/sda2 + /dev/sdb2?
If so, how do you expect to repair the md0 filesystem (which is spread across two devices/partitions) when you run the check on only one of those devices? This is RAID 0, not 1. -> Is it the same "Superblock invalid" error?
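You can check the actual layout from the rescue shell, assuming mdadm is available there and the device names above, with:

cat /proc/mdstat
mdadm --detail /dev/md0
mdadm --examine /dev/sda1 /dev/sdb1

which shows which partitions belong to which md device.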
This error message appears because your RAID daemon is still running. On RHEL/CentOS you can stop the RAID service/daemon with the command:
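(A sketch, assuming the stock RHEL/CentOS md monitoring service and an array named /dev/md0; adjust the names for your system.)

service mdmonitor stop
mdadm --stop /dev/md0

The mdadm --stop line stops the assembled array itself, which is what actually releases the member disks so they are no longer reported as busy.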
After stopping the RAID, check the file system using fsck -fyC /dev/sda.
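If the filesystem actually lives on the striped md device rather than on a single disk (as the comments above suggest), one option, assuming the members are /dev/sda1 and /dev/sdb1, is to reassemble the array and run the check on it instead:

mdadm --assemble /dev/md0 /dev/sda1 /dev/sdb1
fsck -fyC /dev/md0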