A recent issue with a Buffalo TeraStation NAS here in my office has got me investigating Raid 5.
I've found a few different articles talking about the unsuitability of using raid 5 in large arrays, or with large disks
Here is one example article that talks about problems with rebuilding an array with large consumer drives.
I'm trying to work out what counts as 'large'?
The NAS we have here is a 4 drive Raid 5 setup, each drive is 1 TB. A drive failed and has been replaced, the array is currently rebuilding.
Does this setup constitute as large, in terms of will likely have a problem during the rebuild?
How reliable is this setup for day to day use?
Designing the reliability of a disk array:
1 - ( 1 - [Drive Size] x [URE Rate]) ^ [Data Drives‡] = [Risk]
For arrays with more than one disk of parity or mirrors with more than a pair of disks in the mirror, change the
1
after the Drives in Array to the number of disks with parity/mirror.So I've got a set of four 1TB WD Green drives in an array. They have a URE Rate of 1/10^14. And I use them in as scratch storage.
1 - (1 - 1TB x 1/10^14byte) ^ 3
=>3.3%
risk of failure rebuilding the array after one drive dies. These are great for storing my junk, but I'm not putting critical data on there.†Determining acceptable failure is a long and complicated process. It can be summarizes as
Budget = Risk * Cost
. So if a failure is going to cost $100, and has a 10% chance of happening then you should have a budget of $10 to prevent it. This grossly simplifies the task of determining the risk, the costs of various failures, and the nature of potential prevention techniques - but you get the idea. ‡[Data Drives] = [Total Drives] - [Parity Drives]
. A two disk mirror (RAID1) and RAID5 has 1 parity drive. A three disk mirror (RAID1) and RAID6 has 2 parity drives. It's possible to have more parity drives with RAID1 and/or custom schemes, but atypical.This statistical equation does come with it's caveats however:
The reason that article exists is to draw attention to Unrecoverable Bit Error Rates on HDDs. Specifically, your cheap 'home PC' disks. They typically have a factory spec of 1 / 10^14. This is about 12.5TB of data, which if you are doing a RAID-5 with 2TB disks ... you hit quite quickly.
This means you should either:
I would suggest typically that RAID-6 is the way forwards generally, but it'll cost you performance.