BLUF: How does VMWare's implementation of RAID5/6 work?
Below is VMWare's chart for RAID5/6 in vSAN:
It seems VMWare is altering RAID's terminology. When I hear RAID6 I think two parity of something; usually two parity drives and subsequently we can tolerate two failures. I'm having trouble making sense of this chart for a couple of reasons:
- What is the difference in row 2/4 between RAID5 and 6? They seem to be identical.
- Why is RAID 1 (mirroring) listed twice, but with different failures to tolerate and different capacity required? It seems to follow that additional copies of the data must be made, but if that's the case then it wouldn't be RAID1 (at least not as I understand RAID1)
- How is it that they have a RAID6 that only has a single failure to tolerate
As I wrote the above question it dawned on me that what the might mean is that they are running a RAID6 with VMWare's FTT set to 1. However, if that is the case how does that work? How are they getting data size 100GBs with 133GBs required?
Row #1 describes two-way mirroring or traditional RAID1, where your data are duplicated to another disk. Writing 100 GB means using 200 GB in total. You can tolerate one disk failure for each mirror.
Row #2 is about traditional 4-disk RAID5, where you lose one disk worth of capacity (ie: 100 GB written are expanded to 133 GB due to parity). Again, you can lose a single disk for each RAID5 set.
Row #3 describes triple-way mirroring or RAID1E. Writing 100 GB really means 300 GB are going to be written, but you can lose two disks for each mirror.
Row #4 really is about 6-disk RAID6 (including RAID5 here is a simplification, the numbers are going to differ), where 100 GB expands to 150 GB due to double parity. Again, you can lose up to two disk for each array.
Short answer: the chart in my OP is wrong.
See this page for a much better explanation.
I am marking @shodanshoc’s answer correct as their explanation is on point. Basically ignore RAID6 in row 2 and RAID5 in row 4.