For 3 years we had an LSI SAN with 48, 300GB Segate Cheetah 15K.5 (Model ST3300655FC) 3.5-inch drives. There are about 7 drives failed total. There bulk failed recently. Six drives since May 2010.
That's at a rate of 0.02 (drives failed)/(month)/(drives in array) for the last 6 month period.
There is an older SAN from HP running in the same room, I the drives are probably 15K 36 GB. Those never failed.
Is it common that 300GB 15K RPM drives start failing at this rate after 3 years?
No, this is not a common problem, but one I've seen every now and again. It's likely that the Cheetah 15ks you're running are all from the same "batch" in manufacturing and as such, all have the same faults. This is why you're seeing many of them fail in such a short amount of time.
I'd recommend contacting your SAN vendor and insisting that they make this right.
That doesn't sound excessively high to me but as you're concerned here are a few things to check:
Well there is the MTBF inverted bell-curve failure issue that you'd expect to see, so after three years if you start seeing one or two disks failing then I'd be expecting a few more soon after as you've seen. It then usually settles down into a series of small peaks and troughs of reliability - best to just ensure you've got some spares close to hand. As for why the HP isn't failing, well the physics are the same, I'd imagine their behaviour is just influenced by their usage patterns but when they do start failing it's likely to be in the same manner.