I'm in a evaluation phase of some hw that could be used for setting up a hadoop cluster. This hw is refurbished (hp G6 servers w/ Smart Array 410/i controller) and probably we should/must use it... we haven't it yet. I've read that 410/i controller doesn't allow a JBOD configuration as required by hadoop, is there anybody that could confirm this statement ? Or not ? Seems that a workaround could be to configure raid0 on every single disk, again anybody confirms it or there is other ways of configuration ? Thank you Rob
The Smart Array 410/i controller only allows 2 raid devices to be created without the optional memory card added to it. So if your hardware only has 2 drives, then it would be fine; you can just make it 2 raid0's. If you have more then 2 drives you will probably have to purchase Part #462974-001 - 256MB Battery Backed Write Cache BBWC memory module. If you aren't sure, give it a try in the configuration utility, it will complain that there is a limit on the number of configured raid devices allowed if you need to purchase the memory addon.
I recently setup a ZFS server utilizing one of those and 12 disks in RAID0 mode for each and it performs fine. Since you can disable pretty much all features of the actual raid controller, its effectively the same as JBOD mode.
This situation is similar to what I encounter when designing hardware for ZFS storage solutions. ZFS is a software RAID and leverages the server's resources for disk management. Hardware RAID isn't recommended, and JBOD isn't really possible with Smart Array controllers. So when using ProLiant systems for ZFS, I replace the Smart Array controller with something that has a JBOD mode or a basic SAS/SATA HBA.
Typically, this will be an LSI SAS HBA (without RAID features). My preference is the LSI 9211-8i or 9205e.
See the recommendations at: ZFS SAS/SATA controller recommendations
While the Pxxx range doesn't allow direct JBOD configuration if you just create one array off all your disks and then create multiple RAID0 arrays of the equivalent sizes then you're pretty much there in terms of functional similarity. That said I'm also a little dubious of Hadoop's requirement, firstly there's no way it can actually tell what the real disk configuration as it's hidden by the BIOS and secondly because it seems overly prescriptive given how well thought of the HP controllers are, what's the problem with the array managing its disks?
You can configre RAID0 per available phisical disk.
Let's say if you've got 8 disks, then you end up having something like that (CentOS 6):