I have a backup application backed by many databases (one per client) in Microsoft SQL Server, storing files in the database as BLOBs.
We are looking to move this application to Amazon EC2, and want to have 3TB of disk available. The largest EBS volume you can create and attach to an instance is 1TB.
(Assume that the 3TB could be about 500 databases which grow to 6GB each.)
I see two ways of doing this:
- creating a striped RAID volume of these disks
- manually manage the databases, storing ⅓ on each disk
In the case of a RAID volume, the failure of any volume in the stripe would lose all the data. Amazon don't recommend software RAID 1 across EBS volumes, suggesting you manage durability with regular EBS snapshots (as this is handled outside your instance). If you managed the databases manually, then a disk failure would only impact the databases on that volume.
Managing the volumes manually means a lot of administrative overhead.
Is there a method where I can say "these three disks are the primary storage pool for SQL Server" and have it sort it out itself?
No you can't give SQL A pool of resources like that. I see three options.