We have a web service running on Amazon EC2. Currently we have some live user data stored on a single disk (EBS). We are considering moving to a RAID0 setup (we don't have to be concerned about the increased failure rate).
If we do this migration, what is the quickest (to minimize site unavailability) way to reliably transfer the user data to the RAID array?
One idea I had was to take a recent snapshot of the data, copy it over to the new RAID array, then when the site goes down for maintenance use rsync to copy only the changed data over. I'm not sure if this would actually save time or ensure data integrity though.
I use the exact approach you describe for migrating / copying data on EC2:
With the right options, rsync should ensure a close to exact copy from the file system level. It varies a little with Linux distro, but perhaps:
The cool thing about EC2 is that you can test your entire migration plan on temporary volumes and temporary instances before you do it for real.