I recently carried out a recovery test on a Google Compute Engine VM with a Centos 7 OS. The VM has five partitioned disks managed by LVM. The server has been around for a while so there are about ten file systems distributed over the disks. The snapshots were all created within one or two seconds of each other in the early hours of the morning.
It would appear that the recovery went well, df
output looks like what I would expect.
Was I just lucky?
I would normally expect to have trouble when restoring from ostensibly unsynchronized snapshots.
I'm wondering: do I need to use a "proper" logical-volume-aware backup system to ensure the consistency of restored file systems? Or failing that, should I ensure that each file systems is on its own single disk? Or am I worrying unnecessarily?
GCP recommends to have each file system on it's own single disk.
From the standard sysadmin perspective it can be seen as a waste of resources considering that there is a limit to the amount of disks you can attach to an instance, however it is easier to manage backups.
If I understand correctly you attached 5 individual disks to your instanced and proceeded to create PV,VG,LV and FS. Finally you took snapshots for each disk almost at the same time
It seems you did not make changes in your disks between the snapshots but I do not recommend this architecture for sensitive applications such as a DB as I would be worried about data consistency. I would not surprise me if you repeat the experiment with a longer time between the snapshots of your disks and experience data consistency issues.
I recommend you to take a look at snapshot best practices and scheduled snapshots