I want to back up to S3 and reduce the bandwidth used as much as possible. I've been looking at a few options, and librsync seems like the best fit for low-bandwidth remote backups.
I've been reading up on how librsync works, and it seems like the remote end needs to calculate checksums on the blocks of the files being compared (as does the local end). I assume S3 can't do those checksum calculations, since it only serves files.
I've also read that S3 doesn't support separating files into chunks. It can only offer the whole file or nothing.
If both (or either) of these statements are true, would librsync be essentially useless? Can someone shed some light on this for me?
Thanks.
I think librsync is just an implementation of the algorithm, and it can be used in more than one way. The "normal" usage pattern, as in the original rsync program, does expect the recipient to generate hashes remotely. Duplicity also uses librsync, but it pre-calculates the hashes itself, so it needs nothing from the remote end beyond plain file storage.
More info: http://en.wikipedia.org/wiki/Rsync
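To make that concrete, here is a rough, simplified sketch in plain Python (not librsync's actual API) of the signature/delta idea Duplicity relies on: the block hashes are computed and kept on the sending side, so the remote store only ever has to accept and return whole files. The block size, the MD5 digest, and the fixed block alignment are all illustrative assumptions; real librsync uses a rolling weak checksum plus a strong hash so it can match blocks at arbitrary offsets.

```python
import hashlib

BLOCK_SIZE = 4096  # illustrative; librsync picks its own block size


def make_signature(old_data: bytes) -> list[str]:
    """Hash each fixed-size block of the previous backup (the "signature")."""
    return [
        hashlib.md5(old_data[i:i + BLOCK_SIZE]).hexdigest()
        for i in range(0, len(old_data), BLOCK_SIZE)
    ]


def make_delta(new_data: bytes, signature: list[str]):
    """Compare the new file against the stored signature: unchanged blocks
    become tiny "copy" references, changed blocks become literal bytes."""
    delta = []
    for index, start in enumerate(range(0, len(new_data), BLOCK_SIZE)):
        block = new_data[start:start + BLOCK_SIZE]
        if index < len(signature) and hashlib.md5(block).hexdigest() == signature[index]:
            delta.append(("copy", index))      # reference to a block the remote already has
        else:
            delta.append(("literal", block))   # new data that must be uploaded
    return delta


if __name__ == "__main__":
    old = b"A" * 10_000                # previous backup contents
    new = b"A" * 8_192 + b"B" * 1_808  # current contents, mostly unchanged
    sig = make_signature(old)          # computed and stored by the backup tool, not by S3
    delta = make_delta(new, sig)       # only this small delta needs to go over the wire
    literal = sum(len(payload) for op, payload in delta if op == "literal")
    print(f"{len(delta)} blocks total, {literal} literal bytes to upload")
```

If I understand it correctly, Duplicity stores these signature files alongside its backup volumes, which is why the S3 end only needs to act as dumb storage.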
I'm still researching this as well, but at a minimum, if you use Duplicity, disabling SSL (--s3-unencrypted-connection) and increasing the --volsize parameter should help conserve bandwidth.
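For what it's worth, here is a hedged sketch of what that invocation could look like (wrapped in Python just to show the flags in context). The source path, bucket URL, and volume size are placeholders, and the exact URL scheme and supported options vary between Duplicity versions, so check your version's man page.

```python
import subprocess

cmd = [
    "duplicity",
    "--volsize", "250",             # example value in MB; bigger volumes mean fewer uploads
    "--s3-unencrypted-connection",  # skip SSL to the bucket, as suggested above
    "/home/user/data",                              # placeholder source directory
    "s3://s3.amazonaws.com/my-backup-bucket/data",  # placeholder target URL
]
subprocess.run(cmd, check=True)
```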