Ping a Specific Port

Question

monster

Asked: 2012-02-14 03:36:56 +0800 CST2012-02-14 03:36:56 +0800 CST 2012-02-14 03:36:56 +0800 CST

What is the fastest way to "clone" a file in Linux?

772

I would like to use an application API that is not "crash safe"; in other words, there is a high likelihood of the data file being corrupt and unreadable if the application crashes.

The file itself is a "metadata file" and should not get very big: few 100s of MB maximum.

What I want to do is:

Force the application to access the file in "direct mode" (no OS caching).
Pause updates at regular "checkpoint" intervals
Perform a flush() (some data probably got flushed automatically)
Now that I know the file is consistent, clone it.
If there is an "old clone" delete it.
Resume doing changes to the original file.
Loop.

Could I use a special-purpose file system that makes some kind of "zero copy" of the file, combined with copy-on-write of the modified sectors of the original file, to get the clone "almost free" (with minimum disk IO)?

Also, can I do the "clone" without having to fork a process? (I don't know if the Linux file API offers a "cp" system-call).

3 Answers

Voted

AndreasM · Answer 1 · 2012-02-14T05:04:48+08:00

Best Answer

AndreasM

2012-02-14T05:04:48+08:002012-02-14T05:04:48+08:00

You could use LVM snapshotting for this instead of cloning. If something goes wrong, just copy the file from the clone.

There is a libdevmapper/libdevmapper-event-lvm2snapshot which could be helpful in doing this programmatically (without a fork): http://sourceware.org/dm/

Edit:

If you can change your program here is another solution: https://stackoverflow.com/questions/1565177/can-i-do-a-copy-on-write-memcpy-in-linux

mmap() the file twice, once normally and once with MAP_PRIVATE.

This would avoid the externalities (esp performance) of lvm

6

ewwhite · Answer 2 · 2012-02-14T05:12:59+08:00

ewwhite

2012-02-14T05:12:59+08:002012-02-14T05:12:59+08:00

Here's a quick suggestion that won't involve LVM. Use R1Soft Hot Copy to take one or multiple point-in-time snapshot of the filesystem in question. See the tips page. It uses copy-on-write technology. This has been a solution to some similar questions here, but also applies to what you're looking to do.

4

poige · Answer 3 · 2012-04-30T22:35:40+08:00

poige

2012-04-30T22:35:40+08:002012-04-30T22:35:40+08:00

Btrfs × cp --reflink or snapshots
Nilfs — by design AFAIU
ZFS "on Linux" (some ppl say it works fine for them) — snapshots

3

What is the fastest way to "clone" a file in Linux?

Can you pass user/pass for HTTP Basic Authentication in URL parameters?

Ping a Specific Port

Check if port is open or closed on a Linux server?

How to automate SSH login with password?

How do I tell Git for Windows where to find my private RSA key?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?