I have a Linux server where I only store new files or rename directories and never edit files. It contains backups from other Linux servers.
Due to certain circumstances there are quite a few duplicate files, often with different names.
Is there a free Linux tool that periodically scans the filesystem, keeps a database of filenames, sizes, and perhaps SHA-1 checksums, and then identifies duplicates and replaces them with hardlinks?
Some tools that do this are listed at https://unix.stackexchange.com/questions/3037/is-there-an-easy-way-to-replace-duplicate-files-with-hardlinks. You could run one of them from a cron job.
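As a rough sketch (the rdfind binary path, the /srv/backups target and the log file are assumptions for illustration; rdfind's -makehardlinks option performs the hardlink replacement), a weekly cron entry could look like this:

```
# /etc/cron.d/dedup-backups (hypothetical file)
# Every Sunday at 03:30, replace duplicate files under /srv/backups with hardlinks.
30 3 * * 0  root  /usr/bin/rdfind -makehardlinks true /srv/backups >> /var/log/rdfind.log 2>&1
```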
You can use a deduplicating filesystem. There are two main choices on Linux: btrfs and ZFS.
With btrfs the drawback is that it is still not marked as stable and has no working fsck.
ZFS is not in the Linux kernel due to licensing issues, but there is a kernel module with support for most Linux distributions. ZFS also offers a kind of online fsck in the form of its scrub feature. You can have a look at the supported distributions on zfsonlinux.org.
Both have compression, deduplication and snapshotting features without the need for any additional userspace daemons, which makes them well suited for backup storage.
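For ZFS, a minimal sketch of how this fits together (the pool name tank, the dataset tank/backups and the snapshot name are assumptions for illustration):

```
# Create a dataset for the backups and turn on inline dedup and compression.
zfs create tank/backups
zfs set dedup=on tank/backups
zfs set compression=lz4 tank/backups

# Take a cheap read-only snapshot after each backup run.
zfs snapshot tank/backups@backup-2016-01-01

# "Online fsck": verify every block checksum in the pool and report errors.
zpool scrub tank
zpool status tank
```

Keep in mind that ZFS inline deduplication keeps its dedup table in memory, so it needs a fair amount of RAM relative to the amount of data being deduplicated.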