Stilez

Asked: 2017-03-13 00:51:02 +0800 CST2017-03-13 00:51:02 +0800 CST 2017-03-13 00:51:02 +0800 CST

Clarifying sizing of dedup table on ZFS when only some datasets are deduped

I am trying to be clear about how zfs dedup handles the case where some (but not all) datasets in a pool are deduped, from a dedup table/RAM impact perspective. I found this quote from the FreeBSD mailing list in 2012:

"Note that only file systems that you enabled dedup for will actually participate in dedup. File systems that have dedup=off won't go through the dedup."

As an example, suppose we have two zpools, A and B. Pool A has 4 datasets containing 21 TB of data:

Datasets #1 and #2 each contain 0.5 TB data with dedup on
Dataset #3 and #4 each contain 10 TB data with dedup off

Pool B has one dataset containing 1 TB data with dedup on.

It's clear that the dedup functionality applies to the entirety of each pool. What isn't clear, is whether the RAM impact of dedup is based only on the deduped datasets? In other words, all other things being equal, will the dedup table size and RAM impact be similar for pool A and pool B, or far larger for pool A than pool B?

I think the dedup table has to be similar for both (set poolwide but no impact on size from any non-deduped datasets), mainly because if it was much larger, it would be equivalent to forcing dedup on the whole pool not just specific datasets. However it isn't clear to me whether this is actually so.

Clarifying sizing of dedup table on ZFS when only some datasets are deduped

Can you pass user/pass for HTTP Basic Authentication in URL parameters?

Ping a Specific Port

Check if port is open or closed on a Linux server?

How to automate SSH login with password?

How do I tell Git for Windows where to find my private RSA key?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?

Clarifying sizing of dedup table on ZFS when only some datasets are deduped

0 Answers