Re: Query about proposed dedup patches and behaviours

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 15 January 2016 at 01:47, Duncan <1i5t5.duncan@xxxxxxx> wrote:
>
> Hugo should really explain as he was the one that said that, but upon
> looking into it, he found that while he was correct in a sense, his
> reasoning was a bit narrow, and autodefrag isn't snapshot aware in the
> wider context.
>
> Without attempting to explain his reasoning as I think I sort of
> understand it but not well enough to try to explain, autodefrag isn't
> snapshot aware and will break reflinks, but due to $reasons, autodefrag's
> damage to reflinking apparently isn't as bad as manual defrag.
>
> That's the best I can do to explain the situation.  In general,
> autodefrag remains bad for reflinks, but apparently not h***-bad, as
> manual defrag is.
>

As I recall it's something like autodefrag will break the reflink
pretty much to the same extent as if you just starting writing to each
instance.

http://article.gmane.org/gmane.comp.file-systems.btrfs/51441

Looking through the patches again I see that Qu has indeed already
looked to on disk hash rather than in memory so that relieves my
memory blooming concerns.

http://thread.gmane.org/gmane.comp.file-systems.btrfs/52215

It does appear that btrfs-progs is only being extended to enable or
disable dedup on a whole pool rather than to dedup X files

http://news.gmane.org/find-root.php?message_id=1452751070%2d2460%2d3%2dgit%2dsend%2demail%2dquwenruo%40cn.fujitsu.com

I suppose that one could in principle target a btrfs balance to
particular extents after enabling dedupe on the pool in order to try
and target particular files but that seems rather cumbersome, and if
wanting to dedup an entire pool then enabling the feature followed by
a full balance ought to do it.

So I see two things out of this:

1) A least a note in the man page (or command output as well
preferably) reminding that autodefrag will to an extent work against
dedupe (and it may be worth testing the effect of both enabled and if
poor preventing one whilst the other is there).

2) Qu is there any intention to be able to do btrfs dedup
/path1../pathN or is the intention for this work only to enable
in-band across an entire pool (less any files with the proposed
attribute changed to say nodedup)?

If there is no intention for 2 then the duperemove packaging is still
worthwhile to carry out.
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux