Re: [PATCH 1/2] btrfs-progs: Add support for btrfs-image + corrupt script fsck test case.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




-------- Original Message --------
Subject: Re: [PATCH 1/2] btrfs-progs: Add support for btrfs-image + corrupt script fsck test case.
From: Filipe David Manana <fdmanana@xxxxxxxxx>
To: Qu Wenruo <quwenruo@xxxxxxxxxxxxxx>
Date: 2014年12月16日 21:55
On Tue, Dec 16, 2014 at 12:58 AM, Qu Wenruo <quwenruo@xxxxxxxxxxxxxx> wrote:
-------- Original Message --------
Subject: Re: [PATCH 1/2] btrfs-progs: Add support for btrfs-image + corrupt
script fsck test case.
From: David Sterba <dsterba@xxxxxxx>
To: Filipe David Manana <fdmanana@xxxxxxxxx>
Date: 2014年12月16日 01:35
On Mon, Dec 15, 2014 at 09:36:51AM +0000, Filipe David Manana wrote:
So another thing I would like to see is doing a more comprehensive
verification that the repair code worked as expected. Currently we
only check that a readonly fsck, after running fsck --repair, returns
0.

For the improvements you've been doing, it's equally important to
verify that --repair recovered the inodes, links, etc to the
lost+found directory (or whatever is the directory's name).

So perhaps adding a verify.sh script to the tarball for example?
A verifier script would be good, but I'd rather not put it into the
tarball. We might want to edit it, do cleanups etc, this would require
to regenerate the image each time and the changes would be hard to
review.

We can use the base image name and add -verify.sh suffix instead, eg.
007-bad_root_items_fs_skinny.tar.xz and
007-bad_root_items_fs_skinny-verify.sh


I'd like to add verify script too, especially when it is put out of the
tarball.

But to the leaf-corruption case, it seems a little overkilled for me.

1) The object of leaf-corrupt recover is not to salvage data.
Although most of the patches are trying its best to salvage as much data as
possible ,
from ino to file type or even later extent data, but in fact, the patchset's
main object is to make the metadata
of the btrfs consistent. The data recovery is just a optional addition.
(Original, it's designed to delete every inode whose metadata is lost in a
corrupted leaf)
So the second btrfsck's return value instead of the contents in lost+found
is the important.

2) The recovery is *lossy*, verify would better be called on *lossless*
recovery
Leaf-corruption is based on the btree recovery, which will introduce data
loss(at least a leaf),
so we can't ensure anything.
And in some case, repair_inode_backref() will even repair backref before
nlink repair,
which may introduce some randomness
(if a inode_item is not corrupted in a leaf, then a backref maybe repaired
without move it to lost+found dir)
So for *lossy* repair, I prefer not to add verify script.
 From the moment we have code that accomplishes something, it doesn't
matter if it was part of a primary or secondary goal of a patch, nor
if it does full or partial recovery. If we have code that does
something (intentionally) we should always try to have tests for it -
if we don't care about what the code does exactly, then we probably
shouldn't have it in the first place.
First please let me make it clear when you mention the verify script, what it really means.
Which case in the leaf-corruption recovery do you mean?

1) Generic script verifying everything or part of the inodes in original image is recovered. If you mean this *GENERIC* one, that's impractical for leaf-corruption recovery and any other
lossy recovery.

2) Specific script only for this specific damaged image.
This one is suitable for lossy recovery case but it may be restrict verify script, it only tells what result it should be after recovery for the specific image. And it may be only a reminder
for new patches modifying the existing recovery codes.

If you mean 1), IMHO it's not practical.
If you mean 2), I am OK to implement the verify script but I doubt about the necessarily.

Thanks,
Qu
Otherwise code will break more easily with future changes. Having
manual tests done on each release (or ideally after each btrfs-progs
or fsck at least) is error prone...

I generally agree to add verify script support, but only for lossless
recovery case.

Thanks,
Qu



--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux