Re: Unrecoverable fs corruption?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Christoph Anton Mitterer posted on Sat, 02 Jan 2016 05:32:21 +0100 as
excerpted:

> On Fri, 2016-01-01 at 08:13 +0000, Duncan wrote:
>> you can also try a read-only scrub
> OT: I just wondered, would a balance include everything a scrub includes
> (i.e. read+verify all data and rebuild an errors on different devices /
> block copies)... of course in addition to also copying all "good"
> data... and perhaps with the difference, that you don't get that
> detailed information as in scrub but only the kernel log messages about
> errors?

AFAIK, no, at least not by design, as balance works at the chunk level, 
while scrub works inside chunks, verifying the checksums on each block.

But now that I think about it, balance does read the chunk in ordered to 
rewrite its contents, and that read, like all reads, should normally be 
checksum verified (except of course in the case of nodatasum, which nocow 
of course implies).  So a balance completed without error /may/ 
effectively indicate a scrub would complete without error as well.  But 
it wasn't specifically designed for that, and if it does so, it's only 
doing it because all reads are checksum verified, not because it's 
actually purposely doing a scrub.

And even if balance works to verify no checksum errors, I don't believe 
it would correct them or give you the detail on them that a scrub would.  
And if there is an error, it'd be a balance error, which might or might 
not actually be a scrub error.

>> In this case,
>> you'll need to recover from the degraded-mount working device as if the
>> second one had entirely failed.
>> 
>> What I'd do in this case, if you haven't done so already, is that read-
>> only btrfs scrub, just to see where you are in terms of corruption on
>> the remaining device.
> I don't think that this is the best order of the steps - at least not
> when it's about precious data.
> 
> Doing a scrub at this phase, would just read all data, telling you the
> status,... but first you should try to copy as much as possible (just in
> case the remaining good drive fails as well) and *then* do the scrub to
> see what's actually good or not.

Good point.

-- 
Duncan - List replies preferred.   No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master."  Richard Stallman

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux