Re: Chicken-egg: uncorrectable checksum error prevents RAID1 rebalancing

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sun, Jan 24, 2016 at 10:52 AM, Tom Hunt <tomdicksonhunt@xxxxxxxxx> wrote:
> I've been running for a week or two using a single-drive 6TB btrfs
> volume. For some of this time, the machine running had bad memory,
> which led to various checksum errors. For most of these, I just
> deleted the relevant file and reacquired it (the errors fortuitously
> never occurring in files which were not easily replaceable). However,
> there currently remains a single error which does not appear to be in
> any file:
>
> # btrfs scrub status /
> scrub status for 85f5b744-f68c-4194-aa90-d6fe238115a3
>       scrub started at Fri Jan 22 09:49:02 2016 and finished after 11:55:08
>       total bytes scrubbed: 4.27TiB with 1 errors
>       error details: csum=1
>       corrected errors: 0, uncorrectable errors: 1, unverified errors: 0
>
> # dmesg
> (...)
> [52841.310422] BTRFS warning (device dm-0): csum failed ino 515 off 15118336 csum 2629660496 expected csum 54021641
> [52841.335656] BTRFS warning (device dm-0): csum failed ino 515 off 15118336 csum 2629660496 expected csum 54021641
> [95071.256448] BTRFS: bdev /dev/mapper/rootvol_1 errs: wr 0, rd 0, flush 0, corrupt 11, gen 0
> [95071.256532] BTRFS: unable to fixup (regular) error at logical 4450167468032 on dev /dev/mapper/rootvol_1
>
> I've searched for ino 515, and the file there does not have any
> apparent error (can read the whole thing without problem; deleting and
> recreating it does not make the error go away). The error is, of
> course, uncorrectable, because it's a single-drive volume. However,
> having put in a second drive, the balance filter to convert to raid1
> fails because of the I/O error.

You delete the file and yet the scrub still says inode 515 exists and
has an error? Or there are no errors, but then after copying the same
file back to the volume, the problem reoccurs? Are there any snapshots
or subvolumes? Because if there are any subvolumes/snapshots, each is
its own fs tree with its own set of inodes. So an inode can be used
more than once for different files so I wonder off hand if you haven't
found the actual problematic file.

Or possibly it's a directory, and not a file.

# find /brick2 -inum 60724

On my system, it returns six results, four are files, two of which are
in common to the others due to snapshotting, and two are directories
one of which is also a snapshot of the other. So a single inode can
not only appear multiple times on a Btrfs volume, but can be pointing
to a file or a directory. The scrub not saying what the file path is
suggests it could be a directory.


-- 
Chris Murphy
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux