Re: btrfs dev del not transaction protected?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




On 2019/12/20 下午9:27, Marc Lehmann wrote:
>>>  [  102.393185] BTRFS: device label COLD1 devid 5 transid 1876906 /dev/dm-30
>>
>> dm-30 is one transaction older than other devices.
>>
>> Is that expected? If not, it may explain why we got the dead device. As
>> we're using older superblock, which may points to older chunk tree which
>> has the device item.
> 
> Well, not that my expectation here would mean anything, but no, from
> experience I have never seen the transids to disagree, or bad thingsa will
> happen...
> 
>>>  [  109.626550] BTRFS: device label COLD1 devid 4 transid 1876907 /dev/dm-32
>>>  [  109.654401] BTRFS: device label COLD1 devid 3 transid 1876907 /dev/dm-31
>>
>> And I'm also curious about the 7s delay between devid5 and devid 3/4
>> detection.
> 
> That is about the time it takes the disk to wake up when its spinned down,
> so maybe that was the case - the disks are used for archiving ("cold"
> storage), have a short spin-down and btrfs filesystems can takes ages to
> mount. The real question is why the fortuh disk was already spun up then,
> but the disks do not apply time outs very exactly.
> 
>> Can you find a way to make devid 3/4 show up before devid 5 and try again?
> 
> Unfortunately, I had to start restoring from backup a while ago, as I need
> the machine up and restoring takes days.
> 
> How would I go about making it show up in different orders though? If
> these messages come up independently, I could have spun down some of the
> disks, right?

You could utilize the latest "forget" feature, to make btrfs kernel
module forget that device, provided by "btrfs device scan -u".

So the plan would be something like:
- Forget all devices of that volume
- Scan the two disks with higher transid
- Scan the disk with mismatched transid

Then try to mount the volume.

> 
>> And if you find a way to mount the volume RW, please write a single
>> empty file, and sync the fs, then umount the fs, ensure "btrfs ins
>> dump-super" gives the same transid of all 3 related disks.
> 
> I tried -o degraded followed by remounting rw, but couldn't get it to
> mount rw. I tried to mount/remount, though:
> 
>    04:48:45 doom kernel: BTRFS error (device dm-32): devid 1 uuid f5c3dc63-1fac-45b3-b9ba-ed1ec5f92403 is missing
>    04:48:45 doom kernel: BTRFS error (device dm-32): failed to read chunk tree: -2
>    04:48:45 doom kernel: BTRFS error (device dm-32): open_ctree failed
>    04:49:37 doom kernel: BTRFS warning (device dm-31): devid 1 uuid f5c3dc63-1fac-45b3-b9ba-ed1ec5f92403 is missing
>    04:52:30 doom kernel: BTRFS warning (device dm-31): chunk 12582912 missing 1 devices, max tolerance is 0 for writable mount

BTW, that chunk number is very small, and since it has 0 tolerance, it
looks like to be SINGLE chunk.

In that case, it looks like a temporary chunk from older mkfs, and it
should contain no data/metadata at all, thus brings no data loss.

>    04:52:30 doom kernel: BTRFS warning (device dm-31): writable mount is not allowed due to too many missing devices
>    04:52:30 doom kernel: BTRFS error (device dm-31): open_ctree failed
>    04:54:01 doom kernel: BTRFS warning (device dm-32): devid 1 uuid f5c3dc63-1fac-45b3-b9ba-ed1ec5f92403 is missing
>    04:54:45 doom kernel: BTRFS warning (device dm-32): chunk 12582912 missing 1 devices, max tolerance is 0 for writable mount
>    04:54:45 doom kernel: BTRFS warning (device dm-32): too many missing devices, writable remount is not allowed
> 
> Since (in theory :) the filesystemw a completely backed up, I didn't
> bother with further recovery after I made sure the physical disk is
> actually there and was unlocked (cryptsetup), so it wasn't a case of an
> actual missing disk.

BTW, "btrfs ins dump-tree -t chunk <dev>" would help a lot.
That would directly tell us if the devid 1 device is in chunk tree.

If passing different <dev> would cause different output, please also
provide all different versions.

> 
>>> BTW, this (second issue) also happens with filesystems that are not
>>> multi-device.
>>
>> Single device btrfs doesn't need device scan.
>> If that happened, something insane happened again...
>> Thanks,
> 
> It happens since at least 4.14 on at least four machines, but I haven't
> seen it recently, after I switched to 5.2.21 one some machines (post-4.4
> kernels have this habit of freezing under memory pressure, and 5.2.21 has
> greatly improved in this regard). That also means I had far fewer hard
> resets with 5.2.21, but the problem did not happen on the last resets in
> 5.2.21 and 5.4.5.
> 
> I originally reported it below, with some evidence that it isn't a
> hardware issue (no reset needed, just wipe the dm table while the device
> is mounted which should cleanly "cut off" the write stream):
> 
> https://bugzilla.kernel.org/show_bug.cgi?id=204083
> 
> Since multiple scrubs and full reads of the volumes didn't show up any
> issues, I didn't think much of it.
> 
> And if you want to hear more "insane" things, after I hard-reset
> my desktop machine (5.2.21) two days ago I had to "btrfs rescue
> fix-device-size" to be able to mount (can't find the kernel error atm.).

Consider all these insane things, I tend to believe there is some
FUA/FLUSH related hardware problem.
E.g. the HDD/SSD controller reports FUA/FLUSH finished way before it
really write data into the disk or non-volatile cache, or the
non-volatile cache recovery is not implemented properly...

Thanks,
Qu
> 
> Greetings,
> 

Attachment: signature.asc
Description: OpenPGP digital signature


[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux