Btrfs filesystem-fail observations and hints

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello,

at the weekend we had a disk-fail in a 5-disk BtrFS-RAID1
setup. Ideally one failing disk in a RAID1 setup should (at least
temporarily) degrade the filesystem and inform root about the
situation, but should let the rest of the system unaffected. That’s
not what happend. Processes accessing the filesystem hung
device-waiting and the filesystem itself “hung” too, producing lots of

  BTRFS: lost page write due to I/O error on /dev/sdd
  BTRFS: bdev /dev/sdd errs: wr …, rd …, flush 0, corrupt 0, gen 0

messages. Attempts to reboot the system regularly failed. Only after
physically removing the failed (hotplugable) disk from the system, it
was possible to reboot the system somewhat normal.

Afterwards, trying to get the system running again, the following
observation where made:

· “btrfs device delete missing”

  There seems to be no straight-forward way to monitor the progress of
  the “rebalancing” of the filesystem. It took about 6 hours and while
  it was possible to estimate the time of finish by watching “btrfs fi
  show” and extrapolating device-usagem, a method to monitor the
  progess like “btrfs balance status” would be fine. (“btrfs balance
  status” says “No balance found on …”)

· “btrfs fi df”

  During “btrfs device delete missing”-rebalance “btrfs fi df” does
  not reflect the current state of the filesystem. It says p.e.

    Data, RAID1: total=1.46TiB, used=1.46TiB
    Data, single: total=8.00MiB, used=0.00B

  while actually, depending of the advance of the rebalance, about 0
  to 300 GByte have only one copy on the devices. So p.e.

    Data, RAID1: total=1.1TiB, used=1.1TiB
    Data, single: total=290GiB, used=290GiB

  would be better reflecting the state of the system.

	MfG
	bmg

-- 
„Des is völlig wurscht, was heut beschlos- | M G Berberich
 sen wird: I bin sowieso dagegn!“          | berberic@xxxxxxxxxxxxxxxxx
(SPD-Stadtrat Kurt Schindler; Regensburg)  | www.fmi.uni-passau.de/~berberic
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux