On Tue, Jan 29, 2019 at 4:15 PM Hans van Kranenburg <Hans.van.Kranenburg@xxxxxxxxxx> wrote: > > Hi, > > Thought experiment time... > > I have an HP z820 workstation here (with ECC memory, yay!) and 4x250G > 10k SAS disks (and some spare disks). It's donated hardware, and I'm > going to use it to replace the current server in the office of a > non-profit organization (so it's not work stuff this time). > > The machine is going to run Debian/Xen and a few virtual machines > (current one also does, but the hardware is now really starting to fall > apart). > > I have been thinking a bit how to (re)organize disk storage in this > scenario. > > 1. Let's use btrfs everywhere. \:D/ > 2. For running Xen virtual machines, I prefer block devices on LVM. No > image files, no btrfs-on-btrfs etc... > 3. Oh, and there's also 1 MS Windows VM that will be in the mix. > > Obviously I can't start using multi-device btrfs in each and every > virtual machine (a big pile of horror when one disk dies or starts > misbehaving). > > So, what I was thinking of is: > > * Use dm-integrity on partitions on the individual disks > * Use mdadm RAID10 on top (which is then able to repair bitrot) > * Use LVM on top > * Etc... > > For all of the filesystems, I would be doing backups to a remote > location outside of the building with send/receive. > > The Windows VM will be an image file on a btrfs filesystem in the Xen > dom0. It's idle most of the time, and I think cow+autodefrag can easily > handle it. I'd like to be able to take snapshots of it which can be sent > to a remote location. I'd consider thinp LV's for VM's. They are way more efficient for snapshots than thickp (conventional) LVM snapshots. There is no command to compute, send/receive only the LVM extents that are changed though. And this includes for NTFS. In effect, you can shrink any LV without literally shrinking it, you just need to execute fstrim on the mounted volume (you can use discard mount option from inside each VM; or enable fstrim.timer), and this will cause unused LVM logical extents to be returned to the thin pool, which can then be used by any other LV that draws from that pool. It's been a couple years since I tested NTFS in a Raw file on Btrfs but at that time it was just pathological and I gave up. It was so slow. Btrfs on Raw image on Btrfs was way faster. You could also consider XFS on LVM or plain partition, with a qcow2 file as backing. Snapshots are supported by creating a new image that points to another as a backing file. And you can easily backup these snapshots as they are discrete files. > Now, to finally throw in the big question: If I use btrfs everywhere, > can I run dm-integrity without a journal? The documentation says if you run without a journal, dm-integrity is no longer crash safe, i.e. it's no longer atomic operation. That to me is the whole point of dm-integrity so I wouldn't do it even if I'm using Btrfs on top. > > As far as I can reason about.. I could. As long as there's no 'nocow' > happening, the only thing that needs to happen correctly is superblock > writes, right? Metadata is always cow. There is a nodatacow mount option but that doesn't affect metadata. But in either case the whole point of having dm-integrity in place is to have a Linux block device layer that tells you if there has been some kind of storage stack corruption; to make silent corruption visible. And that's not as effective if it's possible to get such corruption in the course of a crash or power failure of some kind. It might be useful to ask on @linux-integrity list. http://vger.kernel.org/vger-lists.html#linux-integrity -- Chris Murphy
