On Fri, Apr 1, 2016 at 3:40 PM, Marc Haber <mh+linux-btrfs@xxxxxxxxxxxx> wrote:
> Hi,
>
> just for a change, this is another btrfs on a different host. The host
> is also running Debian unstable with mainline kernels, the btrfs in
> question was created (not converted) in March 2015 with btrfs-tools
> 3.17. It is the root fs of my main work notebook which is under
> workstation load, with lots of snapshots being created and deleted.
>
> Balance immediately fails with ENOSPC
>
> balance -dprofiles=single -dusage=1 goes through "fine" ("had to
> relocate 0 out of 602 chunks")
>
> balance -dprofiles=single -dusage=2 also ENOSPCes immediately.
>
> [4/502]mh@swivel:~$ sudo btrfs fi usage /
> Overall:
> Device size: 600.00GiB
> Device allocated: 600.00GiB
> Device unallocated: 1.00MiB
> Device missing: 0.00B
> Used: 413.40GiB
> Free (estimated): 148.20GiB (min: 148.20GiB)
> Data ratio: 1.00
> Metadata ratio: 2.00
> Global reserve: 512.00MiB (used: 0.00B)
>
> Data,single: Size:553.93GiB, Used:405.73GiB
> /dev/mapper/swivelbtr 553.93GiB
>
> Metadata,DUP: Size:23.00GiB, Used:3.83GiB
> /dev/mapper/swivelbtr 46.00GiB
>
> System,DUP: Size:32.00MiB, Used:112.00KiB
> /dev/mapper/swivelbtr 64.00MiB
>
> Unallocated:
> /dev/mapper/swivelbtr 1.00MiB
> [5/503]mh@swivel:~$
>
> btrfs balance -mprofiles seems to do something. one kworked and one
> btrfs-transaction process hog one CPU core each for hours, while
> blocking the filesystem for minutes apiece, which leads to the host
> being nearly unuseable up to the point of "clock and mouse pointer
> frozen for nearly ten minutes".
I assume you still have your every 10 minutes snapshotting running
while balancing?
> The btrfs balance cancel I issued after four hours of this state took
> eleven minutes alone to complete.
>
> These are all log entries that were obtained after starting btrfs
> balance -mprofiles on 09:43
> Apr 1 12:18:21 swivel kernel: [253651.970413] BTRFS info (device dm-14): found 3523 extents
> Apr 1 12:18:21 swivel kernel: [253652.035572] BTRFS info (device dm-14): relocating block group 1538365849600 flags 36
> Apr 1 13:30:57 swivel kernel: [258007.653597] BTRFS info (device dm-14): found 3585 extents
> Apr 1 13:30:57 swivel kernel: [258007.746541] BTRFS info (device dm-14): relocating block group 1536755236864 flags 36
> Apr 1 13:49:39 swivel kernel: [259130.296184] BTRFS info (device dm-14): found 3047 extents
> Apr 1 13:49:39 swivel kernel: [259130.357314] BTRFS info (device dm-14): relocating block group 1528702173184 flags 36
> Apr 1 14:30:00 swivel kernel: [261550.776348] BTRFS info (device dm-14): found 4200 extents
>
> This kernel trace from 11:16 is not btrfs-related, is it? I guess it's
> bluetooth related since it happened simultaneously to the bluetooth
> device popping out an in:
> Apr 1 11:16:38 swivel kernel: [249948.993751] usb 1-1.4: USB disconnect, device number 39
> Apr 1 11:16:38 swivel systemd[1]: Starting Load/Save RF Kill Switch Status...
> Apr 1 11:16:38 swivel systemd[1]: Started Load/Save RF Kill Switch Status.
> Apr 1 11:16:38 swivel systemd[1]: bluetooth.target: Unit not needed anymore. Stopping.
> Apr 1 11:16:38 swivel systemd[1]: Stopped target Bluetooth.
> Apr 1 11:16:38 swivel laptop-mode: Laptop mode
> Apr 1 11:16:38 swivel laptop-mode: enabled, not active
> Apr 1 11:16:39 swivel kernel: [249949.211549] usb 1-1.4: new full-speed USB device number 40 using ehci-pci
> Apr 1 11:16:39 swivel kernel: [249949.308386] usb 1-1.4: New USB device found, idVendor=0a5c, idProduct=217f
> Apr 1 11:16:39 swivel kernel: [249949.308397] usb 1-1.4: New USB device strings: Mfr=1, Product=2, SerialNumber=3
> Apr 1 11:16:39 swivel kernel: [249949.308402] usb 1-1.4: Product: Broadcom Bluetooth Device
> Apr 1 11:16:39 swivel kernel: [249949.308407] usb 1-1.4: Manufacturer: Broadcom Corp
> Apr 1 11:16:39 swivel kernel: [249949.308412] usb 1-1.4: SerialNumber: CCAF78F1274F
> Apr 1 11:16:39 swivel systemd[1]: Reached target Bluetooth.
> Apr 1 11:16:39 swivel kernel: [249949.507794] ------------[ cut here ]------------
> Apr 1 11:16:39 swivel kernel: [249949.507810] WARNING: CPU: 1 PID: 11 at arch/x86/kernel/cpu/perf_event_intel_ds.c:325 reserve_ds_buffers+0x102/0x326()
> Apr 1 11:16:39 swivel kernel: [249949.507813] alloc_bts_buffer: BTS buffer allocation failure
> Apr 1 11:16:39 swivel kernel: [249949.507816] Modules linked in: cpuid hid_generic usbhid hid e1000e tun ctr ccm rfcomm bridge stp llc cpufreq_userspace cpufreq_stats cpufreq_conservative cpufreq_powersave nf_conntrack_netlink nfnetlink bnep binfmt_misc intel_rapl x86_pkg_temp_thermal arc4 intel_powerclamp kvm_intel kvm irqbypass iwldvm snd_hda_codec_conexant snd_hda_codec_generic mac80211 input_leds btusb btbcm i2c_i801 snd_hda_intel btintel snd_hda_codec bluetooth iwlwifi snd_hda_core cfg80211 snd_hwdep sg snd_pcm_oss snd_mixer_oss lpc_ich mfd_core snd_pcm shpchp snd_timer thinkpad_acpi nvram snd battery soundcore rfkill ac tpm_tis tpm evdev processor xt_TCPMSS xt_tcpudp iptable_mangle iptable_filter ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack ip_tables ip6table_filter ip6_tables x_tables coretemp tp_smapi(O) thinkpad_ec(O) loop drbd lru_cache libcrc32c crc32c_generic autofs4 btrfs xor raid6_pq ext4 crc16 mbcache jbd2 algif_skcipher af_alg dm_snapshot dm_bufio dm_crypt dm_mod md_mod sd_mod usb_storage crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel jitterentropy_rng hmac sha256_ssse3 sha256_generic drbg ansi_cprng aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd psmouse ahci libahci i915 libata ehci_pci sdhci_pci i2c_algo_bit scsi_mod ehci_hcd sdhci drm_kms_helper mmc_core usbcore usb_common drm i2c_core thermal video button [last unloaded: e1000e]
> Apr 1 11:16:39 swivel kernel: [249949.507974] CPU: 1 PID: 11 Comm: watchdog/1 Tainted: G O 4.5.0-zgws1 #2
> Apr 1 11:16:39 swivel kernel: [249949.507978] Hardware name: LENOVO 4240CTO/4240CTO, BIOS 8AET63WW (1.43 ) 05/08/2013
> Apr 1 11:16:39 swivel kernel: [249949.507981] 0000000000000047 ffffffff811e6bf3 ffff88017dd4fce8 0000000000000009
> Apr 1 11:16:39 swivel kernel: [249949.507987] ffffffff81052619 ffffffff810230d1 0000000000000000 ffff88017dd4fd40
> Apr 1 11:16:39 swivel kernel: [249949.507992] 0000000000000000 00000000000106b0 ffffffff81052671 ffffffff817188a1
> Apr 1 11:16:39 swivel kernel: [249949.507997] Call Trace:
> Apr 1 11:16:39 swivel kernel: [249949.508006] [<ffffffff811e6bf3>] ? dump_stack+0x5a/0x6f
> Apr 1 11:16:39 swivel kernel: [249949.508013] [<ffffffff81052619>] ? warn_slowpath_common+0x8e/0xa3
> Apr 1 11:16:39 swivel kernel: [249949.508019] [<ffffffff810230d1>] ? reserve_ds_buffers+0x102/0x326
> Apr 1 11:16:39 swivel kernel: [249949.508024] [<ffffffff81052671>] ? warn_slowpath_fmt+0x43/0x4b
> Apr 1 11:16:39 swivel kernel: [249949.508029] [<ffffffff810230d1>] ? reserve_ds_buffers+0x102/0x326
> Apr 1 11:16:39 swivel kernel: [249949.508035] [<ffffffff810bfefd>] ? watchdog_timer_fn+0x1ad/0x1ad
> Apr 1 11:16:39 swivel kernel: [249949.508040] [<ffffffff8101e15f>] ? x86_reserve_hardware+0xb9/0xc8
> Apr 1 11:16:39 swivel kernel: [249949.508045] [<ffffffff8101e1b9>] ? x86_pmu_event_init+0x4b/0x1bb
> Apr 1 11:16:39 swivel kernel: [249949.508050] [<ffffffff810dfca7>] ? perf_try_init_event+0x3d/0x6c
> Apr 1 11:16:39 swivel kernel: [249949.508055] [<ffffffff810e17d2>] ? perf_event_alloc+0x3c2/0x500
> Apr 1 11:16:39 swivel kernel: [249949.508060] [<ffffffff810e2cb7>] ? perf_event_create_kernel_counter+0x1f/0x122
> Apr 1 11:16:39 swivel kernel: [249949.508065] [<ffffffff810bfc0e>] ? watchdog_enable+0x9d/0x199
> Apr 1 11:16:39 swivel kernel: [249949.508071] [<ffffffff8106a8df>] ? smpboot_thread_fn+0xf7/0x13a
> Apr 1 11:16:39 swivel kernel: [249949.508075] [<ffffffff8106a7e8>] ? sort_range+0x17/0x17
> Apr 1 11:16:39 swivel kernel: [249949.508081] [<ffffffff81068756>] ? kthread+0x95/0x9d
> Apr 1 11:16:39 swivel kernel: [249949.508085] [<ffffffff810686c1>] ? kthread_parkme+0x16/0x16
> Apr 1 11:16:39 swivel kernel: [249949.508092] [<ffffffff8141dcff>] ? ret_from_fork+0x3f/0x70
> Apr 1 11:16:39 swivel kernel: [249949.508097] [<ffffffff810686c1>] ? kthread_parkme+0x16/0x16
> Apr 1 11:16:39 swivel kernel: [249949.508100] ---[ end trace e082dccd90d0875a ]---
> Apr 1 11:16:39 swivel kernel: [249949.509383] watchdog/1: page allocation failure: order:4, mode:0x26040c0
> Apr 1 11:16:39 swivel kernel: [249949.509390] CPU: 1 PID: 11 Comm: watchdog/1 Tainted: G W O 4.5.0-zgws1 #2
> Apr 1 11:16:39 swivel kernel: [249949.509392] Hardware name: LENOVO 4240CTO/4240CTO, BIOS 8AET63WW (1.43 ) 05/08/2013
> Apr 1 11:16:39 swivel kernel: [249949.509395] 0000000000000047 ffffffff811e6bf3 0000000000000001 ffff88017dd4fb60
> Apr 1 11:16:39 swivel kernel: [249949.509401] ffffffff810f01af 0000000000000010 0000000000000040 0000000000000000
> Apr 1 11:16:39 swivel kernel: [249949.509407] 00004c0800000001 000000001e5eeb00 0000000000000004 0000000000000040
> Apr 1 11:16:39 swivel kernel: [249949.509412] Call Trace:
> Apr 1 11:16:39 swivel kernel: [249949.509418] [<ffffffff811e6bf3>] ? dump_stack+0x5a/0x6f
> Apr 1 11:16:39 swivel kernel: [249949.509425] [<ffffffff810f01af>] ? warn_alloc_failed+0x10f/0x127
> Apr 1 11:16:39 swivel kernel: [249949.509431] [<ffffffff810f2a55>] ? __alloc_pages_nodemask+0x8cc/0x966
> Apr 1 11:16:39 swivel kernel: [249949.509438] [<ffffffff8112dd2c>] ? kmem_getpages+0x50/0x12c
> Apr 1 11:16:39 swivel kernel: [249949.509442] [<ffffffff8112e046>] ? fallback_alloc+0xfe/0x1a7
> Apr 1 11:16:39 swivel kernel: [249949.509447] [<ffffffff8112e73d>] ? kmem_cache_alloc_node_trace+0x89/0x14b
> Apr 1 11:16:39 swivel kernel: [249949.509454] [<ffffffff8102314f>] ? reserve_ds_buffers+0x180/0x326
> Apr 1 11:16:39 swivel kernel: [249949.509459] [<ffffffff810bfefd>] ? watchdog_timer_fn+0x1ad/0x1ad
> Apr 1 11:16:39 swivel kernel: [249949.509463] [<ffffffff8101e15f>] ? x86_reserve_hardware+0xb9/0xc8
> Apr 1 11:16:39 swivel kernel: [249949.509468] [<ffffffff8101e1b9>] ? x86_pmu_event_init+0x4b/0x1bb
> Apr 1 11:16:39 swivel kernel: [249949.509472] [<ffffffff810dfca7>] ? perf_try_init_event+0x3d/0x6c
> Apr 1 11:16:39 swivel kernel: [249949.509477] [<ffffffff810e17d2>] ? perf_event_alloc+0x3c2/0x500
> Apr 1 11:16:39 swivel kernel: [249949.509482] [<ffffffff810e2cb7>] ? perf_event_create_kernel_counter+0x1f/0x122
> Apr 1 11:16:39 swivel kernel: [249949.509487] [<ffffffff810bfc0e>] ? watchdog_enable+0x9d/0x199
> Apr 1 11:16:39 swivel kernel: [249949.509491] [<ffffffff8106a8df>] ? smpboot_thread_fn+0xf7/0x13a
> Apr 1 11:16:39 swivel kernel: [249949.509495] [<ffffffff8106a7e8>] ? sort_range+0x17/0x17
> Apr 1 11:16:39 swivel kernel: [249949.509500] [<ffffffff81068756>] ? kthread+0x95/0x9d
> Apr 1 11:16:39 swivel kernel: [249949.509505] [<ffffffff810686c1>] ? kthread_parkme+0x16/0x16
> Apr 1 11:16:39 swivel kernel: [249949.509510] [<ffffffff8141dcff>] ? ret_from_fork+0x3f/0x70
> Apr 1 11:16:39 swivel kernel: [249949.509515] [<ffffffff810686c1>] ? kthread_parkme+0x16/0x16
> Apr 1 11:16:39 swivel kernel: [249949.509519] Mem-Info:
> Apr 1 11:16:39 swivel kernel: [249949.509529] active_anon:1107088 inactive_anon:326101 isolated_anon:0
> Apr 1 11:16:39 swivel kernel: [249949.509529] active_file:1104846 inactive_file:1367650 isolated_file:0
> Apr 1 11:16:39 swivel kernel: [249949.509529] unevictable:2526 dirty:14757 writeback:0 unstable:0
> Apr 1 11:16:39 swivel kernel: [249949.509529] slab_reclaimable:56106 slab_unreclaimable:33051
> Apr 1 11:16:39 swivel kernel: [249949.509529] mapped:67336 shmem:87440 pagetables:12012 bounce:0
> Apr 1 11:16:39 swivel kernel: [249949.509529] free:30592 free_pcp:170 free_cma:0
> Apr 1 11:16:39 swivel kernel: [249949.509538] Node 0 DMA free:15360kB min:12kB low:12kB high:16kB active_anon:0kB inactive_anon:0kB active_file:0kB inactive_file:0kB unevictable:0kB isolated(anon):0kB isolated(file):0kB present:15984kB managed:15360kB mlocked:0kB dirty:0kB writeback:0kB mapped:0kB shmem:0kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB unstable:0kB bounce:0kB free_pcp:0kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? yes
> Apr 1 11:16:39 swivel kernel: [249949.509553] lowmem_reserve[]: 0 3403 15919 15919
> Apr 1 11:16:39 swivel kernel: [249949.509559] Node 0 DMA32 free:64968kB min:3436kB low:4292kB high:5152kB active_anon:475148kB inactive_anon:357880kB active_file:1173604kB inactive_file:1314960kB unevictable:3416kB isolated(anon):0kB isolated(file):0kB present:3561088kB managed:3487816kB mlocked:3416kB dirty:13592kB writeback:0kB mapped:55924kB shmem:70004kB slab_reclaimable:47096kB slab_unreclaimable:17888kB kernel_stack:2000kB pagetables:8308kB unstable:0kB bounce:0kB free_pcp:4kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:128 all_unreclaimable? no
> Apr 1 11:16:39 swivel kernel: [249949.509575] lowmem_reserve[]: 0 0 12516 12516
> Apr 1 11:16:39 swivel kernel: [249949.509580] Node 0 Normal free:42040kB min:12648kB low:15808kB high:18972kB active_anon:3953204kB inactive_anon:946524kB active_file:3245780kB inactive_file:4155640kB unevictable:6688kB isolated(anon):0kB isolated(file):0kB present:13080576kB managed:12816596kB mlocked:6688kB dirty:45436kB writeback:0kB mapped:213420kB shmem:279756kB slab_reclaimable:177328kB slab_unreclaimable:114316kB kernel_stack:8688kB pagetables:39740kB unstable:0kB bounce:0kB free_pcp:764kB local_pcp:0kB free_cma:0kB writeback_tmp:0kB pages_scanned:0 all_unreclaimable? no
> Apr 1 11:16:39 swivel kernel: [249949.509596] lowmem_reserve[]: 0 0 0 0
> Apr 1 11:16:39 swivel kernel: [249949.509601] Node 0 DMA: 0*4kB 0*8kB 0*16kB 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 1*1024kB (U) 1*2048kB (M) 3*4096kB (M) = 15360kB
> Apr 1 11:16:39 swivel kernel: [249949.509619] Node 0 DMA32: 11548*4kB (UME) 2282*8kB (UME) 55*16kB (UM) 2*32kB (UM) 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 65392kB
> Apr 1 11:16:39 swivel kernel: [249949.509638] Node 0 Normal: 3736*4kB (UME) 3206*8kB (UE) 131*16kB (U) 0*32kB 0*64kB 0*128kB 0*256kB 0*512kB 0*1024kB 0*2048kB 0*4096kB = 42688kB
> Apr 1 11:16:39 swivel kernel: [249949.509657] Node 0 hugepages_total=0 hugepages_free=0 hugepages_surp=0 hugepages_size=2048kB
> Apr 1 11:16:39 swivel kernel: [249949.509661] 2561271 total pagecache pages
> Apr 1 11:16:39 swivel kernel: [249949.509664] 616 pages in swap cache
> Apr 1 11:16:39 swivel kernel: [249949.509667] Swap cache stats: add 28221, delete 27605, find 294750/295285
> Apr 1 11:16:39 swivel kernel: [249949.509670] Free swap = 8277324kB
> Apr 1 11:16:39 swivel kernel: [249949.509672] Total swap = 8386556kB
> Apr 1 11:16:39 swivel kernel: [249949.509674] 4164412 pages RAM
> Apr 1 11:16:39 swivel kernel: [249949.509676] 0 pages HighMem/MovableOnly
> Apr 1 11:16:39 swivel kernel: [249949.509678] 84469 pages reserved
> Apr 1 11:16:39 swivel kernel: [249949.509681] 0 pages hwpoisoned
> Apr 1 11:16:39 swivel kernel: [249949.509717] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
> Apr 1 11:16:39 swivel kernel: [249949.537265] EXT4-fs (dm-16): re-mounted. Opts: data=ordered,commit=0
> Apr 1 11:16:39 swivel systemd[1]: Reloading Laptop Mode Tools.
> Apr 1 11:16:39 swivel kernel: [249949.664133] thinkpad_acpi: EC reports that Thermal Table has changed
> Apr 1 11:16:39 swivel kernel: [249949.723795] IPv6: ADDRCONF(NETDEV_UP): eth0: link is not ready
>
>
>
> This btrfs is ripe for the backup-format-restore procedure, right?
>
> Greetings
> Marc
>
>
> --
> -----------------------------------------------------------------------------
> Marc Haber | "I don't trust Computers. They | Mailadresse im Header
> Leimen, Germany | lose things." Winona Ryder | Fon: *49 6224 1600402
> Nordisch by Nature | How to make an American Quilt | Fax: *49 6224 1600421
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html