Re: btrfs corruption after cp --reflink

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello again!

2011/10/8 Kai Krakow <hurikhan77+btrfs@xxxxxxxxx>:
> I tried to "cp --reflink" a huge file (about 80G, a VMware disk
> image). It took maybe about 1 minute when my PC started thrashing the
> hard disk, some minutes later the command returned with an out of
> memory message.
[...]
> So I'd love to
> get rid of the problems scrub complains about. I don't mind if I would
> have to delete some files which I can probably recover easily. But I
> can simply find no way to identify these files. Is there a way to map
> the above error messages to file system pathes?

I figured out that most of the affected files are located in the
Google Chromium cache. Trying to "cat" all files there showed up the
csum errors in dmesg but the system stays stable. I decided to simply
delete the cache, however that makes "rm" becoming killed. Here's the
dmesg output from the "rm -Rf .cache/chromium/*" session:

[  637.297845] verify_parent_transid: 470 callbacks suppressed
[  637.297852] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  637.298833] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  637.315081] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  637.345259] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  637.345269] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  637.345296] BUG: unable to handle kernel NULL pointer dereference at 0000001c
[  637.345306] IP: [<c1142c2f>] btrfs_print_leaf+0xd/0x886
[  637.345319] *pde = 00000000
[  637.345324] Oops: 0000 [#1] SMP
[  637.345330] Modules linked in: af_packet vmnet vmblock vsock vmci
vmmon lm90 it87 hwmon_vid hwmon fuse rfcomm bnep snd_seq_oss
snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss
reiserfs zram(C) mperf loop emu10k1_gp sidewinder joydev nfs lockd
auth_rpcgss nfs_acl sunrpc ipv6 tcp_cubic nvidia(P) snd_usb_audio
snd_usbmidi_lib i82875p_edac usb_storage usbhid 8250_pnp gspca_sonixj
gspca_main videodev firewire_ohci edac_core uas hid firewire_core
sr_mod cdrom sg ns558 btusb analog evdev pcspkr ne2k_pci 8390 floppy
gameport i2c_i801 8250 parport_pc serial_core parport e1000 intel_agp
snd_mpu401 snd_mpu401_uart thermal bluetooth crc16 crc_itu_t fan
processor button intel_gtt agpgart unix [last unloaded: microcode]
[  637.345434]
[  637.345439] Pid: 2465, comm: btrfs-delayed-m Tainted: P       A C
3.0.6-gentoo #1    /8KNXP
[  637.345446] EIP: 0060:[<c1142c2f>] EFLAGS: 00010286 CPU: 1
[  637.345450] EIP is at btrfs_print_leaf+0xd/0x886
[  637.345454] EAX: f506a800 EBX: f506a800 ECX: 00418335 EDX: 00000000
[  637.345457] ESI: 00000000 EDI: f4c40850 EBP: fffffffb ESP: f49d7d30
[  637.345460]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[  637.345464] Process btrfs-delayed-m (pid: 2465, ti=f49d6000
task=f4707800 task.ti=f49d6000)
[  637.345467] Stack:
[  637.345469]  f499e840 f505e000 000000b2 00000000 00000000 ffffffff
00000000 00000000
[  637.345478]  00000100 00000000 0006cac6 00000000 c113b1ec c101e917
f506a800 c1350707
[  637.345487]  11d22000 00000128 00060000 00008050 2811d220 a8000001
00040000 00000000
[  637.345496] Call Trace:
[  637.345502]  [<c113b1ec>] ? update_block_group.clone.51+0x2b3/0x2e4
[  637.345508]  [<c101e917>] ? need_resched+0x11/0x1a
[  637.345513]  [<c1350707>] ? _cond_resched+0x5/0x18
[  637.345518]  [<c113b7e1>] ? __btrfs_free_extent+0x397/0x7e3
[  637.345523]  [<c113e818>] ? run_clustered_refs+0x839/0x869
[  637.345528]  [<c11b66f8>] ? rb_erase+0x14d/0x1f0
[  637.345532]  [<c113e805>] ? run_clustered_refs+0x826/0x869
[  637.345537]  [<c101d84e>] ? kmap_atomic_prot+0x23/0x96
[  637.345542]  [<c11356fc>] ? btrfs_search_slot+0x3e8/0x452
[  637.345547]  [<c113e909>] ? btrfs_run_delayed_refs+0xc1/0x144
[  637.345552]  [<c114d8a0>] ? __btrfs_end_transaction+0x70/0x19b
[  637.345556]  [<c114d9df>] ? btrfs_end_transaction_dmeta+0x14/0x18
[  637.345561]  [<c118d3dd>] ? btrfs_async_run_delayed_node_done+0x14d/0x1a0
[  637.345567]  [<c1174d3f>] ? worker_loop+0x10a/0x393
[  637.345571]  [<c1174c35>] ? btrfs_queue_worker+0x1f1/0x1f1
[  637.345576]  [<c103b98a>] ? kthread+0x63/0x68
[  637.345580]  [<c103b927>] ? kthread_worker_fn+0x10f/0x10f
[  637.345585]  [<c135263e>] ? kernel_thread_helper+0x6/0xd
[  637.345588] Code: da d4 20 00 83 c4 2c 5b 5e 5f 5d c3 53 e8 ab ac
ed ff 8a 58 64 e8 58 ab ed ff 88 d8 5b c3 55 57 56 53 83 ec 60 89 c3
89 54 24 2c <8b> 42 1c e8 8a ac ed ff 8b 50 60 89 54 24 48 e8 33 ab ed
ff 8b
[  637.345633] EIP: [<c1142c2f>] btrfs_print_leaf+0xd/0x886 SS:ESP 0068:f49d7d30
[  637.345640] CR2: 000000000000001c
[  637.345647] ---[ end trace 640af837f79e8469 ]---
[  639.058404] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  639.058670] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  639.059074] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  639.067414] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  639.067423] parent transid verify failed on 641086160896 wanted
43863 found 43873
[  639.067453] BUG: unable to handle kernel NULL pointer dereference at 0000001c
[  639.067462] IP: [<c1142c2f>] btrfs_print_leaf+0xd/0x886
[  639.067476] *pde = 00000000
[  639.067481] Oops: 0000 [#2] SMP
[  639.067487] Modules linked in: af_packet vmnet vmblock vsock vmci
vmmon lm90 it87 hwmon_vid hwmon fuse rfcomm bnep snd_seq_oss
snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss
reiserfs zram(C) mperf loop emu10k1_gp sidewinder joydev nfs lockd
auth_rpcgss nfs_acl sunrpc ipv6 tcp_cubic nvidia(P) snd_usb_audio
snd_usbmidi_lib i82875p_edac usb_storage usbhid 8250_pnp gspca_sonixj
gspca_main videodev firewire_ohci edac_core uas hid firewire_core
sr_mod cdrom sg ns558 btusb analog evdev pcspkr ne2k_pci 8390 floppy
gameport i2c_i801 8250 parport_pc serial_core parport e1000 intel_agp
snd_mpu401 snd_mpu401_uart thermal bluetooth crc16 crc_itu_t fan
processor button intel_gtt agpgart unix [last unloaded: microcode]
[  639.067584]
[  639.067590] Pid: 4241, comm: rm Tainted: P      DA C  3.0.6-gentoo
#1    /8KNXP
[  639.067596] EIP: 0060:[<c1142c2f>] EFLAGS: 00010286 CPU: 1
[  639.067600] EIP is at btrfs_print_leaf+0xd/0x886
[  639.067604] EAX: f506a800 EBX: f506a800 ECX: 0042f40d EDX: 00000000
[  639.067607] ESI: 00000000 EDI: f4c400e0 EBP: fffffffb ESP: ec6c5ce4
[  639.067610]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
[  639.067614] Process rm (pid: 4241, ti=ec6c4000 task=f3731c00
task.ti=ec6c4000)
[  639.067616] Stack:
[  639.067618]  f49f9780 f505e000 000000b2 00000000 00000000 ffffffff
00000000 00000000
[  639.067627]  00000100 00000000 0006cac7 00000000 c113b1ec c101e917
f506a800 c1350707
[  639.067636]  11d6c000 00000128 00001000 00008050 2811d6c0 a8000001
00040000 00000000
[  639.067645] Call Trace:
[  639.067651]  [<c113b1ec>] ? update_block_group.clone.51+0x2b3/0x2e4
[  639.067657]  [<c101e917>] ? need_resched+0x11/0x1a
[  639.067662]  [<c1350707>] ? _cond_resched+0x5/0x18
[  639.067667]  [<c113b7e1>] ? __btrfs_free_extent+0x397/0x7e3
[  639.067672]  [<c113e818>] ? run_clustered_refs+0x839/0x869
[  639.067676]  [<c113e818>] ? run_clustered_refs+0x839/0x869
[  639.067681]  [<c10963a7>] ? kfree+0x88/0x90
[  639.067685]  [<c113e818>] ? run_clustered_refs+0x839/0x869
[  639.067690]  [<c11857c8>] ? btrfs_delayed_ref_lock+0x2c/0x74
[  639.067694]  [<c113e805>] ? run_clustered_refs+0x826/0x869
[  639.067700]  [<c113e909>] ? btrfs_run_delayed_refs+0xc1/0x144
[  639.067704]  [<c1350707>] ? _cond_resched+0x5/0x18
[  639.067709]  [<c114d8a0>] ? __btrfs_end_transaction+0x70/0x19b
[  639.067713]  [<c114da21>] ? btrfs_end_transaction+0x11/0x15
[  639.067718]  [<c1157509>] ? btrfs_evict_inode+0x172/0x1e5
[  639.067723]  [<c10ac392>] ? evict+0x52/0xe1
[  639.067727]  [<c10a580c>] ? do_unlinkat+0xca/0x10a
[  639.067733]  [<c10c0e00>] ? fsnotify_find_inode_mark+0x17/0x1d
[  639.067737]  [<c109aa57>] ? filp_close+0x56/0x5f
[  639.067743]  [<c1352093>] ? sysenter_do_call+0x12/0x22
[  639.067745] Code: da d4 20 00 83 c4 2c 5b 5e 5f 5d c3 53 e8 ab ac
ed ff 8a 58 64 e8 58 ab ed ff 88 d8 5b c3 55 57 56 53 83 ec 60 89 c3
89 54 24 2c <8b> 42 1c e8 8a ac ed ff 8b 50 60 89 54 24 48 e8 33 ab ed
ff 8b
[  639.067791] EIP: [<c1142c2f>] btrfs_print_leaf+0xd/0x886 SS:ESP 0068:ec6c5ce4
[  639.067797] CR2: 000000000000001c
[  639.067805] ---[ end trace 640af837f79e846a ]---

Now every file access (no matter where in the file system) makes the
shell freeze and the process cannot be killed. So, effectively my
system is frozen again. :-(

Any fix for this?

Greetings,
Kai
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux