Re: 3.13.5 kernel hangs some processes with btrfs

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Marc MERLIN posted on Sun, 23 Feb 2014 22:58:47 -0800 as excerpted:

> On Mon, Feb 24, 2014 at 06:42:30AM +0000, Duncan wrote:
>> [S]ee the /var/lib/btrfs/scrub.status.* files.  That's
>> where scrub state is stored, and manually blowing away the appropriate
>> file should clear btrfs' memory of the aborted scrub, so you can scrub
>> start properly.
> 
> Ah, silly me, I thought this was all in the kernel and not in userspace.

That was a bit eye opening for me too. =:^)

> Yep, I cleared the stats, and that part is back to ok, thanks.

=:^)

> But I'm still seeing these, albeit less often.
> Any idea what they could be linked to?
> (I have a btrs send/receive going right now, it could hanging
> /mnt/btrfs_pool1 in a way that affects smbd, but the array feels ok
> otherwise, weird...)
> 
> [ 1332.548370] INFO: task smbd:21882 blocked for more than 120 seconds.
> [ 1332.587455]       Not tainted 3.13.5-ia32-i915-preempt-20140216 #1
> [ 1332.625478] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs"
> disables this message.

I've not seen anything like that here, but there are several kernel 
3.13/3.14-rc reports of similar behavior on the list.

>From what I've seen reported, the problem /might/ be large-internal-write-
file related, multi-gig vm images, database files, etc, actively being 
written, the sort of thing NOCOW *SHOULD* fix, at least in the absence of 
frequent snapshots, but in at least one case, NOCOW had been properly 
activated before the file content was written and the user was NOT doing 
major snapshotting of any kind, so that rules out those triggers.

So I've no idea, except that in every reported case I've seen, people did 
have large VMs or the like going as well, so that's a possible connection 
despite the above.

Hopefully the devs are having more success at assembling this puzzle than 
I am, but I've no suggestions for fixing it ATM, except the possibility 
of putting your VMs, etc, on a dedicated non-btrfs filesystem for the 
time being, assuming of course that apparent connection is a valid one.

-- 
Duncan - List replies preferred.   No HTML msgs.
"Every nonfree program has a lord, a master --
and if you use the program, he is your master."  Richard Stallman

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Filesystem Development]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux