- To: NeilBrown <neilb@xxxxxxx>
- Subject: Re: raid10 make_request failure during iozone benchmark upon btrfs
- From: Kerin Millar <kerframil@xxxxxxxxx>
- Date: Tue, 03 Jul 2012 03:13:33 +0100
- Cc: linux-raid@xxxxxxxxxxxxxxx
- In-reply-to: <20120703113943.3e4c43ad@notabene.brown>
- User-agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:10.0.5) Gecko/20120605 Thunderbird/10.0.5
Hi,
On 03/07/2012 02:39, NeilBrown wrote:
[snip]
>>> Could you please double check that you are running a kernel with
>>>
>>> commit aba336bd1d46d6b0404b06f6915ed76150739057
>>> Author: NeilBrown<neilb@xxxxxxx>
>>> Date: Thu May 31 15:39:11 2012 +1000
>>>
>>> md: raid1/raid10: fix problem with merge_bvec_fn
>>>
>>> in it?
>>
>> I am indeed. I searched the list beforehand and noticed the patch in
>> question. Not sure which -rc it landed in but I checked my source tree
>> and it's definitely in there.
>>
>> Cheers,
>>
>> --Kerin
>
> Thanks.
> Looking at it again I see that it is definitely a different bug, that patch
> wouldn't affect it.
>
> But I cannot see what could possibly be causing the problem.
> You have a 256K chunk size, so requests should be limited to 512 sectors
> aligned at a 512-sector boundary.
> However all the requests that a causing errors are 512 sectors long, but
> aligned on a 256-sector boundary (which is not also 512-sector). This is
> wrong.
I see.
>
> It could be that btrfs is submitting bad requests, but I think it always uses
> bio_add_page, and bio_add_page appears to do the right thing.
> It could be that dm-linear is causing problem, but it seems to correctly after
> the underlying device for alignment, and reports that alignment to
> bio_add_page.
> It could be that md/raid10 is the problem but I cannot find any fault in
> raid10_mergeable_bvec - performs much the same tests that the
> raid01 make_request function does.
>
> So it is a mystery.
>
> Is this failure repeatable?
Yes, it's reproducible with 100% consistency. Furthermore, I tried to
use the btrfs volume as a store for the package manager, so as to try
with a 'realistic' workload. Many of these errors were triggered
immediately upon invoking the package manager. In case it matters, the
package manager is portage (in Gentoo Linux) and the directory structure
entails a shallow directory depth with a large number of distributed
small files. I haven't been able to reproduce with xfs, ext4 or reiserfs.
>
> If so, could you please insert
> WARN_ON_ONCE(1);
> in drivers/md/raid10.c where it prints out the message: just after the
> "bad_map:" label.
>
> Also, in raid10_mergeable_bvec, insert
> WARN_ON_ONCE(max< 0);
> just before
> if (max< 0)
> /* bio_add cannot handle a negative return */
> max = 0;
>
> and then see if either of those generate a warning, and post the full stack
> trace if they do.
OK. I ran iozone again on a fresh filesystem, mounted with the default
options. Here's the trace that appears, just before the first
make_request_bug message:
WARNING: at drivers/md/raid10.c:1094 make_request+0xda5/0xe20()
Hardware name: ProLiant MicroServer
Modules linked in: btrfs zlib_deflate lzo_compress kvm_amd kvm sp5100_tco i2c_piix4
Pid: 1031, comm: btrfs-submit-1 Not tainted 3.5.0-rc5 #3
Call Trace:
[<ffffffff81031987>] ? warn_slowpath_common+0x67/0xa0
[<ffffffff81442b45>] ? make_request+0xda5/0xe20
[<ffffffff81460b34>] ? __split_and_process_bio+0x2d4/0x600
[<ffffffff81063429>] ? set_next_entity+0x29/0x60
[<ffffffff810652c3>] ? pick_next_task_fair+0x63/0x140
[<ffffffff81450b7f>] ? md_make_request+0xbf/0x1e0
[<ffffffff8123d12f>] ? generic_make_request+0xaf/0xe0
[<ffffffff8123d1c3>] ? submit_bio+0x63/0xe0
[<ffffffff81040abd>] ? try_to_del_timer_sync+0x7d/0x120
[<ffffffffa016839a>] ? run_scheduled_bios+0x23a/0x520 [btrfs]
[<ffffffffa0170e40>] ? worker_loop+0x120/0x520 [btrfs]
[<ffffffffa0170d20>] ? btrfs_queue_worker+0x2e0/0x2e0 [btrfs]
[<ffffffff810520c5>] ? kthread+0x85/0xa0
[<ffffffff815441f4>] ? kernel_thread_helper+0x4/0x10
[<ffffffff81052040>] ? kthread_freezable_should_stop+0x60/0x60
[<ffffffff815441f0>] ? gs_change+0xb/0xb
Cheers,
--Kerin
--
To unsubscribe from this list: send the line "unsubscribe linux-raid" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
[ATA RAID]
[Linux SCSI Target Infrastructure]
[Managing RAID on Linux]
[Linux IDE]
[Linux SCSI]
[Linux Hams]
[Device-Mapper]
[Kernel]
[Linux Books]
[Linux Admin]
[Linux Net]
[GFS]
[RPM]
[git]
[Photos]
[Yosemite Photos]
[Yosemite News]
[AMD 64]
[Linux Networking]