Google
  Web www.spinics.net

Re: Soft-Lockup/Race in networking in 2.6.31-rc1+195 ( possibly caused by netem)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]


On 07/02/2009 01:59 PM, Andres Freund wrote:
On 07/02/2009 01:54 PM, Jarek Poplawski wrote:
On Thu, Jul 02, 2009 at 01:43:49PM +0200, Andres Freund wrote: ...
I will start trying to place the issue by testing with existing
kernels between 2.6.30 and now.
If you can afford your time of course this would be very helpful.
Well. Waiting for the issue to resolve itself would cost time as well
;-) I wont be able to finish this today, but perhaps some reduction
of the search space will be enough.
I lied.

I placed it between 2.6.30 and
03347e2592078a90df818670fddf97a33eec70fb (v2.6.30-5415-g03347e2) so
far.
Ok. I finally see the light. I bisected the issue down to
eea08f32adb3f97553d49a4f79a119833036000a :  timers: Logic to move non
pinned timers

Disabling timer migration like provided in the earlier commit stops the issue from occuring.

That it is related to timers is sensible in the light of my findings, that I could trigger the issue only when using delay in netem - that is the codepath using qdisc_watchdog...

Andres

Repasted original problem description for newly CC'ed people:
While playing around with netem (time, not packet count based loss-
bursts) I experienced soft lockups several times - to exclude it was
my modifications causing this I recompiled with the original and it
is still locking up. I captured several of those traces via the
thankfully still working netconsole. The simplest policy I could
reproduce the error with was: tc qdisc add dev eth0 root handle 1:
netem delay 10ms loss 0

I could not reproduce the error without delay - but that may only be
a timing issue, as the host I was mainly transferring data to was on
a local network. I could not reproduce the issue on lo.

The time to reproduce the error varied from seconds after executing
tc to several minutes.

Traces 5+6 are made with vanilla
52989765629e7d182b4f146050ebba0abf2cb0b7

The earlier traces are made with parts of my patches applied, and
only included for completeness as I don't believe my modifications
were causing this and all traces are different, so it may give some
clues.

Lockdep was enabled but did not diagnose anything relevant (one dvb
warning during bootup).

Any ideas for debugging?

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

[Site Home]     [Other Archives]     [Linux Kernel Newbies]     [Fedora Kernel]     [Linux Kernel Testers]     [Linux SH]     [Linux Omap]     [Linux Kbuild]     [Linux Tape]     [Linux Input]     [Linux Kernel Janitors]     [Linux Kernel Packagers]     [Linux Doc]     [Linux Man Pages]     [Linux API]     [Linux Modules]     [Linux Standards]     [Kernel Announce]     [Memory]     [Netdev]     [Git]     [Linux PCI]     [Linux I2C]     [Linux NUMA]     [Netfilter]     [Netfilter Devel]     [SELinux]     [Bugtraq]     [FIO]     [Linux Serial]     [Linux PPP]     [Linux ISDN]     [Linux Next]     [Kernel Stable Commits]     [Linux Tip Commits]     [Kernel MM Commits]     [Linux Security Module]     [Ext3]     [Ext4]     [Linux BTRFS]     [Linux XFS]     [Linux NFS]     [Linux Cachefs]     [Reiser FS]     [Initramfs]     [Fastboot]     [Linux RT Users]     [Linux Virtualization]     [LVS Devel]     [Hot Plug]     [KVM]     [KVM PPC]     [KVM ia64]     [Linux Containers]     [Util Linux NG]     [Sk Drivers]     [Wireless]     [Linux Bluetooth]     [Bluez Devel]     [Ethernet Bridging]     [Embedded Linux]     [Sparse]     [Linux Arch]     [Linux ACPI]     [Linux IBM ACPI]     [Linux OpenGL]     [CPU Freq]     [Linux Power Management]     [Linux DCCP]     [Linux SCTP]     [ALSA Devel]     [Linux USB]     [Large Format Photos]     [DVD Store]     [Tux]     [Gimp]     [Yosemite National Park Forum]     [Linux PA RISC]     [MIPS Linux]     [IBM S/390 Linux]     [ARM Linux]     [ARM Kernel]     [Sparc Linux]     [Linux Security]     [Linux Sound]     [Linux Media]     [Video 4 Linux]     [Linux for the blind]     [Linux RAID]     [Linux ATA RAID]     [Linux Clusters]     [Linux SCSI]     [Linux SCSI Target Infrastructure]     [Linux IDE]     [Linux SMP]     [Linux AXP]     [Linux Alpha]     [Linux M68K]     [Linux ia64]     [Linux 8086]     [Linux x86_64]     [Linux Config]     [Linux Apps]     [Linux MSDOS]     [Linux X.25]     [Linux Crypto]     [DM Crypt]     [Linux Btrace]     [Utrace Devel]     [Yosemite Photos]     [Linux C Programming]     [Linux Assembly]     [Dash]     [DWARVES]     [Older Kernel Mail]

Add to Google Powered by Linux