Hi Jonathan! Sorry I took a while to respond, and apologies in advance for my answers being quite useless...
> - steps to reproduce, assuming I had the same hardware

Use the computer. My maximum time without hitting the bug was less than 48 hours before I added the following module options:

  11n_disable=1 power_save=0 wd_disable=1

which some googling suggested might solve it. They do not, but they make it less frequent. With them, I can even go a WEEK without seeing this. But note that I have different hardware than the original reporter: mine is an X200s with "Intel Corporation Ultimate N WiFi Link 5300" (pci-id: 8086:4236).

> - expected result, actual result, and how the difference indicates a
> bug (should be simple enough in this case)

=)

Expected result: wifi keeps working unless I switch it off using rf_kill, the physical switch, unloading the module, or turning off the computer.

Actual result: suddenly, out of the blue, in the middle of typing an email in kmail, see attachment.

Bug: the wifi card has certainly not suddenly changed into "Unknown hardware type".

> - how reproducible it is (100% of the time? 50%?)

100% when waiting long enough between reboots.

> - which kernel versions you have tested, and results with each

There were no problems in the 2.6-series. The bug occurs at least in the Debian kernel versions 3.2.0-1-amd64, 3.0.0-2-amd64, and 3.0.0-1-amd64.

> - full "dmesg" output from booting and reproducing the bug, as an
> attachment

I do not have it now; if really necessary, I will get it next time it occurs (which may be a while: I am back to 2.6.39 because I need to get work done).

> - any other weird observations or workarounds

No workaround. The above module parameters alleviate the issue. This is a regression, so I suggest someone with time (if anyone has it) bisects between 2.6 and 3.2... horrible task, I do not envy anyone doing that.

> upstream to a public mailing list. So the purpose of these questions
> is to collect data on what's known so far as a starting point.

Please CC me if you do.
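Since the full dmesg is not available yet, a small filter could help pick the relevant lines out of a saved log the next time the bug occurs. The following is only a sketch: the three patterns are copied from the attached excerpt, and treating them as markers of this particular bug is an assumption, as are the names `SIGNATURES` and `first_hits`.

```python
import re

# Patterns taken from the attached log excerpt. Treating them as
# signatures of this bug is an assumption, not something confirmed upstream.
SIGNATURES = {
    "command timeout": re.compile(
        r"iwlwifi \S+ Error sending \w+: time out after \d+ms"),
    "all-ones CSR_GP_CNTRL": re.compile(r"CSR_GP_CNTRL = 0xFFFFFFFF"),
    "probe failure": re.compile(r"iwlwifi \S+ Unknown hardware type"),
}


def first_hits(log_text):
    """Map each signature name to the first log line that matches it."""
    hits = {}
    for line in log_text.splitlines():
        for name, pattern in SIGNATURES.items():
            if name not in hits and pattern.search(line):
                hits[name] = line
    return hits


# Three lines copied from the attachment, used here as sample input.
sample = """\
[252832.820219] iwlwifi 0000:03:00.0: Error sending POWER_TABLE_CMD: time out after 2000ms.
[252874.433273] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[253268.618003] iwlwifi 0000:03:00.0: Unknown hardware type
"""

for name, line in first_hits(sample).items():
    print(f"{name}: {line}")
```

The timestamp of the first "command timeout" hit marks where to start reading in the full log.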
Cheers,
Juha

-- 
 -----------------------------------------------
| Juha Jäykkä, ju...@iki.fi                     |
| http://www.maths.leeds.ac.uk/~juhaj           |
 -----------------------------------------------
[252832.820219] iwlwifi 0000:03:00.0: Error sending POWER_TABLE_CMD: time out after 2000ms.
[252832.820229] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 135
[252832.820237] iwlwifi 0000:03:00.0: set power fail, ret = -110
[252835.320320] iwlwifi 0000:03:00.0: Error sending REPLY_QOS_PARAM: time out after 2000ms.
[252835.320331] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 137
[252835.320338] iwlwifi 0000:03:00.0: Failed to update QoS
[252837.320249] iwlwifi 0000:03:00.0: Error sending REPLY_RXON: time out after 2000ms.
[252837.320259] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 140
[252837.320267] iwlwifi 0000:03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252839.320130] iwlwifi 0000:03:00.0: Error sending REPLY_RXON: time out after 2000ms.
[252839.320140] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 143
[252839.320148] iwlwifi 0000:03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252841.320273] iwlwifi 0000:03:00.0: Error sending REPLY_ADD_STA: time out after 2000ms.
[252841.320283] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 146
[252841.320296] ieee80211 phy0: failed to remove key (0, 00:24:17:33:f4:f5) from hardware (-110)
[252843.320053] iwlwifi 0000:03:00.0: Error sending REPLY_REMOVE_STA: time out after 2000ms.
[252843.320057] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 149
[252843.320061] iwlwifi 0000:03:00.0: Error removing station 00:24:17:33:f4:f5
[252845.324086] iwlwifi 0000:03:00.0: Error sending REPLY_RXON: time out after 2000ms.
[252845.324096] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 152
[252845.324104] iwlwifi 0000:03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252847.328250] iwlwifi 0000:03:00.0: fail to flush all tx fifo queues
[252849.328175] iwlwifi 0000:03:00.0: Error sending POWER_TABLE_CMD: time out after 2000ms.
[252849.328185] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 154
[252849.328192] iwlwifi 0000:03:00.0: set power fail, ret = -110
[252851.328203] iwlwifi 0000:03:00.0: Error sending REPLY_RXON: time out after 2000ms.
[252851.328214] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 155
[252851.328222] iwlwifi 0000:03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252853.328234] iwlwifi 0000:03:00.0: Error sending REPLY_ADD_STA: time out after 2000ms.
[252853.328245] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 156
[252853.328258] ieee80211 phy0: failed to remove key (1, ff:ff:ff:ff:ff:ff) from hardware (-110)
[252853.328380] cfg80211: Calling CRDA to update world regulatory domain
[252855.428124] iwlwifi 0000:03:00.0: Error sending REPLY_RXON: time out after 2000ms.
[252855.428135] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 157
[252855.428142] iwlwifi 0000:03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252857.428183] iwlwifi 0000:03:00.0: Error sending POWER_TABLE_CMD: time out after 2000ms.
[252857.428193] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 158
[252857.428201] iwlwifi 0000:03:00.0: set power fail, ret = -110
[252859.428260] iwlwifi 0000:03:00.0: Error sending REPLY_RXON: time out after 2000ms.
[252859.428271] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 159
[252859.428279] iwlwifi 0000:03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252861.428175] iwlwifi 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 2000ms.
[252861.428186] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 160
[252864.428133] iwlwifi 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 2000ms.
[252864.428144] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 161
[252867.428027] iwlwifi 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 2000ms.
[252867.428032] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 162
[252870.428036] iwlwifi 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 2000ms.
[252870.428041] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 163
[252873.428063] iwlwifi 0000:03:00.0: Error sending REPLY_SCAN_CMD: time out after 2000ms.
[252873.428074] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 164
[252874.429359] iwlwifi 0000:03:00.0: No space in command queue
[252874.429364] iwlwifi 0000:03:00.0: Restarting adapter queue is full
[252874.429373] iwlwifi 0000:03:00.0: Error sending REPLY_SCAN_CMD: enqueue_hcmd failed: -28
[252874.433273] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
...
[252874.792578] ieee80211 phy0: Hardware restart was requested
[252874.792628] iwlwifi 0000:03:00.0: L1 Disabled; Enabling L0S
[252874.796429] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[252874.796429] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[252874.836786] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[252874.846994] iwlwifi 0000:03:00.0: Radio type=0x0-0x2-0x0
[252874.850986] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
...
[252875.485413] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[252880.496046] iwlwifi 0000:03:00.0: Could not load the INST uCode section
[252880.496052] iwlwifi 0000:03:00.0: Failed to start RT ucode: -110
[252880.503267] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
...
[252880.845613] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[252880.859799] iwlwifi 0000:03:00.0: Unable to initialize device.
[252880.859825] iwlwifi 0000:03:00.0: Request scan called when driver not ready.
...
until I hit the kill switch, at which point syslog shows:

[253188.968042] ------------[ cut here ]------------
[253188.968056] WARNING: at /build/buildd-linux-2.6_3.2.1-2-amd64-kK3kdc/linux-2.6-3.2.1/debian/build/source_amd64_none/drivers/net/wireless/iwlwifi/iwl-core.c:1330 iwlagn_mac_remove_interface+0x48/0xdd [iwlwifi]()
[253188.968060] Hardware name: 74695KG
[253188.968062] Modules linked in: iwlwifi mac80211 cfg80211 hidp hid tun acpi_cpufreq mperf cpufreq_stats cpufreq_userspace cpufreq_powersave cpufreq_conservative rfcomm bnep parport_pc ppdev lp parport autofs4 binfmt_misc uinput fuse nfsd nfs lockd fscache auth_rpcgss nfs_acl sunrpc coretemp loop kvm_intel kvm btusb bluetooth crc16 snd_hda_codec_conexant snd_hda_intel snd_hda_codec arc4 snd_hwdep snd_pcm_oss snd_mixer_oss thinkpad_acpi snd_pcm snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq i915 snd_timer snd_seq_device evdev snd iTCO_wdt drm_kms_helper drm rfkill i2c_algo_bit i2c_i801 ac soundcore battery nvram tpm_tis tpm tpm_bios iTCO_vendor_support i2c_core snd_page_alloc power_supply video button psmouse serio_raw processor wmi xfs sha256_generic cryptd aes_x86_64 aes_generic cbc dm_crypt dm_mod sd_mod crc_t10dif uhci_hcd ahci libahci ehci_hcd thermal thermal_sys usbcore ata_generic libata scsi_mod e1000e usb_common [last unloaded: cfg80211]
[253188.968150] Pid: 28201, comm: kworker/0:1 Not tainted 3.2.0-1-amd64 #1
[253188.968153] Call Trace:
[253188.968160] [<ffffffff810467ed>] ? warn_slowpath_common+0x78/0x8c
[253188.968168] [<ffffffffa0390997>] ? iwlagn_mac_remove_interface+0x48/0xdd [iwlwifi]
[253188.968176] [<ffffffffa0227301>] ? rfkill_restore_states+0x7/0x47 [rfkill]
[253188.968188] [<ffffffffa061f05b>] ? ieee80211_do_stop+0x30d/0x45e [mac80211]
[253188.968192] [<ffffffff8104bcdf>] ? _local_bh_enable_ip.isra.11+0x1e/0x88
[253188.968198] [<ffffffffa022738b>] ? spin_unlock_irq+0xb/0xb [rfkill]
[253188.968208] [<ffffffffa061f1be>] ? ieee80211_stop+0x12/0x16 [mac80211]
[253188.968217] [<ffffffffa061f1ac>] ? ieee80211_do_stop+0x45e/0x45e [mac80211]
[253188.968223] [<ffffffff8127fccc>] ? __dev_close_many+0x84/0xb0
[253188.968226] [<ffffffff8127fdc7>] ? dev_close_many+0x88/0xee
[253188.968230] [<ffffffff810363ab>] ? should_resched+0x5/0x23
[253188.968234] [<ffffffff81282031>] ? dev_close+0x37/0x46
[253188.968241] [<ffffffffa0289c66>] ? cfg80211_rfkill_set_block+0x3d/0x62 [cfg80211]
[253188.968247] [<ffffffffa0226be1>] ? rfkill_set_block+0x7d/0xf0 [rfkill]
[253188.968252] [<ffffffffa0226d8f>] ? __rfkill_switch_all+0x33/0x55 [rfkill]
[253188.968258] [<ffffffffa0227266>] ? rfkill_switch_all+0x33/0x48 [rfkill]
[253188.968264] [<ffffffffa0227489>] ? rfkill_op_handler+0xfe/0x12d [rfkill]
[253188.968268] [<ffffffff8105adc1>] ? process_one_work+0x163/0x284
[253188.968272] [<ffffffff8105bd89>] ? worker_thread+0xc2/0x145
[253188.968276] [<ffffffff8105bcc7>] ? manage_workers.isra.23+0x15b/0x15b
[253188.968280] [<ffffffff8105eec5>] ? kthread+0x76/0x7e
[253188.968285] [<ffffffff813473b4>] ? kernel_thread_helper+0x4/0x10
[253188.968289] [<ffffffff8105ee4f>] ? kthread_worker_fn+0x139/0x139
[253188.968293] [<ffffffff813473b0>] ? gs_change+0x13/0x13
[253188.968295] ---[ end trace 52cc41750673642a ]---
[253188.968300] iwlwifi 0000:03:00.0: ctx->vif = (null), vif = ffff88001db5cdf0
[253188.968303] iwlwifi 0000:03:00.0: ID = 0: ctx = ffff880091cbb4b0 ctx->vif = (null)

a bit later, some more of

[253216.532067] iwlwifi 0000:03:00.0: L1 Disabled; Enabling L0S
[253216.536016] iwlwifi 0000:03:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0xFFFFFFFF
[253216.586757] iwlwifi 0000:03:00.0: Radio type=0x0-0x2-0x0

and other old friends follow until I decide to try reloading the module:

[253268.602408] cfg80211: Calling CRDA to update world regulatory domain
[253268.617695] Intel(R) Wireless WiFi Link AGN driver for Linux, in-tree:
[253268.617698] Copyright(c) 2003-2011 Intel Corporation
[253268.617758] iwlwifi 0000:03:00.0: enabling device (0000 -> 0002)
[253268.617768] iwlwifi 0000:03:00.0: PCI INT A -> GSI 17 (level, low) -> IRQ 17
[253268.617781] iwlwifi 0000:03:00.0: setting latency timer to 64
[253268.617811] iwlwifi 0000:03:00.0: pci_resource_len = 0x00002000
[253268.617814] iwlwifi 0000:03:00.0: pci_resource_base = ffffc90005094000
[253268.617817] iwlwifi 0000:03:00.0: HW Revision ID = 0x0
[253268.617899] iwlwifi 0000:03:00.0: irq 43 for MSI/MSI-X
[253268.617960] iwlwifi 0000:03:00.0: Detected Intel(R) Ultimate N WiFi Link 5300 AGN, REV=0xFFFFFFFF
[253268.618003] iwlwifi 0000:03:00.0: Unknown hardware type
[253268.618005] iwlwifi 0000:03:00.0: Unable to init EEPROM
[253268.618040] iwlwifi 0000:03:00.0: PCI INT A disabled
[253268.618046] iwlwifi: probe of 0000:03:00.0 failed with error -2
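
One pattern in the log above may be worth pulling out for whoever looks at this: read_ptr stays at 134 while write_ptr climbs from 135 to 164, right up to "No space in command queue". That would be consistent with commands being queued but never consumed, though this is only a reading of the log, not a diagnosis. A minimal sketch for tabulating it from a saved log, where `PTR_RE` and `queue_backlog` are hypothetical names and index wrap-around is ignored (it does not occur in this excerpt):

```python
import re

# Matches the "Current CMD queue" lines as they appear in the log above.
PTR_RE = re.compile(
    r"\[\s*(\d+\.\d+)\] iwlwifi \S+ "
    r"Current CMD queue read_ptr (\d+) write_ptr (\d+)"
)


def queue_backlog(log_text):
    """Return (timestamp, read_ptr, write_ptr, backlog) per queue report.

    backlog is simply write_ptr - read_ptr; wrap-around of the queue
    indices is NOT handled (an assumption that holds for this excerpt).
    """
    rows = []
    for match in PTR_RE.finditer(log_text):
        ts = float(match.group(1))
        read_ptr = int(match.group(2))
        write_ptr = int(match.group(3))
        rows.append((ts, read_ptr, write_ptr, write_ptr - read_ptr))
    return rows


# Three queue reports copied from the log above, used as sample input.
sample = """\
[252832.820229] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 135
[252835.320331] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 137
[252873.428074] iwlwifi 0000:03:00.0: Current CMD queue read_ptr 134 write_ptr 164
"""

for ts, read_ptr, write_ptr, backlog in queue_backlog(sample):
    print(f"{ts:.6f}  read={read_ptr}  write={write_ptr}  backlog={backlog}")
```

On these three sample lines the backlog column reads 1, 3 and 30.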