Re: PROBLEM: kernel BUG at mm/rmap.c:522!

Francesco Ricci Wed, 18 Apr 2007 03:06:09 -0700

>That by itself would suggest a single-bit error, which would point
>you to running memtest86 overnight to check your RAM.  Worth a try.


ok,
I've finished today the long testing on my ram with memtest86+ 1.65
(distributed in debian): no errors.
so I think the problem is somewere else...

tnx for your time!

Hugh Dickins <[EMAIL PROTECTED]> writes:
>On Fri, 13 Apr 2007, Francesco Ricci wrote:
>> 
>> [7.7.] Other information that might be relevant to the problem
>>        (please look in /proc and include all information that you
>>        think to be relevant):
>
>Thanks for your report, and for your patience in supplying all that
>scarcely relevant info you were asked for.
>
>> 
>> from dmesg:
>> Apr 13 11:31:35 localhost kernel: Bad pte = 461b43d4, process = ???,
>> vm_flags = 75, vaddr = b7868000
>
>Oh dear, one of your page tables has got corrupted, the page table
>entry for virtual address b7868000 contains 461b43d4 - nonsense.
>
>> Apr 13 11:31:35 localhost kernel:  [<c014b0ea>] vm_normal_page+0x3e/0x53
>> Apr 13 11:31:35 localhost kernel:  [<c014b6fa>] unmap_vmas+0x183/0x4af
>> Apr 13 11:31:35 localhost kernel:  [<c014de31>] exit_mmap+0x6a/0xd7
>> Apr 13 11:31:35 localhost kernel:  [<c011b217>] mmput+0x20/0x76
>> Apr 13 11:31:35 localhost kernel:  [<c011fa05>] do_exit+0x193/0x71b
>> Apr 13 11:31:35 localhost kernel:  [<c0120003>] sys_exit_group+0x0/0xd
>> Apr 13 11:31:35 localhost kernel:  [<c0127a6d>]
>> get_signal_to_deliver+0x395/0x3bc
>> Apr 13 11:31:35 localhost kernel:  [<c01023a6>]
>do_notify_resume+0x71/0x5d7
>> Apr 13 11:31:35 localhost kernel:  [<c0117778>]
>> default_wake_function+0x0/0xc
>> Apr 13 11:31:35 localhost kernel:  [<c0124657>]
>do_gettimeofday+0x31/0xce
>> Apr 13 11:31:35 localhost kernel:  [<c0131ffc>] sys_futex+0xdc/0xf1
>> Apr 13 11:31:35 localhost kernel:  [<c0102d0a>] work_notifysig+0x13/0x19
>> Apr 13 11:31:35 localhost kernel: ------------[ cut here ]------------
>> Apr 13 11:31:35 localhost kernel: kernel BUG at mm/rmap.c:522!
>> Apr 13 11:31:35 localhost kernel: invalid opcode: 0000 [#1]
>> Apr 13 11:31:35 localhost kernel: SMP 
>> Apr 13 11:31:35 localhost kernel: Modules linked in: smbfs ext3 jbd
>> mbcache mga drm ppdev lp button ac battery ipv6 fuse dm_snapshot
>dm_mirror
>> dm_mod loop tsdev snd_via82xx gameport snd_ac97_codec snd_ac97_bus
>> snd_pcm_oss snd_mixer_oss snd_pcm snd_page_alloc snd_mpu401_uart
>> snd_seq_dummy snd_seq_oss snd_seq_midi snd_seq_midi_event snd_seq
>> i2c_viapro i2c_core snd_timer snd_rawmidi snd_seq_device via_agp
>> parport_pc parport via_ircc psmouse serio_raw floppy snd soundcore
>pcspkr
>> rtc agpgart shpchp pci_hotplug irda crc_ccitt evdev reiserfs ide_cd
>cdrom
>> ide_disk generic via_rhine mii ehci_hcd uhci_hcd via82cxxx ide_core
>> usbcore thermal processor fan
>> Apr 13 11:31:35 localhost kernel: CPU:    0
>> Apr 13 11:31:35 localhost kernel: EIP:    0060:[<c01506c9>]    Not
>tainted
>> VLI
>> Apr 13 11:31:35 localhost kernel: EFLAGS: 00210286   (2.6.18-4-686 #1) 
>> Apr 13 11:31:35 localhost kernel: EIP is at page_remove_rmap+0x14/0x2d
>> Apr 13 11:31:35 localhost kernel: eax: ffffffff   ebx: c1000000   ecx:
>> c1000000   edx: 00000000
>> Apr 13 11:31:35 localhost kernel: esi: b7869000   edi: 00000000   ebp:
>> d08011a4   esp: c9be9e14
>> Apr 13 11:31:35 localhost kernel: ds: 007b   es: 007b   ss: 0068
>> Apr 13 11:31:35 localhost kernel: Process iceape-bin (pid: 20500,
>> ti=c9be8000 task=c2128aa0 task.ti=c9be8000)
>> Apr 13 11:31:35 localhost kernel: Stack: c014b7d5 00000000 df10d278
>> c9be9e7c 00000000 00000001 b7884000 c4bc7b78 
>> Apr 13 11:31:35 localhost kernel:        e80fbe40 c16058a0 00000000
>> ffffffe2 c121002c c4bc7b78 0011ed07 b7884000 
>> Apr 13 11:31:35 localhost kernel:        00000000 c9be9e7c df4c7710
>> e80fbe40 c9be9eb8 c014de31 ffffffff c9be9e78 
>
>And the next page table entry, for virtual address b7869000, has also
>got corrupted: I'm rather guessing, but I believe the c1000000 implies
>it's looking at pfn 0, so the page table entry in question would be
>that 00000001 seen on the stack (1 for present).
>
>That by itself would suggest a single-bit error, which would point
>you to running memtest86 overnight to check your RAM.  Worth a try.
>
>But the 461b43d4 before it suggests corruption from elsewhere in
>the kernel: sorry, I've no clue on that.  Just wait and see if this
>happens again, and whether a pattern emerges - unless someone else
>can suggest something better.
>
>Hugh
>
>> Apr 13 11:31:35 localhost kernel: Call Trace:
>> Apr 13 11:31:35 localhost kernel:  [<c014b7d5>] unmap_vmas+0x25e/0x4af
>> Apr 13 11:31:35 localhost kernel:  [<c014de31>] exit_mmap+0x6a/0xd7
>> Apr 13 11:31:35 localhost kernel:  [<c011b217>] mmput+0x20/0x76
>> Apr 13 11:31:35 localhost kernel:  [<c011fa05>] do_exit+0x193/0x71b
>> Apr 13 11:31:35 localhost kernel:  [<c0120003>] sys_exit_group+0x0/0xd
>> Apr 13 11:31:35 localhost kernel:  [<c0127a6d>]
>> get_signal_to_deliver+0x395/0x3bc
>> Apr 13 11:31:35 localhost kernel:  [<c01023a6>]
>do_notify_resume+0x71/0x5d7
>> Apr 13 11:31:35 localhost kernel:  [<c0117778>]
>> default_wake_function+0x0/0xc
>> Apr 13 11:31:35 localhost kernel:  [<c0124657>]
>do_gettimeofday+0x31/0xce
>> Apr 13 11:31:35 localhost kernel:  [<c0131ffc>] sys_futex+0xdc/0xf1
>> Apr 13 11:31:35 localhost kernel:  [<c0102d0a>] work_notifysig+0x13/0x19
>> Apr 13 11:31:35 localhost kernel: Code: ff ff 85 c0 89 c6 75 c9 b0 01 86
>> 43 28 83 c4 20 89 e8 5b 5e 5f 5d c3 89 c1 90 83 40 08 ff 0f 98 c0 84 c0
>74
>> 1e 8b 41 08 40 79 08 <0f> 0b 0a 02 40 ab 29 c0 8b 51 10 89 c8 83 f2 01
>83
>> e2 01 e9 97 
>> Apr 13 11:31:35 localhost kernel: EIP: [<c01506c9>]
>> page_remove_rmap+0x14/0x2d SS:ESP 0068:c9be9e14
>> Apr 13 11:31:35 localhost kernel:  <1>Fixing recursive fault but reboot
>is
>> needed!


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Re: PROBLEM: kernel BUG at mm/rmap.c:522!

Reply via email to