On Jun 8, 2010, at 12:08 PM, Dianne Yumul wrote: > Hello, > > I'm getting the same thing on one of our servers since upgrading to CentOS > 5.5: > > INFO: task pdflush:21249 blocked for more than 120 seconds. > "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. > pdflush D 00001EE1 3540 21249 11 21226 (L-TLB) > f0b59f24 00000046 af2b024f 00001ee1 c04ec500 00000000 c041e314 0000000a > f76ae550 af2b2566 00001ee1 00002317 00000001 f76ae65c c180dc44 c198e200 > 00000000 c180e5e4 f0b59fa8 f76ae550 c180dc44 c061bbc0 f76ae550 ffffffff > Call Trace: > [<c04ec500>] __next_cpu+0x12/0x21 > [<c041e314>] find_busiest_group+0x177/0x462 > [<c061bbc0>] schedule+0xbc/0xa55 > [<c061d981>] rwsem_down_read_failed+0x128/0x143 > [<c04389ef>] .text.lock.rwsem+0x35/0x3a > [<c047c014>] sync_supers+0x2f/0xb8 > [<c045df9c>] wb_kupdate+0x36/0x10f > [<c045e431>] pdflush+0x0/0x1a3 > [<c045e53c>] pdflush+0x10b/0x1a3 > [<c045df66>] wb_kupdate+0x0/0x10f > [<c0435f43>] kthread+0xc0/0xed > [<c0435e83>] kthread+0x0/0xed > [<c0405c53>] kernel_thread_helper+0x7/0x10 > >> From the bugs already filed, it seems to happen to many (or any?) processes >> and some notice hangups and performance drops. But our system seems okay, >> probably because it has low traffic and is mostly idle. But I'll still >> reboot to the previous kernel version tonight. > > dianne > > On Jun 8, 2010, at 1:04 AM, Ireneusz Piasecki wrote: > >> W dniu 2010-06-08 09:54, Tsuyoshi Nagata pisze: >>> Hi >>> (2010/06/08 5:12), Steve Brooks wrote: >>>> Jun 7 19:45:21 sraid3 kernel: [<ffffffff800ec2a2>] inode_wait+0x0/0xd >>>> Jun 7 19:45:21 sraid3 kernel: [<ffffffff80063ab0>] >>>> out_of_line_wait_on_bit+0x6c/0x78 >>>> Jun 7 19:45:21 sraid3 kernel: [<ffffffff800a0aec>] >>>> wake_bit_function+0x0/0x23 >>>> Jun 7 19:45:21 sraid3 kernel: [<ffffffff8003dbbf>] ifind_fast+0x6e/0x83 >>> This message was created at Linux/fs/inode.c:ifind_fast() >>> The source code was bellows, >>> >>> Linux/fs/inode.c: >>> 912 static struct inode *ifind_fast(struct super_block *sb, >>> 913 struct hlist_head *head, unsigned long ino) >>> 914 { >>> 915 struct inode *inode; >>> 916 >>> 917 *LOCK* spin_lock(&inode_lock);<= This takes >>> 918 inode = find_inode_fast(sb, head, ino);<= more 120s. >>> 919 if (inode) { >>> 920 __iget(inode); >>> 921 *UNLOCK* spin_unlock(&inode_lock); >>> 922 wait_on_inode(inode); >>> 923 return inode; >>> 924 } >>> 925 spin_unlock(&inode_lock); >>> 926 return NULL; >>> 927 } >>> 928 >>> >>> I guess your your file system has a trouble with i-node(file number) >>> resources. >>> CAUSES: >>> Hard Disk trouble (bit error/raid trouble.) >>> i-node trouble (overflow. etc.) >>> Memory/CPU trouble(&inode_lock) >>> >>> Buy Fresh Hard disks& rebuild them is convenience way. >>> Or memtest86 can finds DIMM trouble.(or CPU, mother board) >>> Or ext4 bug in 194.3.1 kernel, back to ext3! >>> >> Ok, then i will test all of my centos 5.5 32 nodes: cpu, ram, disks etc. >> This came with the kernel of Centos 5.5. Before there was'nt such >> errors/warrning. Redhat bugizilla: >> https://bugzilla.redhat.com/show_bug.cgi?id=573106 >> >> I.Piasecki >> >>> -tsuyoshi >>> _______________________________________________ >>> CentOS mailing list >>> CentOS@centos.org >>> http://lists.centos.org/mailman/listinfo/centos >>> >> >> _______________________________________________ >> CentOS mailing list >> CentOS@centos.org >> http://lists.centos.org/mailman/listinfo/centos >> > > _______________________________________________ > CentOS mailing list > CentOS@centos.org > http://lists.centos.org/mailman/listinfo/centos >
Oh crud, I did a top post! So sorry, won't happen again. dianne _______________________________________________ CentOS mailing list CentOS@centos.org http://lists.centos.org/mailman/listinfo/centos