Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-09-05 Thread Yong Qin
Yes, so far this has only been observed in VASP with one specific dataset. Thanks, On Wed, Sep 5, 2012 at 4:52 AM, Yevgeny Kliteynik wrote: > On 9/4/2012 7:21 PM, Yong Qin wrote: >> On Tue, Sep 4, 2012 at 5:42 AM, Yevgeny Kliteynik >> wrote: >>> On 8/30/2012 10:28 PM, Yong Qin wrote: On Thu,

Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-09-05 Thread Paul Kapinos
Yevgeny, we at RZ Aachen also see problems very similar to those described in Yong Qin's initial posting, on VASP with Open MPI 1.5.3. We're currently looking for a data set able to reproduce this. I'll write an email if we find one. Best, Paul On 09/05/12 13:52, Yevgeny Kliteynik wrote: I'm

Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-09-05 Thread Yevgeny Kliteynik
On 9/4/2012 7:21 PM, Yong Qin wrote: > On Tue, Sep 4, 2012 at 5:42 AM, Yevgeny Kliteynik > wrote: >> On 8/30/2012 10:28 PM, Yong Qin wrote: >>> On Thu, Aug 30, 2012 at 5:12 AM, Jeff Squyres wrote: On Aug 29, 2012, at 2:25 PM, Yong Qin wrote: > This issue has been observed on OMPI

Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-09-04 Thread Yong Qin
On Tue, Sep 4, 2012 at 5:42 AM, Yevgeny Kliteynik wrote: > On 8/30/2012 10:28 PM, Yong Qin wrote: >> On Thu, Aug 30, 2012 at 5:12 AM, Jeff Squyres wrote: >>> On Aug 29, 2012, at 2:25 PM, Yong Qin wrote: >>> This issue has been observed on OMPI 1.6 and 1.6.1 with openib btl but not on 1.

Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-09-04 Thread Yevgeny Kliteynik
On 8/30/2012 10:28 PM, Yong Qin wrote: > On Thu, Aug 30, 2012 at 5:12 AM, Jeff Squyres wrote: >> On Aug 29, 2012, at 2:25 PM, Yong Qin wrote: >> >>> This issue has been observed on OMPI 1.6 and 1.6.1 with openib btl but >>> not on 1.4.5 (tcp btl is always fine). The application is VASP and >>> onl

Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-08-30 Thread Yong Qin
On Thu, Aug 30, 2012 at 5:12 AM, Jeff Squyres wrote: > On Aug 29, 2012, at 2:25 PM, Yong Qin wrote: > >> This issue has been observed on OMPI 1.6 and 1.6.1 with openib btl but >> not on 1.4.5 (tcp btl is always fine). The application is VASP and >> only one specific dataset is identified during th

Re: [OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-08-30 Thread Jeff Squyres
On Aug 29, 2012, at 2:25 PM, Yong Qin wrote: > This issue has been observed on OMPI 1.6 and 1.6.1 with openib btl but > not on 1.4.5 (tcp btl is always fine). The application is VASP and > only one specific dataset is identified during the testing, and the OS > is SL 6.2 with kernel 2.6.32-220.23.

[OMPI users] OMPI 1.6.x Hang on khugepaged 100% CPU time

2012-08-29 Thread Yong Qin
Hi, This issue has been observed on OMPI 1.6 and 1.6.1 with openib btl but not on 1.4.5 (tcp btl is always fine). The application is VASP and only one specific dataset is identified during the testing, and the OS is SL 6.2 with kernel 2.6.32-220.23.1.el6.x86_64. The issue is that when a certain ty