How can I fix the error if all processes open their backing files for mmap
on NSF like you said?

Vincent

On Thu, Oct 23, 2014 at 10:35 PM, Joshua Ladd <jladd.m...@gmail.com> wrote:

> It's not coming from OSHMEM but from the OPAL "shmem" framework. You are
> going to get terrible performance - possibly slowing to a crawl having all
> processes open their backing files for mmap on NSF. I think that's the
> error that he's getting.
>
>
> Josh
>
> On Thu, Oct 23, 2014 at 6:06 AM, Vinson Leung <lwhvinson1...@gmail.com>
> wrote:
>
>> HI, Thanks for your reply:)
>> I really run an MPI program (compile with OpenMPI and run with "mpirun -n
>> 8 ......"). My OpenMPI version is 1.8.3 and my program is Gromacs. BTW,
>> what is OSHMEM ?
>>
>> Best
>> Vincent
>>
>> On Thu, Oct 23, 2014 at 12:21 PM, Ralph Castain <r...@open-mpi.org> wrote:
>>
>>> From your error message, I gather you are not running an MPI program,
>>> but rather an OSHMEM one? Otherwise, I find the message strange as it only
>>> would be emitted from an OSHMEM program.
>>>
>>> What version of OMPI are you trying to use?
>>>
>>> On Oct 22, 2014, at 7:12 PM, Vinson Leung <lwhvinson1...@gmail.com>
>>> wrote:
>>>
>>> Thanks for your reply:)
>>> Follow your advice I tried to set the TMPDIR to /var/tmp and /dev/shm
>>> and even reset to /tmp (I get the system permission), the problem still
>>> occur (CPU utilization still lower than 20%). I have no idea why and ready
>>> to give up OpenMPI instead of using other MPI library.
>>>
>>> --------Old Message-------------
>>>
>>> Date: Tue, 21 Oct 2014 22:21:31 -0400
>>> From: Brock Palen <bro...@umich.edu>
>>> To: Open MPI Users <us...@open-mpi.org>
>>> Subject: Re: [OMPI users] low CPU utilization with OpenMPI
>>> Message-ID: <cc54135d-0cfe-440a-8df2-06b587e17...@umich.edu>
>>> Content-Type: text/plain; charset=us-ascii
>>>
>>> Doing special files on NFS can be weird,  try the other /tmp/ locations:
>>>
>>> /var/tmp/
>>> /dev/shm  (ram disk careful!)
>>>
>>> Brock Palen
>>> www.umich.edu/~brockp
>>> CAEN Advanced Computing
>>> XSEDE Campus Champion
>>> bro...@umich.edu
>>> (734)936-1985
>>>
>>>
>>>
>>> > On Oct 21, 2014, at 10:18 PM, Vinson Leung <lwhvinson1...@gmail.com>
>>> wrote:
>>> >
>>> > Because of permission reason (OpenMPI can not write temporary file to
>>> the default /tmp directory), I change the TMPDIR to my local directory
>>> (export TMPDIR=/home/user/tmp ) and then the MPI program can run. But the
>>> CPU utilization is very low under 20% (8 MPI rank running in Intel Xeon
>>> 8-core CPU).
>>> >
>>> > And I also got some message when I run with OpenMPI:
>>> > [cn3:28072] 9 more processes have sent help message
>>> help-opal-shmem-mmap.txt / mmap on nfs
>>> > [cn3:28072] Set MCA parameter "orte_base_help_aggregate" to 0 to see
>>> all help / error messages
>>> >
>>> > Any idea?
>>> > Thanks
>>> >
>>> > VIncent
>>> > _______________________________________________
>>> > users mailing list
>>> > us...@open-mpi.org
>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> > Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2014/10/25548.php
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2014/10/25555.php
>>>
>>>
>>>
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2014/10/25556.php
>>>
>>
>>
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>> Link to this post:
>> http://www.open-mpi.org/community/lists/users/2014/10/25558.php
>>
>
>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post:
> http://www.open-mpi.org/community/lists/users/2014/10/25560.php
>

Reply via email to