I think it has something to do with your environment: /etc/hosts, your IT setup,
the return value of the hostname function, etc.
I am not sure it has anything to do with Open MPI at all.
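
If it helps, a quick way to see what the machine reports for its own name is
something like the sketch below. It is only a guess at a useful check, not
anything Open MPI itself runs; the helper names short_name and describe_host
are made up for illustration. If the reported name is "plankton.uzh.ch" while
the rankfile says "plankton", that would be consistent with the error.

```python
import socket


def short_name(hostname: str) -> str:
    """Return the part of a hostname before the first dot."""
    return hostname.split(".", 1)[0]


def describe_host() -> dict:
    """Collect the names this machine reports for itself (hypothetical helper)."""
    reported = socket.gethostname()  # value returned by the hostname call
    fqdn = socket.getfqdn()          # name after resolver / /etc/hosts lookup
    return {
        "reported": reported,
        "fqdn": fqdn,
        "short": short_name(reported),
        "reported_is_short": "." not in reported,
    }


if __name__ == "__main__":
    for key, value in describe_host().items():
        print(f"{key}: {value}")
```

Comparing the "reported" and "short" values against the names used in the
hostfile and rankfile should show whether the two spellings differ.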
Lenny.
On Mon, Aug 17, 2009 at 12:59 PM, jody <jody....@gmail.com> wrote:

> Hi Lenny
>
> Thanks - using the full names makes it work!
> Is there a reason why the rankfile option treats
> host names differently than the hostfile option?
>
> Thanks
>   Jody
>
>
>
> On Mon, Aug 17, 2009 at 11:20 AM, Lenny
> Verkhovsky<lenny.verkhov...@gmail.com> wrote:
> > Hi
> > This message means that you are trying to use the host "plankton", which was
> > not allocated via the hostfile or host list.
> > But according to the files and command line, everything seems fine.
> > Can you try using the hostname "plankton.uzh.ch" instead of "plankton"?
> > Thanks
> > Lenny.
> > On Mon, Aug 17, 2009 at 10:36 AM, jody <jody....@gmail.com> wrote:
> >>
> >> Hi
> >>
> >> When I use a rankfile, I get an error message which I don't understand:
> >>
> >> [jody@plankton tests]$ mpirun -np 3 -rf rankfile -hostfile testhosts ./HelloMPI
> >> --------------------------------------------------------------------------
> >> Rankfile claimed host plankton that was not allocated or
> >> oversubscribed it's slots:
> >>
> >> --------------------------------------------------------------------------
> >> [plankton.uzh.ch:24327] [[44857,0],0] ORTE_ERROR_LOG: Bad parameter in
> >> file rmaps_rank_file.c at line 108
> >> [plankton.uzh.ch:24327] [[44857,0],0] ORTE_ERROR_LOG: Bad parameter in
> >> file base/rmaps_base_map_job.c at line 87
> >> [plankton.uzh.ch:24327] [[44857,0],0] ORTE_ERROR_LOG: Bad parameter in
> >> file base/plm_base_launch_support.c at line 77
> >> [plankton.uzh.ch:24327] [[44857,0],0] ORTE_ERROR_LOG: Bad parameter in
> >> file plm_rsh_module.c at line 990
> >> --------------------------------------------------------------------------
> >> A daemon (pid unknown) died unexpectedly on signal 1  while attempting to
> >> launch so we are aborting.
> >>
> >> There may be more information reported by the environment (see above).
> >>
> >> This may be because the daemon was unable to find all the needed shared
> >> libraries on the remote node. You may set your LD_LIBRARY_PATH to have the
> >> location of the shared libraries on the remote nodes and this will
> >> automatically be forwarded to the remote nodes.
> >> --------------------------------------------------------------------------
> >> --------------------------------------------------------------------------
> >> mpirun noticed that the job aborted, but has no info as to the process
> >> that caused that situation.
> >> --------------------------------------------------------------------------
> >> mpirun: clean termination accomplished
> >>
> >>
> >>
> >> Without the '-rf rankfile' option everything works as expected.
> >>
> >> My hostfile:
> >> [jody@plankton tests]$ cat testhosts
> >> # The following node is a quad-processor machine, and we absolutely
> >> # want to disallow over-subscribing it:
> >> plankton slots=3  max-slots=3
> >> # The following nodes are dual-processor machines:
> >> nano_00  slots=2 max-slots=2
> >> nano_01  slots=2 max-slots=2
> >> nano_02  slots=2 max-slots=2
> >> nano_03  slots=2 max-slots=2
> >> nano_04  slots=2 max-slots=2
> >> nano_05  slots=2 max-slots=2
> >> nano_06  slots=2 max-slots=2
> >>
> >> My rankfile:
> >> [jody@plankton neander]$ cat rankfile
> >> rank  0=nano_00  slot=1
> >> rank  1=plankton slot=0
> >> rank  2=nano_01  slot=1
> >>
> >> My Open MPI version: 1.3.2
> >>
> >> I get the same error if I use a rankfile which has a single line
> >>  rank  0=plankton  slot=0
> >> (plankton is my local machine) and call mpirun with -np 1.
> >>
> >> What does the "Rankfile claimed..." message mean?
> >> Did I make an error in my rankfile?
> >> If yes, what would be the correct way to write it?
> >>
> >> Thank You
> >>  Jody
> >> _______________________________________________
> >> users mailing list
> >> us...@open-mpi.org
> >> http://www.open-mpi.org/mailman/listinfo.cgi/users
