Hmmm...well, it looks from your original error message that Rmpi/snow is using
a single "master" process and then comm_spawn'ing a whole bunch of "workers". I
tried replicating that on a slurm machine by having a single master comm_spawn
a whole bunch of processes, and that worked fine. Of cours
Ah - okay, my misunderstanding. Would you be willing to give the trunk a try?
It might help to know if the problem is solely in 1.6, or continues.
On Jul 26, 2012, at 4:32 PM, Brock Palen wrote:
> I think so, sorry if I gave you the impression that Rmpi changed,
>
> Brock Palen
> www.umich.ed
I think so, sorry if I gave you the impression that Rmpi changed,
Brock Palen
www.umich.edu/~brockp
CAEN Advanced Computing
bro...@umich.edu
(734)936-1985
On Jul 26, 2012, at 7:30 PM, Ralph Castain wrote:
> Guess I'm confused - your original note indicated that something had changed
> in Rmp
Guess I'm confused - your original note indicated that something had changed in
Rmpi that broke things. Are you now saying it was something in OMPI?
On Jul 26, 2012, at 4:22 PM, Brock Palen wrote:
> Ok will see, Rmpi we had working with 1.4 and has not been updated after
> 2010, this this kind
Ok will see, Rmpi we had working with 1.4 and has not been updated after 2010,
this this kinda stinks.
I will keep digging into it thanks for the help.
Brock Palen
www.umich.edu/~brockp
CAEN Advanced Computing
bro...@umich.edu
(734)936-1985
On Jul 26, 2012, at 7:16 PM, Ralph Castain wrote:
Crud - afraid you'll have to ask them, then :-(
On Jul 26, 2012, at 3:50 PM, Brock Palen wrote:
> Ralph,
>
> Rmpi wraps everything up, so I tried setting them with
>
> export OMPI_plm_base_verbose=5
> export OMPI_dpm_base_verbose=5
>
> and I get no extra messages even on helloworld example si
Ralph,
Rmpi wraps everything up, so I tried setting them with
export OMPI_plm_base_verbose=5
export OMPI_dpm_base_verbose=5
and I get no extra messages even on helloworld example simple MPI-1.0 code.
Brock Palen
www.umich.edu/~brockp
CAEN Advanced Computing
bro...@umich.edu
(734)936-1985
O
Well, it looks like comm_spawn is working on 1.6. Afraid I don't know enough
about Rmpi/snow to advise on what changed, but you could add some debug params
to get an idea of where the problem is occurring:
-mca plm_base_verbose 5 -mca dpm_base_verbose 5
should tell you from an OMPI perspective.
Weird - looks like it has done a comm_spawn and having trouble connecting
between the jobs. I can check the basic code and make sure it is working - I
seem to recall someone else recently talking about Rmpi changes causing
problems (different ones than this, IIRC), so you might want to search ou
I have ran into a problem using Rmpi with OpenMPI (trying to get snow running).
I built OpenMPI following another post where I built static:
./configure --prefix=$INSTALL/gcc-4.4.6-static
--mandir=$INSTALL/gcc-4.4.6-static/man --with-tm=/usr/local/torque/
--with-openib --with-psm --enable-stati
10 matches
Mail list logo