Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-27 Thread Ralph Castain
Hmmm...well, it looks from your original error message that Rmpi/snow is using a single "master" process and then comm_spawn'ing a whole bunch of "workers". I tried replicating that on a slurm machine by having a single master comm_spawn a whole bunch of processes, and that worked fine. Of cours

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Ralph Castain
Ah - okay, my misunderstanding. Would you be willing to give the trunk a try? It might help to know if the problem is solely in 1.6, or continues. On Jul 26, 2012, at 4:32 PM, Brock Palen wrote: > I think so, sorry if I gave you the impression that Rmpi changed, > > Brock Palen > www.umich.ed

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Brock Palen
I think so, sorry if I gave you the impression that Rmpi changed, Brock Palen www.umich.edu/~brockp CAEN Advanced Computing bro...@umich.edu (734)936-1985 On Jul 26, 2012, at 7:30 PM, Ralph Castain wrote: > Guess I'm confused - your original note indicated that something had changed > in Rmp

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Ralph Castain
Guess I'm confused - your original note indicated that something had changed in Rmpi that broke things. Are you now saying it was something in OMPI? On Jul 26, 2012, at 4:22 PM, Brock Palen wrote: > Ok will see, Rmpi we had working with 1.4 and has not been updated after > 2010, this this kind

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Brock Palen
Ok will see, Rmpi we had working with 1.4 and has not been updated after 2010, this this kinda stinks. I will keep digging into it thanks for the help. Brock Palen www.umich.edu/~brockp CAEN Advanced Computing bro...@umich.edu (734)936-1985 On Jul 26, 2012, at 7:16 PM, Ralph Castain wrote:

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Ralph Castain
Crud - afraid you'll have to ask them, then :-( On Jul 26, 2012, at 3:50 PM, Brock Palen wrote: > Ralph, > > Rmpi wraps everything up, so I tried setting them with > > export OMPI_plm_base_verbose=5 > export OMPI_dpm_base_verbose=5 > > and I get no extra messages even on helloworld example si

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Brock Palen
Ralph, Rmpi wraps everything up, so I tried setting them with export OMPI_plm_base_verbose=5 export OMPI_dpm_base_verbose=5 and I get no extra messages even on helloworld example simple MPI-1.0 code. Brock Palen www.umich.edu/~brockp CAEN Advanced Computing bro...@umich.edu (734)936-1985 O

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Ralph Castain
Well, it looks like comm_spawn is working on 1.6. Afraid I don't know enough about Rmpi/snow to advise on what changed, but you could add some debug params to get an idea of where the problem is occurring: -mca plm_base_verbose 5 -mca dpm_base_verbose 5 should tell you from an OMPI perspective.

Re: [OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Ralph Castain
Weird - looks like it has done a comm_spawn and having trouble connecting between the jobs. I can check the basic code and make sure it is working - I seem to recall someone else recently talking about Rmpi changes causing problems (different ones than this, IIRC), so you might want to search ou

[OMPI users] OpenMPI and Rmpi/snow

2012-07-26 Thread Brock Palen
I have ran into a problem using Rmpi with OpenMPI (trying to get snow running). I built OpenMPI following another post where I built static: ./configure --prefix=$INSTALL/gcc-4.4.6-static --mandir=$INSTALL/gcc-4.4.6-static/man --with-tm=/usr/local/torque/ --with-openib --with-psm --enable-stati