Re: [OMPI users] OpenMPI / SLURM Job Issues

2007-06-27 Thread Jeff Pummill
Thanks for the info Tim. That worked perfectly. And I now have the OpenMPI FAQ page bookmarked ;-) Jeff F. Pummill Tim Prins wrote: Hi Jeff, If you submit a batch script, there is no need to do a salloc. See the Open MPI FAQ for details on how to run on SLURM: http://www.open-mpi.org/fa

Re: [OMPI users] OpenMPI / SLURM Job Issues

2007-06-27 Thread Tim Prins
Hi Jeff, If you submit a batch script, there is no need to do a salloc. See the Open MPI FAQ for details on how to run on SLURM: http://www.open-mpi.org/faq/?category=slurm Hope this helps. Tim On Wednesday 27 June 2007 14:21, Jeff Pummill wrote: > Hey Jeff, > > Finally got my test nodes back

Re: [OMPI users] OpenMPI / SLURM Job Issues

2007-06-27 Thread Jeff Pummill
Hey Jeff, Finally got my test nodes back and was looking at the info you sent. On the SLURM page, it states the following: *Open MPI* relies upon SLURM to allocate resources for the job and then mpirun to initiate the tasks. When using salloc command, mpirun's -nol

Re: [OMPI users] OpenMPI / SLURM Job Issues

2007-06-21 Thread Jeff Pummill
Thanks for the info Jeff! All of my "test" nodes are temporarily busy, but I should be able to play with this some more tomorrow. I'll update the post if I have more questions or find any additional tips ;-) Jeff F. Pummill Senior Linux Cluster Administrator University of Arkansas Fayettevil

Re: [OMPI users] OpenMPI / SLURM Job Issues

2007-06-21 Thread Jeff Squyres
Ick; I'm surprised that we don't have this info on the FAQ. I'll try to rectify that shortly. How are you launching your jobs through SLURM? OMPI currently does not support the "srun -n X my_mpi_application" model for launching MPI jobs. You must either use the -A option to srun (i.e., g

[OMPI users] OpenMPI / SLURM Job Issues

2007-06-20 Thread Jeff Pummill
Just started working with OpenMPI / SLURM combo this morning. I can successfully launch this job from the command line and it runs to completion, but when launching from SLURM they hang. They appear to just sit with no load apparent on the compute nodes even though SLURM indicates they are run