I happened to misspell a hostname, then it hanged.
[mishima@manage ~]$ mpirun -np 6 -host node05,nod06 ~/mis/openmpi/demos/myprog nod06: Unknown host mpirun: abort is already in progress...hit ctrl-c again to forcibly terminate Tetsuya > No problem - we appreciate you taking the time to confirm. Jeff encountered something late today, so we may indeed still have a lingering problem. :-( > > Will keep you posted > Ralph > > > On Mar 13, 2014, at 5:08 PM, tmish...@jcity.maeda.co.jp wrote: > > > > > > > Hi Ralph, I'm late to your release again due to TD. > > > > At that time, I manually applied #4386 and #4383 to 1.7 branch > > - namely openmpi-1.7.5rc2, and did the check. I might have > > made some mistake. > > > > Now, I found openmpi-1.7.5rc3 had just released and confirmed > > it worked fine, thanks. > > > > Tetsuya > > > >> It's okay - we thought we had it fixed, but not for that scenario. > >> > >> On Mar 12, 2014, at 9:02 PM, tmish...@jcity.maeda.co.jp wrote: > >> > >>> > >>> > >>> Sorry for disturbing, please keep going ... > >>> > >>> Tetsuya > >>> > >>>> Yes, I know - I am just finishing the fix now. > >>>> > >>>> On Mar 12, 2014, at 8:48 PM, tmish...@jcity.maeda.co.jp wrote: > >>>> > >>>>> > >>>>> > >>>>> Hi Ralph, this problem is not fixed completely by today's latest > >>>>> ticket #4383, I guess ... > >>>>> > >>>>> https://svn.open-mpi.org/trac/ompi/ticket/4383 > >>>>> > >>>>> For example, in case of returing with ORTE_ERR_SILENT from the line > > 514 > >>>>> in rmaps_rr_mapper.c file, the problem still occurs. I executed the > > job > >>>>> under the unmanaged condition - rsh without torque: > >>>>> > >>>>> [mishima@manage openmpi-1.7.5rc2]$ mpirun -np 6 -host node05,node06 > >>>>> -nooversubscribe ~/mis/openmpi/demos/myprog > >>>>> > >>> > > -------------------------------------------------------------------------- > >>>>> There are not enough slots available in the system to satisfy the 6 > >>> slots > >>>>> that were requested by the application: > >>>>> /home/mishima/mis/openmpi/demos/myprog > >>>>> > >>>>> Either request fewer slots for your application, or make more slots > >>>>> available > >>>>> for use. > >>>>> > >>> > > -------------------------------------------------------------------------- > >>>>> Abort is in progress...hit ctrl-c again within 5 seconds to forcibly > >>>>> terminate > >>>>> Abort is in progress...hit ctrl-c again within 5 seconds to forcibly > >>>>> terminate > >>>>> ..... > >>>>> > >>>>> rmaps_rr_mapper.c: > >>>>> 509 /* quick check to see if we can map all the procs */ > >>>>> 510 if (num_slots < (app->num_procs * > >>>>> orte_rmaps_base.cpus_per_rank)) { > >>>>> 511 if (ORTE_MAPPING_NO_OVERSUBSCRIBE & > >>>>> ORTE_GET_MAPPING_DIRECTIVE(jdata->map->mapping)) { > >>>>> 512 orte_show_help("help-orte-rmaps-base.txt", > >>>>> "orte-rmaps-base:alloc-error", > >>>>> 513 true, app->num_procs, app->app); > >>>>> 514 return ORTE_ERR_SILENT; > >>>>> 515 } > >>>>> > >>>>> > >>>>> Tetsuya > >>>>> > >>>>> _______________________________________________ > >>>>> users mailing list > >>>>> us...@open-mpi.org > >>>>> http://www.open-mpi.org/mailman/listinfo.cgi/users > >>>> > >>>> _______________________________________________ > >>>> users mailing list > >>>> us...@open-mpi.org > >>>> http://www.open-mpi.org/mailman/listinfo.cgi/users > >>> > >>> _______________________________________________ > >>> users mailing list > >>> us...@open-mpi.org > >>> http://www.open-mpi.org/mailman/listinfo.cgi/users > >> > >> _______________________________________________ > >> users mailing list > >> us...@open-mpi.org > >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > _______________________________________________ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users