Good call, Reuti. Thanks for the expansion and details! > On Aug 10, 2016, at 16:25, Reuti <re...@staff.uni-marburg.de> wrote: > > >> Am 10.08.2016 um 21:46 schrieb MacMullan IV, Hugh: >> >> Hi Ulrich: >> >> I haven't gone past openmpi v1.10, but you'll likely want to change >> 'control_slaves' in your PE conf to 'TRUE', to signal that you have tight >> integration. https://www.open-mpi.org/faq/?category=sge (and probably >> 'job_is_first_task' to 'FALSE') > > Although 'job_is_first_task' set to 'FALSE' is also listed at the Open MPI > website, the usually MPI programming will use rank 0 and so it's save to have > it set to TRUE here. Originally it limited the number of allowed `qrsh > -inherit ...` calls to the slaves (or even locally on the master node of the > parallel job) at times of MPICH(1) (will it allow [n] or [n-1] calls, also to > check proper programming by this). Nowadays Open MPI will start only one > daemon per exechost by `qrsh -inherit ...` and the granted limit won't be > reached anyway. > > -- Reuti > > >> Does that help? >> >> -Hugh >> >> -----Original Message----- >> From: users-boun...@gridengine.org [mailto:users-boun...@gridengine.org] On >> Behalf Of Ulrich Hiller >> Sent: Wednesday, August 10, 2016 3:16 PM >> To: users@gridengine.org >> Subject: [gridengine users] How can i make gridengine not to use ssh? >> >> Hello, >> >> My problem: How can i make gridengine not to use ssh? >> >> Installed: >> openmpi-2.0.0 - configured with sge support. >> gridengine (son of gridengine) 8.1.9-1 >> >> I have a simple openmpi program 'teste' which only gives "hello world" >> output. >> I start it with: >> qsub -pe orte 160 -V -j yes -cwd -S /bin/bash <<< "mpiexec -n 160 teste >>>> /home/ljohndoe/out.dat" >> on the master node. >> I get back the error: >> >> Host key verification failed. >> Host key verification failed. >> Permission denied, please try again. >> Permission denied, please try again. >> Received disconnect from 192.168.117.6: 2: Too many authentication >> failures for johndoe >> Permission denied, please try again. >> Permission denied, please try again. >> Received disconnect from 192.168.117.5: 2: Too many authentication >> failures for johndoe >> [...] >> >> When i configure a passwordless ssh login to the execute nodes >> (exchanging the ssh key from master with 'ssh-copy-id), it works like >> charm. So it obviuously uses ssh connection to the execute nodes. >> >> the output of 'qconf -sconf' contains: >> >> login_shells sh,bash,ksh,csh,tcsh >> qlogin_command builtin >> qlogin_daemon builtin >> rlogin_command builtin >> rlogin_daemon builtin >> rsh_command builtin >> rsh_daemon builtin >> >> (as far as i read this was the problem of a thread some time ago in this >> list. But i seem to have the correct values) >> >> So everything should be fine- or not? >> Also with >> qlogin -l 'h=exec01' >> and >> qrsh -l 'h=exec01' >> i can go without problems to the first node.(called exec01), and i can >> also login to all other execute nodes as well. >> >> Is there anywhere another 'switch' where i can let qsub run _not_ over ssh? >> >> If is is of interest, the output of 'qconf -sp orte' is: >> pe_name orte >> slots 9999999 >> user_lists NONE >> xuser_lists NONE >> start_proc_args NONE >> stop_proc_args NONE >> allocation_rule $round_robin >> control_slaves FALSE >> job_is_first_task TRUE >> urgency_slots min >> accounting_summary FALSE >> qsort_args NONE >> >> Also, i do not have any ssh lines in ~/.profile or ~/.bashrc >> >> >> Kind regards, ulrich >> >> >> >> >> >> >> _______________________________________________ >> users mailing list >> users@gridengine.org >> https://gridengine.org/mailman/listinfo/users >> >> _______________________________________________ >> users mailing list >> users@gridengine.org >> https://gridengine.org/mailman/listinfo/users >
_______________________________________________ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users