On 11.02.2013 at 12:26, Pierre Lindenbaum wrote:

> <snip>
> and I've changed `shell_start_mode posix_compliant` to `unix_behavior`
> using `qconf -mconf`. (However, shell_start_mode is still listed as
> posix_compliant.)

AFAIK this is deprecated at the configuration level, as the setting moved to the queue definition: `qconf -mq all.q`.

> Now, qsh -pe orte 4 works
>
> qsh -pe orte 4

Is a plain `qsh` working for you? This is an old startup method; because of its insecure X11 startup it shouldn't be used any longer, IMO.

> Your job 84581 ("INTERACTIVE") has been submitted
> waiting for interactive job to be scheduled ...
> Your interactive job 84581 has been successfully scheduled.
>
>
> (should I run that command before running any new mpirun command?)
>
> when invoking:
>
> qsub -cwd -pe orte 7 with-a-shell.sh
> or
> qrsh -cwd -pe orte 100 /commun/data/packages/openmpi/bin/mpirun
> /path/to/a.out arg1 arg2 arg3 ....
>
> that works too! Thank you! :-)
>
>
> queuename                      qtype resv/used/tot. load_avg arch          states
> ---------------------------------------------------------------------------------
> all.q@node01                   BIP   0/15/64        2.76     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    15
> ---------------------------------------------------------------------------------
> all.q@node02                   BIP   0/14/64        3.89     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    14
> ---------------------------------------------------------------------------------
> all.q@node03                   BIP   0/14/64        3.23     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    14
> ---------------------------------------------------------------------------------
> all.q@node04                   BIP   0/14/64        3.68     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    14
> ---------------------------------------------------------------------------------
> all.q@node05                   BIP   0/15/64        2.91     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    15
> ---------------------------------------------------------------------------------
> all.q@node06                   BIP   0/14/64        3.91     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    14
> ---------------------------------------------------------------------------------
> all.q@node07                   BIP   0/14/64        3.79     lx24-amd64
>   84598 0.55500 mpirun     lindenb      r     02/11/2013 12:03:36    14
>
>
>
> OK, my first openmpi program works. But as far as I can see: it is faster
> when invoked on the master node (~3.22min) than when invoked by means of SGE
> (~7H45):

So it's 7:45 versus 3:32, both in minutes:seconds, right? Are all machines the same regarding speed and core count?

BTW: when running interactively in SGE, the environment variables might not be set if you use `qrsh` without a command, or `qlogin`. Some default hostfile will then be used instead (unless you provide one). With the command supplied, as below, it should be fine.

-- Reuti

> time /commun/data/packages/openmpi/bin/mpirun -np 7 /path/to/a.out arg1
> arg2 arg3 ....
> 670.985u 64.929s 3:32.36 346.5% 0+0k 16322112+6560io 32pf+0w
>
> time qrsh -cwd -pe orte 7 /commun/data/packages/openmpi/bin/mpirun
> /path/to/a.out arg1 arg2 arg3 ....
> 0.023u 0.036s 7:45.05 0.0% 0+0k 1496+0io 1pf+0w
>
>
>
> I'm going to investigate this... :-)
>
> Thank you again
>
> Pierre
>
>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
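
One way to check the point about environment variables is to print what SGE exported inside the job before starting `mpirun`. The following is only a minimal sketch: it assumes SGE sets `PE_HOSTFILE` for parallel jobs, with one line per host in the form "hostname slots queue processor_range". The function name `show_sge_slots` is hypothetical and not part of SGE or Open MPI.

```shell
#!/bin/sh
# Hypothetical helper (not part of SGE or Open MPI): report the slots
# that SGE granted to the current parallel job.
# Assumption: PE_HOSTFILE points to a file whose lines look like
#   "hostname slots queue processor_range"
show_sge_slots() {
    if [ -z "${PE_HOSTFILE:-}" ] || [ ! -r "$PE_HOSTFILE" ]; then
        echo "PE_HOSTFILE not set or unreadable: not inside a parallel SGE job?" >&2
        return 1
    fi
    # Column 1 is the host name, column 2 the number of granted slots.
    awk '{ printf "%s %d\n", $1, $2; total += $2 }
         END { printf "total %d\n", total }' "$PE_HOSTFILE"
}
```

Calling `show_sge_slots` at the top of a script submitted with `qsub -cwd -pe orte 7` should list the granted hosts and their slot counts. In a session started with `qlogin`, or with `qrsh` without a command, it may instead report that the variable is missing.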