On 11.02.2013 at 12:26, Pierre Lindenbaum wrote:
> <snip>
> and I've changed `shell_start_mode posix_compliant` to `unix_behavior`
> using `qconf -mconf`. (However, shell_start_mode is still listed as
> posix_compliant )
AFAIK this setting is deprecated at the global configuration level, as it moved
to the queue definition, which you can edit with `qconf -mq all.q`.
> Now, qsh -pe orte 4 works
>
> qsh -pe orte 4
A plain `qsh` is working for you? This is an old startup method; because of its
insecure X11 startup it shouldn't be used any longer, IMO.
> Your job 84581 ("INTERACTIVE") has been submitted
> waiting for interactive job to be scheduled ...
> Your interactive job 84581 has been successfully scheduled.
>
>
> (should I run that command before running any new mpirun command?)
>
> when invoking:
>
> qsub -cwd -pe orte 7 with-a-shell.sh
> or
> qrsh -cwd -pe orte 100 /commun/data/packages/openmpi/bin/mpirun
> /path/to/a.out arg1 arg2 arg3 ....
>
> that works too ! Thank you ! :-)
>
>
> queuename qtype resv/used/tot. load_avg arch states
>
> ---------------------------------------------------------------------------------
> all.q@node01 BIP 0/15/64 2.76 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 15
>
> ---------------------------------------------------------------------------------
> all.q@node02 BIP 0/14/64 3.89 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 14
>
> ---------------------------------------------------------------------------------
> all.q@node03 BIP 0/14/64 3.23 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 14
>
> ---------------------------------------------------------------------------------
> all.q@node04 BIP 0/14/64 3.68 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 14
>
> ---------------------------------------------------------------------------------
> all.q@node05 BIP 0/15/64 2.91 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 15
>
> ---------------------------------------------------------------------------------
> all.q@node06 BIP 0/14/64 3.91 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 14
>
> ---------------------------------------------------------------------------------
> all.q@node07 BIP 0/14/64 3.79 lx24-amd64
> 84598 0.55500 mpirun lindenb r 02/11/2013 12:03:36 14
>
>
>
> OK, my first openmpi program works. But as far as I can see: it is faster
> when invoked on the master node (~3.22min) than when invoked by means of SGE
> (~7H45):
That's 7:45 versus 3:32, both in minutes:seconds, right?
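To make the comparison concrete, here is a small sketch that converts the two `time` lines quoted at the end of this mail into seconds. It assumes the csh-style layout (user, system, elapsed, percentage); the helper names `elapsed_seconds` and `parse_time_line` are made up for illustration. Note that in the `qrsh` case the user/system figures belong only to the local `qrsh` client, so only the elapsed time is comparable.

```python
def elapsed_seconds(stamp):
    """Convert an elapsed field such as '3:32.36' (M:SS) or 'H:MM:SS' to seconds."""
    seconds = 0.0
    for part in stamp.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def parse_time_line(line):
    """Return (user_s, system_s, elapsed_s) from a csh-style `time` line."""
    user, system, elapsed = line.split()[:3]
    return float(user.rstrip("u")), float(system.rstrip("s")), elapsed_seconds(elapsed)


# The two lines as quoted in this thread.
direct = parse_time_line("670.985u 64.929s 3:32.36 346.5% 0+0k 16322112+6560io 32pf+0w")
via_sge = parse_time_line("0.023u 0.036s 7:45.05 0.0% 0+0k 1496+0io 1pf+0w")

# Wall-clock slowdown of the SGE run relative to the direct run.
slowdown = via_sge[2] / direct[2]
# CPU utilisation of the direct run: (user + system) / elapsed.
cpu_util = (direct[0] + direct[1]) / direct[2]

print(f"direct: {direct[2]:.2f} s, via SGE: {via_sge[2]:.2f} s")
print(f"slowdown: {slowdown:.2f}x, direct CPU utilisation: {cpu_util:.1%}")
```

So the SGE run took a bit more than twice as long, not hours. The 346.5% for the direct run also suggests that the 7 processes got roughly 3.5 cores' worth of CPU on the master node.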
Are all machines the same regarding speed and core count? BTW: running
interactively in SGE might not set the environment variables if you use `qrsh`
without a command, or `qlogin`; in that case some default hostfile will be used
instead (unless you provide one). With the command supplied, as below, it
should be fine.
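A quick way to check whether a session got the SGE environment is to look for the variables yourself. The sketch below is an illustration, not part of Open MPI or SGE: the function names are hypothetical, and the list of variables is the set that, as far as I know, Open MPI's SGE support looks for. It also assumes the usual `$PE_HOSTFILE` layout of one line per host with the slot count in the second column.

```python
import os

# Variables SGE sets for a parallel job (assumed list, see the note above).
SGE_VARS = ("SGE_ROOT", "ARC", "PE_HOSTFILE", "JOB_ID")


def missing_sge_vars(env):
    """Return the names from SGE_VARS that are unset or empty in env."""
    return [name for name in SGE_VARS if not env.get(name)]


def slots_from_hostfile(text):
    """Sum the slot counts (second column) of a $PE_HOSTFILE-style text."""
    total = 0
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 2:
            total += int(fields[1])
    return total


if __name__ == "__main__":
    missing = missing_sge_vars(os.environ)
    if missing:
        print("not set:", ", ".join(missing))
    else:
        print("SGE environment looks complete")
    hostfile = os.environ.get("PE_HOSTFILE", "")
    if hostfile and os.path.isfile(hostfile):
        with open(hostfile) as handle:
            print("granted slots:", slots_from_hostfile(handle.read()))
```

Run it once inside the interactive session and once via `qrsh -pe orte 7 python check.py` (the script name is just an example) and compare the output.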
-- Reuti
> time /commun/data/packages/openmpi/bin/mpirun -np 7 /path/to/a.out arg1
> arg2 arg3 ....
> 670.985u 64.929s 3:32.36 346.5% 0+0k 16322112+6560io 32pf+0w
>
> time qrsh -cwd -pe orte 7 /commun/data/packages/openmpi/bin/mpirun
> /path/to/a.out arg1 arg2 arg3 ....
> 0.023u 0.036s 7:45.05 0.0% 0+0k 1496+0io 1pf+0w
>
>
>
> I'm going to investigate this... :-)
>
> Thank you again
>
> Pierre
>
>