I'd appreciate some advice and help on this one. We're having
serious problems running parallel applications on our cluster. After
each batch job finishes, we lose a certain amount of available
memory. Additional jobs cause free memory to gradually go down until
the machine starts swapping an

Hi there!
I have just installed Torque and Open MPI and am trying to make them work
together. I ran ./configure --with-tm=/usr/local to build Open MPI with
Torque integration. But when I run "mpirun -H node2 hello" on "node1", it
asks for the password of node2.
And when I typed the password, eventually it tells below err