I think that ACML creates enough threads to fill all the available
cores. If you run 2 instances of ACML, it will create twice as many
threads as there are available cores, and performance is obviously
terrible. You should check the ACML documentation to get the name of
the environment variable controlling
Correction:
compute-0-0.local np=8 (and not np=4)
Besides that, when we set mpi_paffinity_alone to 1, even though 8
threads were running, the total sum of %CPU was only around 400%. For
some reason, only half of the processing power of the nodes was
being utilized. The 4 threads of the fir
Dear all, we just finished installing the first batch of nodes with
the following configuration.
Machines: Dual quad-core AMD 2350 + 16 GB of RAM
OS + Apps: Rocks 4.3 + Torque (2.1.8-1) + Maui (3.2.6p19-1) + Open MPI
(1.1.1-8) + VASP
Interconnections: Gigabit Ethernet ports + Extreme Summit x4
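
For what it is worth, the numbers mentioned in this thread can be sanity-checked with a few lines of Python. This is only a rough sketch: the function names are made up for illustration, and the assumption that each process starts one thread per core is taken from the "I think" in the reply, not from any documentation.

```python
def oversubscription(sockets, cores_per_socket, processes, threads_per_process):
    """Return (cores, total_threads, threads_per_core) for a single node."""
    cores = sockets * cores_per_socket
    total_threads = processes * threads_per_process
    return cores, total_threads, total_threads / cores


def utilization(total_cpu_percent, cores):
    """Fraction of the node in use, given the summed %CPU column from top."""
    return total_cpu_percent / (100.0 * cores)


# Dual quad-core node: 2 sockets x 4 cores = 8 cores.
# Assumption: 2 processes, each starting 8 threads (one per core).
cores, threads, ratio = oversubscription(2, 4, processes=2, threads_per_process=8)
print(cores, threads, ratio)

# Summed %CPU of about 400% on an 8-core node.
print(utilization(400, cores))
```

A ratio above 1.0 means more runnable threads than cores, and a utilization of 0.5 matches the "only half of the processing power" observation.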