Yes MM...  But here a single node has 16cores not 64 cores.
The 1st two jobs were with OMPI-1.4.5.
      16 cores of single node - 3692.403
      16 cores on two nodes (8 cores per node) - 12338.809

The 1st two jobs were with OMPI-1.6.5.
      16 cores of single node - 3547.879
      16 cores on two nodes (8 cores per node) - 5527.320

      As others said, due to shared memory communication the single node
job is running faster, but I was expecting a slight difference between 1 &
2 nodes - which is taking 60% more time here.



On Thu, Oct 31, 2013 at 8:19 PM, Ralph Castain <r...@open-mpi.org> wrote:

> Yes, though the degree of impact obviously depends on the messaging
> pattern of the app.
>
> On Oct 31, 2013, at 2:50 AM, MM <finjulh...@gmail.com> wrote:
>
> Of course, by this you mean, with the same total number of nodes, for e.g.
> 64 process on 1 node using shared mem, vs 64 processes spread over 2 nodes
> (32 each for e.g.)?
>
>
> On 29 October 2013 14:37, Ralph Castain <r...@open-mpi.org> wrote:
>
>> As someone previously noted, apps will always run slower on multiple
>> nodes vs everything on a single node due to the shared memory vs IB
>> differences. Nothing you can do about that one.
>>
> _______________________________________________
>
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>

Reply via email to