Dear all,
I would like to ask about a topic that already has many questions, but
which I am not very familiar with. I want to understand the behaviour of
an application that sends many messages smaller than 64 KB (eager mode)
over a TCP network. I am trying to understand this in order to simulate
the application.
For example, there may be one MPI_Send of 1200 bytes after some
computation, then two messages of the same size after more computation,
and so on. However, according to the measurements and the profiling, the
cost of the communication is less than the latency of the network. I
understand that the cost of MPI_Send is mainly the copy into the buffer,
but delivering the message to the destination should cost at least the
network latency. So are the messages buffered on the sender side and
sent to the receiver as a single packet? My TCP window is 4 MB and I use
the same value for snd_buff and rcv_buff. If messages are buffered on
the sender, what is the criterion/algorithm? In other words, if I have
one message, then computation, then another message, can these two
messages be buffered on the sender side, or does this happen only on the
receiver? If there is any document/paper I can read about this, I would
appreciate a link.
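
For concreteness, here is a minimal sketch of the pattern I mean (the
usleep calls standing in for computation, the tags, and MPI_BYTE are
just placeholders, not my real code):

/* compute, send one small message, compute, send two more */
#include <mpi.h>
#include <unistd.h>

int main(int argc, char **argv)
{
    int rank;
    char buf[1200];                      /* well below the eager threshold */

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        usleep(1000);                    /* stands in for some computation */
        MPI_Send(buf, 1200, MPI_BYTE, 1, 0, MPI_COMM_WORLD);

        usleep(1000);                    /* more computation */
        MPI_Send(buf, 1200, MPI_BYTE, 1, 1, MPI_COMM_WORLD);
        MPI_Send(buf, 1200, MPI_BYTE, 1, 2, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(buf, 1200, MPI_BYTE, 0, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        MPI_Recv(buf, 1200, MPI_BYTE, 0, 1, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
        MPI_Recv(buf, 1200, MPI_BYTE, 0, 2, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
    }

    MPI_Finalize();
    return 0;
}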
A simple example: if I have a loop in which rank 0 sends two messages to
rank 1, then the duration of the first message is larger than that of
the second; and if I increase the loop to 10 or 20 messages, then all
the later messages cost much less than the first one, and also less than
what SkaMPI measures. So I am fairly sure it is a buffering issue (or
something else that I have not thought of).
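
This is roughly the loop I time; N, the message size, and the tag below
are placeholders rather than my exact values:

/* rank 0 sends N small messages to rank 1 and times each MPI_Send */
#include <mpi.h>
#include <stdio.h>

#define N    10
#define SIZE 1200

int main(int argc, char **argv)
{
    int rank, i;
    char buf[SIZE];

    MPI_Init(&argc, &argv);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        for (i = 0; i < N; i++) {
            double t0 = MPI_Wtime();
            MPI_Send(buf, SIZE, MPI_BYTE, 1, 0, MPI_COMM_WORLD);
            double t1 = MPI_Wtime();
            printf("message %d: %f us\n", i, (t1 - t0) * 1e6);
        }
    } else if (rank == 1) {
        for (i = 0; i < N; i++)
            MPI_Recv(buf, SIZE, MPI_BYTE, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
    }

    MPI_Finalize();
    return 0;
}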
Best regards,
Georges