This is actually expected behavior. We assume that MPI processes should exhibit the lowest possible latency, and therefore use active polling for most message passing.

Additionally, connections could arrive across multiple devices, so we need to poll them all to check for progress/connections. We've talked internally about getting better at recognizing single-device scenarios (and therefore allowing blocking), but haven't really done much about it. Our internal interfaces were designed around non-blocking polling for maximum performance (i.e., lowest latency / highest bandwidth).


On Apr 26, 2007, at 3:48 PM, Nuno Sucena Almeida wrote:

Hello,

        I'm having a weird problem while using MPI_Comm_accept (C) or
MPI::Comm::Accept (C++ bindings).
My "server" runs until the call to this function, but if there's no client connecting, it sits there eating all CPU (100%). If a client connects, the loop works fine, but when the client disconnects we are back to the
same high CPU usage.
I tried OpenMPI versions 1.1.2 and 1.2. The machines' architectures are AMD Opteron and Intel Itanium2 respectively, the former compiled with gcc
4.1.1 and the latter with gcc 3.2.3.

        The C++ code is here:

        http://compel.bu.edu/~nuno/openmpi/

        along with the logs for orted and the 'server' output.

        I started orted with:

        orted --persistent --seed --scope public  --universe foo

        and the 'server' with

        mpirun --universe foo -np 1 ./server

The code is a C++ conversion of the basic C one posted at the mpi-forum
website:
        
        http://www.mpi-forum.org/docs/mpi-20-html/node106.htm#Node109

Is there an easy fix for this? I also tried the C version, with the same
problem...

                                        Regards,
                                                                                
        Nuno
--
http://aeminium.org/slug/
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


--
Jeff Squyres
Cisco Systems
