Re: [OMPI users] open-mpi behaviour on Fedora, Ubuntu, Debian and CentOS

Tim Prince Mon, 26 Apr 2010 09:37:24 -0400

On 4/26/2010 2:31 AM, Asad Ali wrote:

On Mon, Apr 26, 2010 at 8:01 PM, Ashley Pittman <ash...@pittman.co.uk<mailto:ash...@pittman.co.uk>> wrote:
    On 25 Apr 2010, at 22:27, Asad Ali wrote:

    > Yes I use different machines such as
    >
    > machine 1 uses AMD Opterons. (Fedora)
    >
    > machine 2 and 3 use Intel Xeons. (CentOS)
    >
    > machine 4 uses slightly older Intel Xeons. (Debian)
    >
    > Only machine 1 gives correct results.  While CentOS and Debian
    results are same but are wrong and different from those of machine 1.

    Have you verified the are actually wrong or are they just
    different?  It's actually perfectly possible for the same program
    to get different results from run to run even on the same hardware
    and the same OS.  All floating point operations by the MPI library
    are expected to be deterministic but changing the process layout
    or and MPI settings can affect this and of course anything the
    application does can introduce differences as well.

    Ashley.
The code is the same with the same input/output and the same constantsetc. From run to run the results can only be different if you eitheruse different input/output or use different random number seeds. Herein my case the random number seeds are the same as well. This meansthat this code must give (and it does) the same results no matter howmany times you run it. I didn't tamper with mpi-settings for any run.I have verified that results of only Fedora are correct because I knowwhat is in my data and how should my model behave and I get a nearlyperfect convergence on Fedora OS. Even my dual core laptop with Ubuntu9.10 also gives correct results. The other OSs give the same resultsfor a few hundred iterations as Fedora but then an unusual thinghappens and the results start getting wrong.

If you're really interested in solving your "problem," you'll have toconsider important details such as which compiler was used, whichoptions (e.g. 387 vs. sse), run-time setting of x87 or SSE controlregisters, 32- vs. 64-bit compilation. SSE2 is the default for 64-bitcompilation, but compilers vary on defaults for 32-bit. If your programdepends on x87 extra precision of doubles, or efficient mixing of doubleand long double, 387 code may be a better choice, but limits yourefficiency.


--
Tim Prince

Re: [OMPI users] open-mpi behaviour on Fedora, Ubuntu, Debian and CentOS

Reply via email to