Yes, I can explain what's going on here. The short version: a change was made with the intent of providing maximum compile-time safety for Fortran code, but it carries a possible backwards-compatibility issue. If this change is causing real problems, we can probably revisit it, but I'd like a little feedback from the Fortran MPI dev community first.
It's a complex issue, and requires a little background and discussion, sorry...

a) Back in the 1.6.x series, we allowed users to build multiple variants of the "use mpi" Fortran interface:

   - tiny:   only the MPI_SIZEOF subroutines
   - small:  tiny + all MPI subroutines that do not take choice buffers, plus the MPI functions (MPI_WTICK, MPI_WTIME)
   - medium: small + all MPI subroutines that take 1 choice buffer (e.g., MPI_SEND)
   - large:  all MPI subroutines (even those that take 2 choice buffers, such as the collectives)

   --> Note: the "large" size never really worked, for uninteresting reasons, and it won't be fixed. See the OMPI 1.6.x README for more details.

The default is "small" in the 1.6.x series. This means that when you call MPI_SEND (or any other subroutine that takes a choice buffer), you are not getting an MPI-implementation-provided prototype for that subroutine -- it's essentially the same as how everyone has implemented mpif.h (i.e., no prototypes). This is why you are able to compile your code with OMPI 1.6.x and "use mpi": there is no prototype for MPI_SEND in the mpi module. Heck, you could even write:

   ! Don't pass enough params to MPI_SEND
   call MPI_Send(bogus)

and it would compile and link. It will likely segv at run time, but that's a different issue.

b) I *believe* that MPICH does the equivalent of "tiny", but I'm not going to swear to that (meaning: you're not getting any prototypes for any MPI subroutines other than MPI_SIZEOF). This is why you are able to compile your code with MPICH and "use mpi" -- same disclaimers as in a) (i.e., you get no compile-time protection when you don't call MPI subroutines properly).

c) The design of the MPI-2 "mpi" module has multiple flaws that are identified in the MPI-3 text (but were not recognized back in the MPI-2.x days). Here's one: until Fortran 2008 and its addenda, there was no Fortran equivalent of "void *". Hence, the mpi module has to overload MPI_Send() and provide a prototype *for every possible type and dimension*. The OMPI "medium" implementation actually provides overloaded prototypes for all pre-defined Fortran datatypes (INTEGER, REAL, ...etc.), for scalars and, by default, array ranks up to 4. Fortran (through Fortran 2003) allows arrays of up to 7 dimensions, but providing an overloaded interface for each scalar type and every array rank of each type explodes the number of overloaded prototypes in the mpi module; most compilers that we tested several years ago would segv with that many interfaces in a single module. It gets worse with the MPI subroutines that take multiple choice buffers: you get a combinatorial explosion of interfaces (every pairing of the single-buffer variants). IIRC, a fully-populated "large" mpi module would contain over 5 million interfaces. Craig Rasmussen and I wrote a paper about this at Euro PVM/MPI 2005 (http://www.open-mpi.org/papers/euro-pvmmpi-2005-fortran/). It was one of the issues that eventually led to the creation of the MPI-3 mpi_f08 module.

Here's another fatal flaw: it's not possible for an MPI implementation to provide MPI_Send() prototypes for user-defined Fortran datatypes. Hence, the example you cite is a pipe dream for the "mpi" module, because there's no way to specify a (void *)-like argument for the choice buffer. Meaning: Fortran MPI apps can either have compile-time safety or user-defined datatypes as choice buffers. Pick one.

d) A solution to the problems listed in c) is to use non-standard, compiler-specific "ignore TKR" functionality in the mpi module implementation, which effectively provides (void *) functionality. Hence, an implementation can have a *single* MPI_SEND subroutine interface, and use a pragma to ignore the type, kind, and rank of the choice buffer parameter.
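To make c) and d) concrete, here is a rough sketch of what the two styles look like inside an mpi-module implementation. This is *not* Open MPI's actual source: the module name and specific-procedure names are made up, and the ignore-TKR directive shown is the Intel spelling (other compilers use different pragmas; gfortran had none at the time):

   ! Sketch only -- illustrative names, not OMPI source.
   module mpi_send_sketch
     implicit none

     ! (c) "Explicit overload" style (the 1.6.x "medium" module): one specific
     !     interface per type/kind/rank of the choice buffer.  Multiply by every
     !     predefined type and every supported rank; routines with two choice
     !     buffers need every *pair* of combinations.
     interface MPI_Send
        subroutine MPI_Send_int_scalar(buf, count, datatype, dest, tag, comm, ierror)
          integer :: buf
          integer :: count, datatype, dest, tag, comm, ierror
        end subroutine MPI_Send_int_scalar
        subroutine MPI_Send_real_1d(buf, count, datatype, dest, tag, comm, ierror)
          real :: buf(*)
          integer :: count, datatype, dest, tag, comm, ierror
        end subroutine MPI_Send_real_1d
        ! ...and so on: every predefined type x scalar + ranks 1..4 (by default),
        ! and never a specific for a user-defined TYPE.
     end interface

     ! (d) "Ignore TKR" style (the new OMPI 1.7 module): a single specific whose
     !     choice-buffer argument carries a compiler-specific directive that
     !     disables type/kind/rank checking, i.e., a poor man's (void *).
     interface MPI_Send_tkr
        subroutine MPI_Send_tkr_f(buf, count, datatype, dest, tag, comm, ierror)
          !DEC$ ATTRIBUTES NO_ARG_CHECK :: buf    ! Intel spelling; gfortran ignores it
          integer :: buf(*)                       ! declared type is irrelevant once TKR checks are off
          integer :: count, datatype, dest, tag, comm, ierror
        end subroutine MPI_Send_tkr_f
     end interface

   end module mpi_send_sketch

With the explicit-overload style, every additional rank or kind multiplies the number of specifics; with the ignore-TKR style there is exactly one specific per MPI routine, but it requires a pragma that gfortran does not provide.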
OMPI 1.7 and beyond actually has 2 different implementations of the mpi module:

   - the old tiny/small/medium-based interface, for compilers that do not support "ignore TKR" pragmas (e.g., gfortran)
   - a new ignore-TKR-based module that prototypes all MPI subroutines and functions

Meaning: OMPI 1.7 with a non-gfortran compiler works great (i.e., your sample code compiles). OMPI 1.7 with gfortran is *mostly* the same as it was in 1.6, except that we changed the default from "small" to "medium".

*** This is what is causing your problem. In OMPI 1.6, we didn't provide an interface for MPI_SEND by default. In OMPI 1.7, we do.

Craig Rasmussen and I debated long and hard about whether to change the default from "small" to "medium". We finally ended up doing it with the following rationale:

   - very few codes use the "mpi" module
   - but those that do should get the maximum amount of compile-time protection

...but we always knew that someone might come complaining some day. And that day has now come.

So my question to you / the Fortran MPI dev community is: what do you want (for gfortran)? Do you want us to go back to the "small" size by default, or do you want more compile-time protection by default? (With the obvious caveat that you can't use user-defined Fortran datatypes as choice buffers; you might be able to use something like c_loc, but I haven't thought deeply about this and don't know offhand if that works.)
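For what it's worth, here is a hedged sketch of the workaround that the commented-out line in the quoted example below hints at: pass an intrinsic-typed component of the derived type as the choice buffer, so that generic resolution in the overloaded module finds a specific. The subroutine and argument names are made up for illustration, and it assumes a 'newtype' whose displacements were taken relative to the first component (as in the quoted code):

   ! Sketch of the "pass a component" workaround (illustrative names).
   subroutine send_struct_workaround(newtype, dest, tag)
     use mpi
     implicit none

     type, bind(C) :: mytype          ! abbreviated version of the type in the quoted example
        integer :: i
        real    :: x
     end type mytype

     type(mytype) :: foo
     integer, intent(in) :: newtype, dest, tag
     integer :: ierr

     foo = mytype(42, 1.0)

     ! call MPI_SEND(foo, 1, newtype, dest, tag, MPI_COMM_WORLD, ierr)
     !   ...is rejected at compile time: the overloaded module has no specific
     !   for TYPE(mytype).  Passing the first, intrinsic-typed component does
     !   match a specific (INTEGER scalar), and is valid as long as the
     !   displacements in 'newtype' were taken relative to foo%i:
     call MPI_SEND(foo%i, 1, newtype, dest, tag, MPI_COMM_WORLD, ierr)

   end subroutine send_struct_workaround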
On Jan 6, 2014, at 11:34 PM, Jed Brown <jedbr...@mcs.anl.gov> wrote:

> The attached code is from the example on page 629-630 (17.1.15 Fortran
> Derived Types) of MPI-3. This compiles cleanly with MPICH and with OMPI
> 1.6.5, but not with the latest OMPI. Arrays higher than rank 4 would
> have a similar problem since they are not enumerated. Did someone
> decide that a necessarily-incomplete enumeration of types was "good
> enough" and that other users should use some other workaround?
>
> $ ~/usr/ompi/bin/mpifort -c struct.f90
> struct.f90:40.55:
>
>   call MPI_SEND(foo, 1, newtype, dest, tag, comm, ierr)
>                                                        1
> Error: There is no specific subroutine for the generic 'mpi_send' at (1)
> struct.f90:43.48:
>
>   call MPI_GET_ADDRESS(fooarr(1), disp(1), ierr)
>                                                 1
> Error: There is no specific subroutine for the generic 'mpi_get_address' at (1)
> struct.f90:44.48:
>
>   call MPI_GET_ADDRESS(fooarr(2), disp(2), ierr)
>                                                 1
> Error: There is no specific subroutine for the generic 'mpi_get_address' at (1)
> struct.f90:50.61:
>
>   call MPI_SEND(fooarr, 5, newarrtype, dest, tag, comm, ierr)
>                                                             1
> Error: There is no specific subroutine for the generic 'mpi_send' at (1)
>
> $ ~/usr/ompi/bin/ompi_info
> Package: Open MPI jed@batura Distribution
> Open MPI: 1.9a1
> Open MPI repo revision: r29531M
> Open MPI release date: Oct 26, 2013
> Open RTE: 1.9a1
> Open RTE repo revision: r29531M
> Open RTE release date: Oct 26, 2013
> OPAL: 1.9a1
> OPAL repo revision: r29531M
> OPAL release date: Oct 26, 2013
> MPI API: 2.2
> Ident string: 1.9a1
> Prefix: /home/jed/usr/ompi
> Configured architecture: x86_64-unknown-linux-gnu
> Configure host: batura
> Configured by: jed
> Configured on: Mon Jan 6 19:38:01 CST 2014
> Configure host: batura
> Built by: jed
> Built on: Mon Jan 6 19:49:41 CST 2014
> Built host: batura
> C bindings: yes
> C++ bindings: no
> Fort mpif.h: yes (all)
> Fort use mpi: yes (limited: overloading)
> Fort use mpi size: deprecated-ompi-info-value
> Fort use mpi_f08: no
> Fort mpi_f08 compliance: The mpi_f08 module was not built
> Fort mpi_f08 subarrays: no
> Java bindings: no
> Wrapper compiler rpath: runpath
> C compiler: gcc
> C compiler absolute: /usr/bin/gcc
> C compiler family name: GNU
> C compiler version: 4.8.2
> C++ compiler: g++
> C++ compiler absolute: /usr/bin/g++
> Fort compiler: /usr/bin/gfortran
> Fort compiler abs:
> Fort ignore TKR: no
> Fort 08 assumed shape: no
> Fort optional args: no
> Fort BIND(C): no
> Fort PRIVATE: no
> Fort ABSTRACT: no
> Fort ASYNCHRONOUS: no
> Fort PROCEDURE: no
> Fort f08 using wrappers: yes
> C profiling: yes
> C++ profiling: no
> Fort mpif.h profiling: yes
> Fort use mpi profiling: yes
> Fort use mpi_f08 prof: no
> C++ exceptions: no
> Thread support: posix (MPI_THREAD_MULTIPLE: no, OPAL support: yes, OMPI progress: no, ORTE progress: yes, Event lib: yes)
> Sparse Groups: no
> Internal debug support: yes
> MPI interface warnings: yes
> MPI parameter check: runtime
> Memory profiling support: no
> Memory debugging support: no
> libltdl support: yes
> Heterogeneous support: no
> mpirun default --prefix: no
> MPI I/O support: yes
> MPI_WTIME support: gettimeofday
> Symbol vis. support: yes
> Host topology support: yes
> MPI extensions:
> FT Checkpoint support: no (checkpoint thread: no)
> C/R Enabled Debugging: no
> VampirTrace support: yes
> MPI_MAX_PROCESSOR_NAME: 256
> MPI_MAX_ERROR_STRING: 256
> MPI_MAX_OBJECT_NAME: 64
> MPI_MAX_INFO_KEY: 36
> MPI_MAX_INFO_VAL: 256
> MPI_MAX_PORT_NAME: 1024
> MPI_MAX_DATAREP_STRING: 128
> [full MCA component listing omitted -- ~115 lines of "MCA <framework>: <component> (MCA v2.0, API vX.Y, Component v1.9)"]
>
> subroutine foobar
> use mpi
>
> type, BIND(C) :: mytype
> integer :: i
> real :: x
> double precision :: d
> logical :: l
> end type mytype
>
> type(mytype) :: foo, fooarr(5)
> integer :: blocklen(4), type(4)
> integer(KIND=MPI_ADDRESS_KIND) :: disp(4), base, lb, extent
>
> call MPI_GET_ADDRESS(foo%i, disp(1), ierr)
> call MPI_GET_ADDRESS(foo%x, disp(2), ierr)
> call MPI_GET_ADDRESS(foo%d, disp(3), ierr)
> call MPI_GET_ADDRESS(foo%l, disp(4), ierr)
>
> base = disp(1)
> disp(1) = disp(1) - base
> disp(2) = disp(2) - base
> disp(3) = disp(3) - base
> disp(4) = disp(4) - base
>
> blocklen(1) = 1
> blocklen(2) = 1
> blocklen(3) = 1
> blocklen(4) = 1
> type(1) = MPI_INTEGER
> type(2) = MPI_REAL
> type(3) = MPI_DOUBLE_PRECISION
> type(4) = MPI_LOGICAL
>
> call MPI_TYPE_CREATE_STRUCT(4, blocklen, disp, type, newtype, ierr)
> call MPI_TYPE_COMMIT(newtype, ierr)
>
> ! call MPI_SEND(foo%i, 1, newtype, dest, tag, comm, ierr)
> ! or
> call MPI_SEND(foo, 1, newtype, dest, tag, comm, ierr)
> ! expects that base == address(foo%i) == address(foo)
>
> call MPI_GET_ADDRESS(fooarr(1), disp(1), ierr)
> call MPI_GET_ADDRESS(fooarr(2), disp(2), ierr)
> extent = disp(2) - disp(1)
> lb = 0
> call MPI_TYPE_CREATE_RESIZED(newtype, lb, extent, newarrtype, ierr)
> call MPI_TYPE_COMMIT(newarrtype, ierr)
>
> call MPI_SEND(fooarr, 5, newarrtype, dest, tag, comm, ierr)
>
> end subroutine foobar

--
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/