Hi Gus Correa,

Thank you so much for such a detailed explanation :)
I understand that now.

Best wishes ^_^

On Mon, Dec 6, 2010 at 7:41 AM, Gustavo Correa <g...@ldeo.columbia.edu> wrote:

> Hi Daofeng
>
> It is hard to tell what is happening on the Infiniband side of the problem.
> Did somebody perhaps remove the Infiniband card from this machine?
> Was it ever there?
> Did somebody perhaps change the Linux kernel modules that are loaded
> (perhaps by editing /etc/module.config or similar)?
> Maybe other people in your organization know.
>
> If this is a single computer, not a cluster, you don't lose anything by
> not having Infiniband.
> In this case, you can reinstall OpenMPI without Infiniband support, by just
> doing "make distclean" in the OpenMPI build directory (to clean up what is
> there),
> then "./configure --prefix=/wherever/you/want/to/install --without-openib",
> then "make", and "make install".
>
> Alternatively, you can continue to use what you already have by passing the
> "-mca btl ^openib" flag to mpirun.
>
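
For anyone finding this thread later: the choice between the two cases could be scripted roughly as below. This is only a minimal sketch, not anything from Open MPI itself. The function name build_mpirun_cmd and its ib_device_dir parameter are made up for illustration, and treating "no /dev/infiniband directory" as "no usable Infiniband" is a rough assumption based on the error messages in this thread, not a reliable hardware test.

```python
import os
import shlex


def build_mpirun_cmd(nprocs, program, ib_device_dir="/dev/infiniband"):
    """Build an mpirun argument list (hypothetical helper).

    If the InfiniBand device directory is missing, exclude the openib
    BTL with "-mca btl ^openib" so Open MPI does not try to use it.
    """
    cmd = ["mpirun"]
    if not os.path.isdir(ib_device_dir):
        # Assumption: no device directory means no usable InfiniBand.
        cmd += ["-mca", "btl", "^openib"]
    cmd += ["-n", str(nprocs), program]
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it, so it can be inspected first.
    print(shlex.join(build_mpirun_cmd(8, "hello_cxx")))
```

As far as I know, the same MCA parameter can also be set once in the environment (OMPI_MCA_btl=^openib) or in $HOME/.openmpi/mca-params.conf (a line "btl = ^openib"), which avoids typing the flag every time.
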
> If this is a cluster, of course you would benefit from Infiniband, which is
> a faster network than Ethernet or Gigabit Ethernet.
> In this case you need to ask for help from somebody who knows more about
> your cluster hardware, to restore the Infiniband to a sane and healthy state.
> Or, if there is no Infiniband hardware, or if it is broken, just reinstall
> OpenMPI following the little recipe above.  You will be able to run your
> programs using Ethernet (I assume the cluster would have Ethernet).
> Not very fast, but it will work.
>
> My two cents,
> Gus Correa
>
>
> On Dec 4, 2010, at 4:47 AM, Daofeng Li wrote:
>
> > Hi Gus,
> >
> > Thank you for your response.
> > I think this is mostly about hardware, which I know little about :)
> > The machine I used may not have the card you mentioned, as I get this when I run:
> >  /usr/sbin/ibstat
> > ibwarn: [4260] umad_init: can't read ABI version from /sys/class/infiniband_mad/abi_version (No such file or directory): is ib_umad module loaded?
> > ibpanic: [4260] main: can't init UMAD library: (No such file or directory)
> >
> > But you really helped me, as this now runs cleanly:
> >
> > $ mpirun -mca btl ^openib -n 8 hello_cxx
> > Hello, world!  I am 6 of 8
> > Hello, world!  I am 0 of 8
> > Hello, world!  I am 4 of 8
> > Hello, world!  I am 7 of 8
> > Hello, world!  I am 5 of 8
> > Hello, world!  I am 2 of 8
> > Hello, world!  I am 1 of 8
> > Hello, world!  I am 3 of 8
> >
> > that's really cool~
> >
> > thank you all:)
> >
> > Best Wishes.
> > On Sat, Dec 4, 2010 at 11:12 AM, Gus Correa <g...@ldeo.columbia.edu> wrote:
> > Hi Daofeng
> >
> > Do you have an Infiniband card in the machine where you are
> > running the program?
> > (Open Fabrics / OFED is the software support for Infiniband.
> > I guess you need the same version installed in all machines.)
> >
> > Does the directory referred to in the error message actually
> > exist on your machine (i.e., /dev/infiniband)?
> >
> > Are you running it on the same machine where you installed OpenMPI?
> >
> > What output do you get from:
> > /usr/sbin/ibstat
> > ?
> >
> > Did you compile the programs with the mpicc, mpiCC, mpif77
> > from the same OpenMPI that you built?
> > (Some Linux distributions and compilers come with
> > their own flavors of MPI, or you may also
> > have installed MPICH or MVAPICH, so it is not uncommon to mix them up.)
> >
> > Have you tried to suppress the use of Infiniband, i.e.:
> >
> > mpirun -mca btl ^openib -n 8 hello_cxx
> >
> > (Well, "openib" is the OpenMPI support for Infiniband.
> > The "^" means "don't use it")
> >
> > I hope this helps,
> > Gus Correa
> >
> > Daofeng Li wrote:
> > Dear Jeff,
> >  Actually, I did not understand this. Can you or anyone tell me what to do?
> >  Thx.
> >  Best.
> >
> > On Fri, Dec 3, 2010 at 9:41 PM, Jeff Squyres (jsquyres) <jsquy...@cisco.com> wrote:
> >
> >    It means that you probably have a version mismatch with your
> >    OpenFabrics drivers, and/or you have no OpenFabrics hardware and
> >    should probably disable those drivers.
> >    Sent from my PDA. No type good.
> >    On Dec 3, 2010, at 4:56 AM, "Daofeng Li" <lid...@gmail.com> wrote:
> >
> >    Dear list,
> >         I am currently trying to use the OpenMPI package.
> >    I installed it in my home directory:
> >    ./configure --prefix=$HOME --enable-mpi-threads
> >    make
> >    make install
> >         and then I added ~/bin to PATH and ~/lib to
> >    LD_LIBRARY_PATH in my .bashrc file.
> >         Everything seems normal, as I can run the example programs:
> >    mpirun -n 8 hello_cxx
> >    mpirun -n 8 hello_f77
> >    mpirun -n 8 hello_c
> >    etc...
> >         but error messages appear:
> >         $ mpirun -n 8 hello_cxx
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >
> >    --------------------------------------------------------------------------
> >    [[32727,1],1]: A high-performance Open MPI point-to-point
> >    messaging module
> >    was unable to find any relevant network interfaces:
> >    Module: OpenFabrics (openib)
> >      Host: localhost.localdomain
> >    Another transport will be used instead, although this may result in
> >    lower performance.
> >
> >    --------------------------------------------------------------------------
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    CMA: unable to open /dev/infiniband/rdma_cm
> >    librdmacm: couldn't read ABI version.
> >    librdmacm: assuming: 4
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    libibverbs: Fatal: couldn't read uverbs ABI version.
> >    Hello, world!  I am 1 of 8
> >    Hello, world!  I am 0 of 8
> >    Hello, world!  I am 3 of 8
> >    Hello, world!  I am 5 of 8
> >    Hello, world!  I am 7 of 8
> >    Hello, world!  I am 4 of 8
> >    Hello, world!  I am 6 of 8
> >    Hello, world!  I am 2 of 8
> >    [localhost.localdomain:30503] 7 more processes have sent help
> >    message help-mpi-btl-base.txt / btl:no-nics
> >    [localhost.localdomain:30503] Set MCA parameter
> >    "orte_base_help_aggregate" to 0 to see all help / error messages
> >         I am wondering whether I installed OpenMPI the right way.
> >    Could anyone give some suggestions?
> >         Thanks in advance.
> >         Best Regards.
> >    --     Daofeng Li
> >    College of Biological Science
> >    China Agricultural University
> >    Beijing
> >    China
> >
> >    _______________________________________________
> >    users mailing list
> >    us...@open-mpi.org
> >
> >    http://www.open-mpi.org/mailman/listinfo.cgi/users
>



-- 
Daofeng Li
College of Biological Science
China Agricultural University
Beijing
China
