[OMPI users] WRF Problem running in Parallel

2011-02-22 Thread Ahsan Ali
task 1 of 4 starting wrf task 3 of 4 starting wrf task 2 of 4 -- mpirun noticed that process rank 3 with PID 6044 on node pmd02.pakmet.comexited on signal 11 (Segmentation fault). ** * -- Syed Ahsan Ali Bokhari Electronic

Re: [OMPI users] WRF Problem running in Parallel (jody)

2011-02-22 Thread Ahsan Ali
360 JS 75 JE 98 WRF TILE 5 IS 1 IE360 JS 99 JE122 WRF TILE 6 IS 1 IE360 JS123 JE146 WRF TILE 7 IS 1 IE360 JS147 JE170 WRF TILE 8 IS 1 IE360 JS171 JE195 WRF NUMBER OF TILES = 8 Best Regards, -- Syed Ahsa

Re: [OMPI users] WRF Problem running in Parallel (Gus Correa)

2011-02-23 Thread Ahsan Ali
t; ith wrf) > > > > Have you tried running a single instance of wrf.exe? > > Have you tried to run a simple application (like a "hello world") on your > nodes? > > > > Jody > > > > > > On Tue, Feb 22, 2011 at 7:37 AM, Ahsan Ali > wrote:

[OMPI users] WRF run on multiple Nodes

2011-04-01 Thread Ahsan Ali
Hello, I want to run WRF on multiple nodes in a linux cluster using openmpi, giving the command *mpirun -np 4 ./wrf.exe* just submit it to the single node . I don't know how to run it on other nodes as well. Help needed. Regards, -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Res

Re: [OMPI users] WRF run on multiple Nodes

2011-04-03 Thread Ahsan Ali
age-ID: > Content-Type: text/plain; charset="iso-8859-1" > > look into -machinefile > > On Fri, Apr 1, 2011 at 8:16 PM, Ahsan Ali wrote: > > > Hello, > > > > I want to run WRF on multiple nodes in a linux cluster using openmpi, > > giving the comma

Re: [OMPI users] WRF run on multiple Nodes

2011-04-04 Thread Ahsan Ali
Dear John Hearns, The cluster is installed using Platform cluster Manager (PCM). LSF is installed but no OpenMPI application is yet integrated with LSF. WRF help gave me following instructions. mpirun -v -machinefile ~/mach.conf -np 2 wrf.exe Please talk to your computer manager about how to s

[OMPI users] WRF Problem running in Parallel on multiple nodes (cluster)

2011-05-03 Thread Ahsan Ali
. -- mpirun: clean termination accomplished -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Research & Development Division Pakistan Meteorological Department H-8/4, Islamabad. Phone # off +92518358714 Cell # +923155145014

Re: [OMPI users] [Wrf-users] WRF Problem running in Parallel on multiple nodes(cluster)

2011-05-03 Thread Ahsan Ali
hine that works. > > > > Bart > > > > *From:* wrf-users-boun...@ucar.edu [mailto:wrf-users-boun...@ucar.edu] *On > Behalf Of *Ahsan Ali > *Sent:* Tuesday, May 03, 2011 1:04 AM > *To:* us...@open-mpi.org > *Subject:* [Wrf-users] WRF Problem running in Parallel on multiple &

[OMPI users] Infiniband Error

2011-09-12 Thread Ahsan Ali
128 vendor error 129 qp_idx 3* I removed that particular node and then the error was removed.Please suggest me what could be the solution to this. Thanking you in advance. -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Research & Development Division Pakistan Meteorological Department

[OMPI users] Errors for openib, mpirun fails

2014-07-21 Thread Syed Ahsan Ali
Dear All I need your help to solve this cluster related issue causing mpirun malfunction. I get following warning for some of the nodes and then the route failure message comes causing failure to mpirun. *WARNING: There is at least one OpenFabrics device found but there are no active ports dete

Re: [OMPI users] Errors for openib, mpirun fails

2014-07-21 Thread Syed Ahsan Ali
l (Pasha) Shamis > --- > Computer Science Research Group > Computer Science and Math Division > Oak Ridge National Laboratory > > > > > > > On Jul 21, 2014, at 3:17 AM, Syed Ahsan Ali ahsansha...@gmail.com>> wrote: > > Dear All > > I need your help

Re: [OMPI users] Errors for openib, mpirun fails

2014-07-22 Thread Syed Ahsan Ali
are down on > 01-01. > > You may disable support for infiniband by adding --mca btl ^openib. > > Best, > Pavel (Pasha) Shamis > --- > Computer Science Research Group > Computer Science and Math Division > Oak Ridge National Laboratory > > > > >

Re: [OMPI users] Errors for openib, mpirun fails

2014-07-22 Thread Syed Ahsan Ali
id/lid (sort of equivalent of mac address in ethernet world). > As you can guess these two can not be identical for two different machines > (unless you moved the card around). > > Best, > Pasha > > On Jul 21, 2014, at 11:26 PM, Syed Ahsan Ali <mailto:ahsansha...@gmail.com

Re: [OMPI users] Errors for openib, mpirun fails

2014-07-22 Thread Syed Ahsan Ali
shooting-infiniband-connection-issues-using-ofed-tools > > You might also try try running only over port 1 with the mca parameter: > > -mca btl_openib_if_include mlx4_0:1 > > Hope this helps. > > Josh > > > On Tue, Jul 22, 2014 at 12:10 AM, Syed Ahsan Ali > wro

[OMPI users] How to keep multiple installations at same time

2014-08-04 Thread Syed Ahsan Ali
I want to compile openmpi with both intel and gnu compilers. How can I install both at the same time and then specify which one to use during job submission. Regards Ahsan

Re: [OMPI users] openmpi 1.8.1 error witg gfortran

2014-08-06 Thread Syed Ahsan Ali
Issue resolved. On Wed, Aug 6, 2014 at 2:48 PM, Syed Ahsan Ali wrote: > I have following error while compiling > > > *** Fortran compiler > checking whether we are using the GNU Fortran compiler... yes > checking whether /opt/gcc-4.9.1/bin/gfortran accepts -g... yes > con

[OMPI users] openmpi-1.8.1 Unable to compile on CentOS6.5

2014-08-26 Thread Syed Ahsan Ali
I have problems in compilation of openmpi-1.8.1 on Linux machine. Kindly see the logs attached. configure.bz2 Description: BZip2 compressed data

Re: [OMPI users] openmpi-1.8.1 Unable to compile on CentOS6.5

2014-08-26 Thread Syed Ahsan Ali
ubscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > > Link to this post: > http://www.open-mpi.org/community/lists/users/2014/08/25150.php > > > -- > Jeff Squyres > jsquy...@cisco.com > For corporate legal information go to: > http://www.cisco.com/web/

[OMPI users] mpirun fails across nodes

2014-11-12 Thread Syed Ahsan Ali
Dear All I need your advice. While trying to run mpirun job across nodes I get following error. It seems that the two nodes i.e, compute-01-01 and compute-01-06 are not able to communicate with each other. While nodes see each other on ping. [pmdtest@pmd ERA_CLM45]$ mpirun -np 16 -hostfile hostli

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
see all help / error messages On Wed, Nov 12, 2014 at 7:32 PM, Jeff Squyres (jsquyres) wrote: > Do you have firewalling enabled on either server? > > See this FAQ item: > > http://www.open-mpi.org/faq/?category=running#diagnose-multi-host-problems > > > > On

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
~]$ On Thu, Nov 13, 2014 at 12:03 PM, Syed Ahsan Ali wrote: > Hi Jefff > > No firewall is enabled. Running the diagnostics I found that non > communication mpi job is running . While ring_c remains stuck. There > are of course warnings for open fabrics but in my case I an running &

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
; could you try > > $ mpirun --mca btl ^openib --host compute-01-01,compute-01-06 ring_c > > > can you also try to run mpirun from a compute node instead of the head > node ? > > Cheers, > > Gilles > > On 2014/11/13 16:07, Syed Ahsan Ali wrote: >> Here is what I

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
r via this address ? > /* e.g. from compute-01-01 can you ping the 192.168.108.* ip address of > compute-01-06 ? */ > > could you also run > > mpirun --mca btl ^openib --host compute-01-01,compute-01-06 --mca > btl_tcp_if_include 10.0.0.0/8 ring_c > > and see whether it help

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
option is you to use > --mca btl_tcp_if_exclude ib0 > > On 2014/11/13 16:43, Syed Ahsan Ali wrote: >> You are right it is running on 10.0.0.0 interface [pmdtest@pmd ~]$ >> mpirun --mca btl ^openib --host compute-01-01,compute-01-06 --mca >> btl_tcp_if_include 10.0.0.0/8 ring_c >

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
Ok ok I can disable that as well. Thank you guys. :) On Thu, Nov 13, 2014 at 12:50 PM, Syed Ahsan Ali wrote: > Now it looks through the loopback address > > [pmdtest@pmd ~]$ mpirun --host compute-01-01,compute-01-06 --mca > btl_tcp_if_exclude ib0 ring_c > Process 0 sending 10 t

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
gt; can you run on both compute nodes ? > netstat -nr > > > On 2014/11/13 16:50, Syed Ahsan Ali wrote: >> Now it looks through the loopback address >> >> [pmdtest@pmd ~]$ mpirun --host compute-01-01,compute-01-06 --mca >> btl_tcp_if_exclude ib0 ring_c >> Process 0

Re: [OMPI users] mpirun fails across nodes

2014-11-13 Thread Syed Ahsan Ali
Gilles > > On 2014/11/13 16:59, Syed Ahsan Ali wrote: >> I don't see it running >> >> [pmdtest@compute-01-01 ~]$ netstat -nr >> Kernel IP routing table >> Destination Gateway Genmask Flags MSS Window irtt >> Iface >> 1

[OMPI users] mpirun fails across cluster

2015-02-27 Thread Syed Ahsan Ali
I am trying to run openmpi application on my cluster. But the mpirun fails, simple hostname command gives this error [pmdtest@hpc bin]$ mpirun --host compute-0-0 hostname -- Sorry! You were supposed to get help about: op

Re: [OMPI users] mpirun fails across cluster

2015-02-27 Thread Syed Ahsan Ali
tion is that the Rocks documentation was obscure > about this, not making clear the difference between > /export/apps and /share/apps. > > Issuing the Rocks commands: > "tentakel 'ls -d /export/apps'" > "tentakel 'ls -d /share/apps'" > ma

Re: [OMPI users] mpirun fails across cluster

2015-02-27 Thread Syed Ahsan Ali
[pmdtest@hpc bin]$ Ahsan On Fri, Feb 27, 2015 at 10:17 PM, Gus Correa wrote: > Hi Syed Ahsan Ali > > To avoid any leftovers and further confusion, > I suggest that you delete completely the old installation directory. > Then start fresh from the configure step with the pre

Re: [OMPI users] mpirun fails across cluster

2015-02-27 Thread Syed Ahsan Ali
Oh sorry. That is related to application. I need to recompile application too I guess. On Fri, Feb 27, 2015 at 10:44 PM, Syed Ahsan Ali wrote: > Dear Gus > > Thanks once again for suggestion. Yes I did that before installation > to new path. I am getting error now about some library

[OMPI users] Error building Openmpi (configure: error: C compiler cannot create executables)

2012-02-02 Thread Syed Ahsan Ali
. configure: error: in `/home/precis/opemmpi/openmpi-1.4.2': configure: error: C compiler cannot create executables See `config.log' for more details. -- Kind Regards Syed Ahsan Ali Bokhari

[OMPI users] Could not execute the executable "/home/MET/hrm/bin/hostlist": Exec format error

2012-02-27 Thread Syed Ahsan Ali
Dear All, I am running an application with mpirun but it gives following error, it is not picking up hostlist, there are other applications which run well with hostlist but it just gives following error with [pmdtest@pmd02 d00_dayfiles]$ tail -f *_hrm mpirun -np /home/MET/hrm/bin/hrm -

Re: [OMPI users] Could not execute the executable "/home/MET/hrm/bin/hostlist": Exec format error

2012-02-28 Thread Syed Ahsan Ali
tem with same configuration. On Tue, Feb 28, 2012 at 10:12 AM, PukkiMonkey wrote: > No of processes missing after -np > Should be something like: > mpirun -np 256 ./exec > > > > Sent from my iPhone > > On Feb 27, 2012, at 8:47 PM, Syed Ahsan Ali wrote: > >

Re: [OMPI users] Could not execute the executable "/home/MET/hrm/bin/hostlist": Exec format error

2012-02-29 Thread Syed Ahsan Ali
gt; argument for -np missing or argument not being numeric? > > > > Probably - I'm sure that the atol is returning zero, which should cause > an error output. I'll check. > > > > > >> > >> -- Reuti > >> > >> > >>> >

Re: [OMPI users] Could not execute the executable "/home/MET/hrm/bin/hostlist": Exec format error

2012-02-29 Thread Syed Ahsan Ali
ooks fine, can u add --mca btl_openib_verbose 1 to the mpirun > argument list, and see what it says? > > > > On Tue, Feb 28, 2012 at 10:15 PM, Syed Ahsan Ali wrote: > >> After creating new hostlist and making the scripts again it is working >> now and picking up the ho

Re: [OMPI users] Could not execute the executable "/home/MET/hrm/bin/hostlist": Exec format error

2012-02-29 Thread Syed Ahsan Ali
Sorry Jeff I couldn't get you point. On Wed, Feb 29, 2012 at 4:27 PM, Jeffrey Squyres wrote: > On Feb 29, 2012, at 2:17 AM, Syed Ahsan Ali wrote: > > > [pmdtest@pmd02 d00_dayfiles]$ echo ${MPIRUN} -np ${NPROC} -hostfile > $i{ABSDIR}/hostlist -mca btl sm,openib,self --mca

Re: [OMPI users] Could not execute the executable "/home/MET/hrm/bin/hostlist": Exec format errorI

2012-03-01 Thread Syed Ahsan Ali
outfile , > you were piping the output to the outfile instead of stdout. > > Sent from my iPhone > > On Feb 29, 2012, at 8:44 PM, Syed Ahsan Ali wrote: > > Sorry Jeff I couldn't get you point. > > On Wed, Feb 29, 2012 at 4:27 PM, Jeffrey Squyres wrote: >

[OMPI users] HRM problem

2012-04-24 Thread Syed Ahsan Ali
Dear All, I am having problem with running an application on Dell cluster . The model starts well but no further progress is shown. It just stuck. I have checked the systems, no apparent hardware error is there. Other open mpi applications are running well on the same cluster. I have tried running

Re: [OMPI users] HRM problem

2012-04-24 Thread Syed Ahsan Ali
sourceforge.net/ > > Scalable Grid Engine Support Program > http://www.scalablelogic.com/ > > > On Tue, Apr 24, 2012 at 12:49 AM, Syed Ahsan Ali > wrote: > > Dear All, > > > > I am having problem with running an application on Dell cluster . The > mode

Re: [OMPI users] HRM problem

2012-04-24 Thread Syed Ahsan Ali
ocesses is the job using? Are you oversubscribing your > processors? > What version of Open MPI are you using? > Have you tested all network connections? > It might help us to know the size of cluster you are running and what > type of network? > > --td > > On 4/24/2012

[OMPI users] undefined reference to `netcdf_mp_nf90_open_'

2012-06-26 Thread Syed Ahsan Ali
Dear All I am getting following error while compilation of an application. Seems like something related to netcdf and mpif90. Although I have compiled netcdf with mpif90 option, dont why this error is happening. Any hint would be highly appreciated. /home/pmdtest/cosmo/source/cosmo_110525_4.18/

Re: [OMPI users] undefined reference to `netcdf_mp_nf90_open_'

2012-06-27 Thread Syed Ahsan Ali
cally, you can take the mpif90 command that is being used to > generate these errors and add "--showme" to the end of it, and you'll see > what underlying compiler command is being executed under the covers. That > might help you understand exactly what is going on. &g

Re: [OMPI users] undefined reference to `netcdf_mp_nf90_open_'

2012-06-27 Thread Syed Ahsan Ali
gs are compiled with a different naming notation (check names in the > lib really contain the expected number of final underscores). > > I compiled cosmo 4.22 with openmpi and netcdf not long ago without any > problems. > > Best, > - Dima. > > 2012/6/26 Syed Ahsan Ali > &g

Re: [OMPI users] undefined reference to `netcdf_mp_nf90_open_'

2012-06-27 Thread Syed Ahsan Ali
h the > underlying Fortran compiler. > > > -- > Tim Prince > > __**_ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/**mailman/listinfo.cgi/users<http://www.open-mpi.org/mailman/listinfo.cgi/users> > -- Syed Ahsan Ali Bokhari Electronic Engineer

[OMPI users] Permission denied, please try again.

2012-08-01 Thread Syed Ahsan Ali
Dear All I am having problem while running an application on cluster. The application was working fine but now this error has arised . we used to run the application the same way with user pmdtest and there was no error. I dont know which permission it is asking for. Please help! [pmdtest@pmd02

Re: [OMPI users] UC Permission denied, please try again.

2012-08-02 Thread Syed Ahsan Ali
.org [mailto:users-boun...@open-mpi.org] *On > Behalf Of *Syed Ahsan Ali > *Sent:* 01 August 2012 08:45 > *To:* Open MPI Users > *Subject:* [OMPI users] Permission denied, please try again. > > ** ** > > > Dear All > > ** ** > > I am having problem while

Re: [OMPI users] UC Permission denied, please try again.

2012-08-02 Thread Syed Ahsan Ali
Yes all the compute nodes are NFS mounted with the master node, so everthing is same, all other nodes are accessible on ssh without password. On Thu, Aug 2, 2012 at 1:09 PM, John Hearns wrote: > On 02/08/2012, Syed Ahsan Ali wrote: > > Yes the issue has been diagnosed. I can ssh them

Re: [OMPI users] UC Permission denied, please try again.

2012-08-03 Thread Syed Ahsan Ali
Am 02.08.2012 um 17:57 schrieb Syed Ahsan Ali: > > > Yes all the compute nodes are NFS mounted with the master node, so > everthing is same, all other nodes are accessible on ssh without password. > > Are you using a queuing system? > > SSH could be setup to work from the mas

[OMPI users] Need solution- nodes can't find the paths.

2012-10-03 Thread Syed Ahsan Ali
Dear All I have a Dell Cluster running Platform Cluster Manager (PCM) , the compute nodes are NFS mounted with the master node. Storage (SAN) is mounted to the installer node only, the problem is that I am running a programme which uses data which resides on Storage , so as far as running the prog

Re: [OMPI users] Need solution- nodes can't find the paths.

2012-10-03 Thread Syed Ahsan Ali
re the cluster head node to export the SAN volume by NFS > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Research & Develo

Re: [OMPI users] Need solution- nodes can't find the paths.

2012-10-03 Thread Syed Ahsan Ali
//www.open-mpi.org/mailman/listinfo.cgi/users > -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Research & Development Division Pakistan Meteorological Department H-8/4, Islamabad. Phone # off +92518358714 Cell # +923155145014

[OMPI users] Infiniband errors

2012-11-28 Thread Syed Ahsan Ali
Dear All I have an application which is run using openmpi and uses infiniband flags. The application is a forecast model simulation. A frequent problem arises that the Infiniband mezzanine cards of servers become faulty (don't know the reason why it happens so frequent), the model simulation becom

Re: [OMPI users] Infiniband errors

2012-11-28 Thread Syed Ahsan Ali
_ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > -- Syed Ahsan Ali Bokhari Electronic Engineer (EE) Research & Development Division Pakistan Meteorological Department H-8/4, Islamabad. Phone # off +92518358714 Cell # +923155145014

Re: [OMPI users] Infiniband errors

2012-11-28 Thread Syed Ahsan Ali
; What type of infiniband card do you have? > What drivers are you using? > > ___ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > -- Syed Ahsan Ali Bokhari Electronic Engineer (

[OMPI users] mca_btl_tcp_frag_recv: readv failed: Connection timed out (110)

2012-11-29 Thread Syed Ahsan Ali
I recieve following error while running an application Does this represent any hardware issue? [compute-01-01.private.dns.zone][[60090,1],10][btl_tcp_frag.c:216:mca_btl_tcp_frag_recv] mca_btl_tcp_frag_recv: readv failed: Connection timed out (110) [compute-01-01.private.dns.zone][[60090,1],13][btl

Re: [OMPI users] Infiniband errors

2012-12-19 Thread Syed Ahsan Ali
Dear John I found this output of ibstatus on some nodes (most probably the problem causing) [root@compute-01-08 ~]# ibstatus Fatal error: device '*': sys files not found (/sys/class/infiniband/*/ports) Does this show any hardware or software issue? Thanks On Wed, Nov 28, 2012 at 3:17 PM, Jo

Re: [OMPI users] Infiniband errors

2012-12-20 Thread Syed Ahsan Ali
=12,pgrp=3876,timeout=300,minproto=5,maxproto=5,indirect 0 0 Thanks and Regards On Wed, Dec 19, 2012 at 8:38 PM, Yann Droneaud wrote: > Le mercredi 19 décembre 2012 à 12:12 +0500, Syed Ahsan Ali a écrit : > > Dear John > > > > I found this output of ibstatus on some nod

[OMPI users] configure: error: Could not run a simple Fortran 77 program. Aborting.

2013-02-01 Thread Syed Ahsan Ali
I am getting following error while bulding openmpi *** Fortran 90/95 compiler checking whether we are using the GNU Fortran compiler... yes checking whether gfortran accepts -g... yes checking if Fortran 77 compiler works... no **

Re: [OMPI users] configure: error: Could not run a simple Fortran 77 program. Aborting.

2013-02-01 Thread Syed Ahsan Ali
d here: > > http://www.open-mpi.org/community/help/ > > > On Feb 1, 2013, at 5:58 AM, Syed Ahsan Ali wrote: > > > > > I am getting following error while bulding openmpi > > > > *** Fortran 90/95 compiler > > checking whether we are using the GNU

Re: [OMPI users] configure: error: Could not run a simple Fortran 77 program. Aborting.

2013-02-06 Thread Syed Ahsan Ali
re:28765: error: Could not run a simple Fortran 77 program. > Aborting. > - > > Perhaps you need to set your LD_LIBRARY_PATH to point to where libgfortran > is located? > > In short: when you can run gfortran manually to compile/run trivial > fortran programs, then configure will suc

[OMPI users] error while loading shared libraries: libhdf5.so.7:

2013-02-07 Thread Syed Ahsan Ali
I have been running this program successfully before but some copy operation from /usr/ directory has caused this error. The program runs fine on the cores of the same machine. libhdf5.so.7 is also present. [pmdtest@pmd HadGEM]$ mpirun -np 32 -hostfile hostlist rca.x rca.x: error while loading sh

Re: [OMPI users] error while loading shared libraries: libhdf5.so.7:

2013-02-07 Thread Syed Ahsan Ali
Dear John Thanks for the reply. I'll need help of you people to solve this problem. I am not expert in HPC and this would be my learning as well. Let me add that the cluster is based on Platform Cluster Manager (PCM) by IBM Computing. The compute nodes are NFS mounted with the installer node. There

Re: [OMPI users] error while loading shared libraries: libhdf5.so.7:

2013-02-07 Thread Syed Ahsan Ali
Dear John Looking into output of ldd for master and compute nodes solved my problem. Thanks for such a simple solution. :) On Thu, Feb 7, 2013 at 9:37 PM, Syed Ahsan Ali wrote: > Dear John > Thanks for the reply. I'll need help of you people to solve this problem. > I am not exp

[OMPI users] Running openmpi jobs on two system

2013-03-22 Thread Syed Ahsan Ali
I have a very basic question. If we want to run mpirun job on two systems which are not part of cluster, then how we can make it possible. Can the host be specifiend on mpirun which is not compute node, rather a stand alone system. Thanks Ahsan

Re: [OMPI users] Running openmpi jobs on two system

2013-03-22 Thread Syed Ahsan Ali
? just as the compute nodes are nfs mounted with the installer node. Ahsan On Fri, Mar 22, 2013 at 3:33 PM, Reuti wrote: > Am 22.03.2013 um 10:14 schrieb Syed Ahsan Ali: > > > I have a very basic question. If we want to run mpirun job on two > systems which are not part of clust

Re: [OMPI users] Running openmpi jobs on two system-librdmacm: couldn't read ABI version

2013-03-26 Thread Syed Ahsan Ali
ain wrote: > > On Mar 22, 2013, at 3:42 AM, Syed Ahsan Ali wrote: > > Actually due to some data base corruption I am not able to add any new > node to cluster from the installer node. So I want to run parallel job on > more nodes without adding them to existing cluster. > You

Re: [OMPI users] Running openmpi jobs on two system-librdmacm: couldn't read ABI version

2013-03-26 Thread Syed Ahsan Ali
It may be because the other system is running upgraded version of linux which is not having infiniband drivers. Any solution? On Tue, Mar 26, 2013 at 12:42 PM, Syed Ahsan Ali wrote: > Tried this but mpirun exits with this error > > mpirun -np 40 /home/MET/hrm/bin/hrm > librdmacm: c

[OMPI users] Fwd: compilation aborted for Handler.cpp (code 2)

2013-09-16 Thread Syed Ahsan Ali
Dear All I am trying to compile openmpi-1.6.5 on fc16.x86_64 with icc and ifort but getting the subject error. config.out and make.out is attached. Following command was used for configure ./configure CC=icc CXX=icpc FC=ifort F77=ifort F90=ifort --prefix=/home/openmpi_gfortran -enable-mpi-f90

Re: [OMPI users] compilation aborted for Handler.cpp (code 2)

2013-09-18 Thread Syed Ahsan Ali
Please find attached again. On Tue, Sep 17, 2013 at 11:35 AM, Jeff Squyres (jsquyres) wrote: > On Sep 16, 2013, at 9:00 AM, Syed Ahsan Ali wrote: > >> I am trying to compile openmpi-1.6.5 on fc16.x86_64 with icc and ifort >> but getting the subject error. config.out and ma

[OMPI users] Fwd: compilation aborted for Handler.cpp (code 2)

2013-09-19 Thread Syed Ahsan Ali
I am trying to compile openmpi-1.6.5 on fc16.x86_64 with icc and ifort but getting the subject error. config.out and make.out is attached. Following command was used for configure ./configure CC=icc CXX=icpc FC=ifort F77=ifort F90=ifort --prefix=/home/openmpi_gfortran -enable-mpi-f90 --enable-mpi

Re: [OMPI users] Fwd: compilation aborted for Handler.cpp (code 2)

2013-09-20 Thread Syed Ahsan Ali
Output of make V=1 is attached. Again same error. If intel compiler is using C++ headers from gfortran then how can we avoid this. On Fri, Sep 20, 2013 at 11:07 AM, Bert Wesarg wrote: > Hi, > > On Fri, Sep 20, 2013 at 4:49 AM, Syed Ahsan Ali wrote: >> I am trying to compile o

Re: [OMPI users] compilation aborted for Handler.cpp (code 2)

2013-09-22 Thread Syed Ahsan Ali
essing* that this is a problem with your local icpc installation. > > Can you compile / run other C++ codes that use the STL with icpc? > > > On Sep 20, 2013, at 6:59 AM, Syed Ahsan Ali wrote: > >> Output of make V=1 is attached. Again same error. If intel compiler is &g

Re: [OMPI users] compilation aborted for Handler.cpp (code 2)

2013-09-27 Thread Syed Ahsan Ali
> > > On Sep 22, 2013, at 10:40 AM, Syed Ahsan Ali wrote: > >> Its ok Jeff. >> I am not sure about other C++ codes and STL with icpc because it never >> happened and I don't know anything about STL.(pardon my less >> knowledge). What do you suggest in this

Re: [OMPI users] compilation aborted for Handler.cpp (code 2)

2013-10-03 Thread Syed Ahsan Ali
Dear Jeff Thank you for explaining. Please find attached test logs which explain the error. Regards On Fri, Sep 27, 2013 at 6:12 PM, Jeff Squyres (jsquyres) wrote: > On Sep 27, 2013, at 6:53 AM, Syed Ahsan Ali wrote: > >> Thank you very much Jeff. It worked now. > > Good. &g

[OMPI users] Error message related to infiniband

2014-01-19 Thread Syed Ahsan Ali
Dear All I am getting infiniband errors while running mpirun applications on cluster. I get these errors even when I don't include infiniband usage flags in mpirun command. Please guide mpirun -np 72 -hostfile hostlist ../bin/regcmMPI regcm.in

Re: [OMPI users] Error message related to infiniband

2014-01-19 Thread Syed Ahsan Ali
case TCP. > However, the TCP transport is unable to create a socket to the remote host. > The most likely cause is a firewall, so you might want to check that and > turn it off. > > > On Jan 19, 2014, at 4:19 AM, Syed Ahsan Ali wrote: > > Dear All > > I am getting infi

Re: [OMPI users] Error message related to infiniband

2014-01-20 Thread Syed Ahsan Ali
t; > Also, if you use host names in your hostfile, I guess they need to be able > to > resolve the names into IP addresses. > Check if your /etc/hosts file, DNS server, or whatever you > use for name resolution, is correct and consistent across the cluster. > > On Jan 19, 2014, at 10:

[OMPI users] Fatal Error: Cannot read module file 'mpi.mod' opened at (1), because it was created by a different version of GNU Fortran

2015-07-28 Thread Syed Ahsan Ali
I am getting this error during installation of an application. Apparently the error seems to be complaining about openmpi being compiled with different version of gnu fortran but I am sure that it was compiled with gcc-4.9.2. The same is also being used for application compilation. I am using open

Re: [OMPI users] Fatal Error: Cannot read module file 'mpi.mod' opened at (1), because it was created by a different version of GNU Fortran

2015-07-28 Thread Syed Ahsan Ali
is likely mpifort simply run gfortran, and your PATH does not point to > gfortran 4.9.2 > > Cheers, > > Gilles > > > On 7/28/2015 1:47 PM, Syed Ahsan Ali wrote: >> >> I am getting this error during installation of an application. >> Apparently the error se