Re: [OMPI users] qelr_alloc_context: Failed to allocate context for device.

2019-11-13 Thread Llolsten Kaonga via users
Hello Mateo, What version of openmpi are you running? Also, the OFED-4.17-1 release notes do not claim support for CentOS 7.7. It supports CentOS 7.6. Apologies if you have already tried CentOS 7.6. We have been able to run openmpi (earlier this month): OS: CentOS 7.6 mpir

Re: [OMPI users] openmpi-4.0.1 build error

2019-10-03 Thread Llolsten Kaonga via users
thank you. -- Llolsten From: users On Behalf Of Llolsten Kaonga via users Sent: Wednesday, October 02, 2019 12:46 PM To: 'Open MPI Users' Cc: Llolsten Kaonga Subject: [OMPI users] openmpi-4.0.1 build error Hello all, OS: CentOS 7.7 OFED: MLNX_OFED_LINUX-4.7-1.0.0.1 R

[OMPI users] openmpi-4.0.1 build error

2019-10-02 Thread Llolsten Kaonga via users
Hello all, OS: CentOS 7.7 OFED: MLNX_OFED_LINUX-4.7-1.0.0.1 Running the command "make all install" returns: In file included from btl_uct_device_context.h:16:0, from btl_uct_component.c:40: btl_uct_rdma.h: In function 'mca_btl_uct_get_rkey': btl_uct_rdma.h

Re: [OMPI users] OpenMPI v4.0.0 signal 11 (Segmentation fault)

2019-02-20 Thread Llolsten Kaonga
Hello Adam, During the InfiniBand Plugfest 34 event last October, we found that mpirun hangs on FDR systems if you run with the openib btl option. Yossi Itigin (@Mellanox) suggested that we run using the following options: --mca btl self,vader --mca pml ucx -x UCX_RC_PATH_MTU=4096
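
A minimal sketch of how the options quoted above might be assembled into a full command line. The excerpt is truncated, so further options may have followed in the original message; the process count and the program name `./my_mpi_app` are hypothetical placeholders, not values from the thread. The script only prints the command (a dry run) so it can be inspected before being used on a real cluster.

```shell
#!/bin/sh
# Options as quoted in the message (excerpt is truncated; more may follow).
UCX_OPTS="--mca btl self,vader --mca pml ucx -x UCX_RC_PATH_MTU=4096"

# Hypothetical values for illustration only; not from the thread.
NP=2
APP=./my_mpi_app

# Dry run: build the command string and print it instead of executing it.
CMD="mpirun $UCX_OPTS -np $NP $APP"
echo "$CMD"
```

Here `--mca pml ucx` selects the UCX point-to-point layer in place of the openib btl, and `-x` exports the `UCX_RC_PATH_MTU` environment variable to the launched processes.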

Re: [OMPI users] Difficulties when trying to download files?

2018-09-25 Thread Llolsten Kaonga
Hello Jorge, What happens when you go to this link https://www.open-mpi.org/software/ompi/v3.1/ and click on the file openmpi-3.1.2.tar.gz in the table? I ask because I am able to download the tarball without a problem. Maybe the problem you were seeing was temporary. Cheers. -- Llolsten ---

[OMPI users] Unable to open a shared object libsmartio-rdmav17.so

2018-08-24 Thread Llolsten Kaonga
Hello all, This may be a silly question but I hope that someone does know the answer. We use Open MPI to run the Intel Benchmarks to test InfiniBand and RoCE network fabrics. We recently installed OFED-4.17 and when we attempt to run the tests, we see the error below. Command: /usr/loc

Re: [OMPI users] Big jump from OFED 1.5.4.1 -> recent (stable). Any suggestions?

2016-06-15 Thread Llolsten Kaonga
e latest stable RHEL 6.x (probably 6.7, not sure if we will go with 6.8). Did you have to recompile Open MPI stacks or any of the existing MPI software? Thank you for your input! -Memo On 6/13/16 10:57 PM, Llolsten Kaonga wrote: Hello Mehmet, OFED is now around 3.18.2-rc2 and there is

Re: [OMPI users] Big jump from OFED 1.5.4.1 -> recent (stable). Any suggestions?

2016-06-15 Thread Llolsten Kaonga
ers. If it is Mellanox, is it not better to use the MLNX_OFED? This information may help us build our cluster. Hence, asking. Thanks, - Sreenidhi. On Wed, Jun 15, 2016 at 1:17 PM, Peter Kjellström <c...@nsc.liu.se> wrote: On Tue, 14 Jun 2016 13:18:33 -0400 "Llolsten Ka

Re: [OMPI users] Big jump from OFED 1.5.4.1 -> recent (stable). Any suggestions?

2016-06-14 Thread Llolsten Kaonga
Hello Grigory, I am not sure what Redhat does exactly but when you install the OS, there is always an InfiniBand Support module during the installation process. We never check/install that module when we do OS installations because it is usually several versions of OFED behind (almost obsolete).

Re: [OMPI users] Big jump from OFED 1.5.4.1 -> recent (stable). Any suggestions?

2016-06-13 Thread Llolsten Kaonga
Hello Mehmet, OFED is now around 3.18.2-rc2 and there is talk of an rc3. We have used many different versions of OFED, and we are now running OFED 3.18.1 rc2 with the latest version of Open MPI with no trouble (OS is CentOS 7.2). What version of Open MPI are you planning to use? What OS,

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-06-03 Thread Llolsten Kaonga
0 for external access and eth1 for private network) and MPI should only use the internal network, then you can allow all traffic on the internal port, and mpirun --mca oob_tcp_if_include eth1 --mca btl_tcp_if_include eth1 ... Cheers, Gilles On Wednesday, May 11, 2016, Llolsten Kaonga mai
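
The suggestion quoted above can be sketched as a short dry-run script. The `mpirun` options are the ones given in the message; the `firewall-cmd` step is an assumption about one way to "allow all traffic on the internal port" with firewalld (the excerpt does not show a firewall command), and the process count and `./my_mpi_app` are hypothetical placeholders.

```shell
#!/bin/sh
# Private interface, as in the quoted example.
IFACE=eth1

# Hypothetical values for illustration only; not from the thread.
NP=2
APP=./my_mpi_app

# Assumption: placing the private interface in firewalld's "trusted" zone
# is one way to allow all traffic on it. The thread excerpt does not
# specify how the firewall rule was configured.
FW_CMD="firewall-cmd --permanent --zone=trusted --add-interface=$IFACE"

# Options from the message: keep both the out-of-band (oob) channel and
# the TCP btl on the private interface.
MPI_CMD="mpirun --mca oob_tcp_if_include $IFACE --mca btl_tcp_if_include $IFACE -np $NP $APP"

# Dry run: print the commands rather than executing them.
echo "$FW_CMD"
echo "firewall-cmd --reload"
echo "$MPI_CMD"
```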

Re: [OMPI users] ORTE has lost communication

2016-05-16 Thread Llolsten Kaonga
Hello Guilherme, This may be off but try running your mpirun command with the option “--tag-output”. If you see a “broken pipe”, then your issue may be firewall related. You could then check the thread “Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled” for how
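
A rough sketch of the check described above: run with tagged output and scan the result for "broken pipe". The helper name `check_broken_pipe` and the program name in the usage comment are hypothetical; the helper only reads text on standard input, so it can be tried on a saved log without a cluster.

```shell
#!/bin/sh
# Hypothetical helper: reads mpirun output on stdin and reports whether
# a "broken pipe" message appears (case-insensitive match).
check_broken_pipe() {
    if grep -qi "broken pipe"; then
        echo "possible firewall issue"
    else
        echo "no broken pipe seen"
    fi
}

# Example usage (program name is a placeholder):
#   mpirun --tag-output -np 2 ./my_mpi_app 2>&1 | check_broken_pipe
```

A match is only a hint that the firewall may be involved, as the message says; it does not rule out other causes.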

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-11 Thread Llolsten Kaonga
while Cheers, Gilles On Wednesday, May 11, 2016, Llolsten Kaonga <l...@soft-forge.com> wrote: Hello Gilles/Jeff, Thank you for clarifying this. We have three ports but the RoCE traffic is supposed to use one of the internal ports. However, we do allow use of one of t

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-11 Thread Llolsten Kaonga
pirun --mca oob_tcp_if_include eth1 --mca btl_tcp_if_include eth1 ... Cheers, Gilles On Wednesday, May 11, 2016, Llolsten Kaonga <l...@soft-forge.com> wrote: Hello Jeff, I think what you suggest is likely exactly what we want to see happen. We run the interop tests with a

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-10 Thread Llolsten Kaonga
b (and the machine where mpirun is invoked, if that is a different machine). You could also open your firewall to trust random TCP connections just between the servers in your cluster. > On May 10, 2016, at 3:44 PM, Llolsten Kaonga wrote: > > Hello Orion, > > I actually rathe

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-10 Thread Llolsten Kaonga
Sent: Tuesday, May 10, 2016 3:31 PM To: Open MPI Users Subject: Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled On 05/10/2016 09:24 AM, Llolsten Kaonga wrote: > Hello Durga, > > As I mentioned earlier, up to version 1.8.2, we would just disable

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-10 Thread Llolsten Kaonga
how did this ever work with a firewall enabled? I have, in the past, faced similar situations, thus I am curious to know the answer as well. Thanks Durga The surgeon general advises you to eat right, exercise regularly and quit ageing. On Mon, May 9, 2016 at 2:31 PM, Llolste

[OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-09 Thread Llolsten Kaonga
Hello all, We've been running openmpi for a long time and up to version 1.8.2 and CentOS 6.7 with commands such as the one below: /usr/local/bin/mpirun --allow-run-as-root --mca btl openib,self,sm --mca pml ob1 -np 2 -np 8 -hostfile /root/mpi-hosts /usr/local/bin/IMB-MPI1 To be able to run