Re: [OMPI users] MXM problem

2015-05-28 Thread Тимур Исмагилов
I have already sent it On Thu, May 28, 2015 at 10:21 AM. 28 мая 2015 г. 20:07 пользователь Mike Dubman написал:it is fine to recompile OMPI from HPCx to apply site default (choice of job scheduler for example, OMPI from HPCX compiled with ssh support only, etc.).If ssh launcher is working on your

Re: [OMPI users] MXM problem

2015-05-28 Thread Mike Dubman
it is fine to recompile OMPI from HPCx to apply site default (choice of job scheduler for example, OMPI from HPCX compiled with ssh support only, etc.). If ssh launcher is working on your system - than OMPI from HPCX should work as well. could you please send to Alina (in cc) the command line and

Re: [OMPI users] MXM problem

2015-05-28 Thread Timur Ismagilov
Is it normal to rebuild openmpi from hpcx? Why binaries don't work? Четверг, 28 мая 2015, 14:01 +03:00 от Alina Sklarevich : >Thank you for this info. > >If 'yalla' now works for you, is there anything that is still wrong? > >Thanks, >Alina. > >On Thu, May 28, 2015 at 10:21 AM, Timur Ismagilov

Re: [OMPI users] MXM problem

2015-05-28 Thread Timur Ismagilov
I'm sorry for the delay . Here it is: ( I used 5 min time limit ) /gpfs/NETHOME/oivt1/nicevt/itf/sources/hpcx-v1.3.330-icc-OFED-1.5.4.1-redhat6.2-x86_64/ompi-mellanox-v1.8/bin/mpirun -x LD_PRELOAD=/gpfs/NETHOME/oivt1/nicevt/itf/sources/hpcx-v1.3.330-icc-OFED-1.5.4.1-   redhat6.2-x86_64

Re: [OMPI users] MXM problem

2015-05-26 Thread Timur Ismagilov
It does not work for single node: 1) host: $  $HPCX_MPI_DIR/bin/mpirun -x MXM_IB_PORTS=mlx4_0:1 -x MXM_SHM_KCOPY_MODE=off -host node5 -mca pml yalla -x MXM_TLS=ud,self,shm --prefix $HPCX_MPI_DIR -mca plm_base_verbose 5  -mca oob_base_verbose 10 -mca rml_base_verbose 10 --debug-daemons  -np 1 .

Re: [OMPI users] MXM problem

2015-05-26 Thread Timur Ismagilov
1. mxm_perf_test - OK. 2. no_tree_spawn  - OK. 3. ompi yalla and "--mca pml cm --mca mtl mxm" still does not work (I use prebuild ompi-1.8.5 from hpcx-v1.3.330) 3.a) host:$  $HPCX_MPI_DIR/bin/mpirun -x MXM_IB_PORTS=mlx4_0:1 -x MXM_SHM_KCOPY_MODE=off -host node5,node153  --mca pml cm --mca mtl

Re: [OMPI users] MXM problem

2015-05-25 Thread Mike Dubman
scif is a OFA device from Intel. can you please select export MXM_IB_PORTS=mlx4_0:1 explicitly and retry On Mon, May 25, 2015 at 8:26 PM, Timur Ismagilov wrote: > Hi, Mike, > that is what i have: > > $ echo $LD_LIBRARY_PATH | tr ":" "\n" > /gpfs/NETHOME/oivt1/nicevt/itf/sources/hpcx-v1.3.0-327-i

Re: [OMPI users] MXM problem

2015-05-25 Thread Timur Ismagilov
Hi, Mike, that is what i have: $ echo $LD_LIBRARY_PATH | tr ":" "\n" /gpfs/NETHOME/oivt1/nicevt/itf/sources/hpcx-v1.3.0-327-icc-OFED-1.5.3-redhat6.2/fca/lib     /gpfs/NETHOME/oivt1/nicevt/itf/sources/hpcx-v1.3.0-327-icc-OFED-1.5.3-redhat6.2/hcoll/lib       /gpfs/NETHOME/oivt1/n

Re: [OMPI users] MXM problem

2015-05-25 Thread Mike Dubman
Hi Timur, seems that yalla component was not found in your OMPI tree. can it be that your mpirun is not from hpcx? Can you please check LD_LIBRARY_PATH,PATH, LD_PRELOAD and OPAL_PREFIX that it is pointing to the right mpirun? Also, could you please check that yalla is present in the ompi_info -l 9

Re: [OMPI users] MXM problem

2015-05-25 Thread Timur Ismagilov
I can password-less ssh to all nodes: base$ ssh node1 node1$ssh node2 Last login: Mon May 25 18:41:23 node2$ssh node3 Last login: Mon May 25 16:25:01 node3$ssh node4 Last login: Mon May 25 16:27:04 node4$ Is this correct? In ompi-1.9 i do not have no-tree-spawn problem. Понедельник, 25 мая 20

Re: [OMPI users] MXM problem

2015-05-25 Thread Ralph Castain
I can’t speak to the mxm problem, but the no-tree-spawn issue indicates that you don’t have password-less ssh authorized between the compute nodes > On May 25, 2015, at 8:55 AM, Timur Ismagilov wrote: > > Hello! > > I use ompi-v1.8.4 from hpcx-v1.3.0-327-icc-OFED-1.5.3-redhat6.2; > OFED-1.5.4