[OMPI users] Change behavior of --output-filename

2019-11-12 Thread Max Sagebaum via users
Hello @ all, Short question: How to select what is the behavior of --output-filename? Long question: I am used to that the option --output-filename file.out will generate the files file.out.0 file.out.1 etc. The default logic switched to file.out/1/rank.*/stdout man mpirun gives me: -output-fil

Re: [OMPI users] Change behavior of --output-filename

2019-11-12 Thread Ralph Castain via users
The man page is simply out of date - seeĀ  https://github.com/open-mpi/ompi/issues/7095 for further thinking On Nov 12, 2019, at 1:26 AM, Max Sagebaum via users mailto:users@lists.open-mpi.org> > wrote: Hello @ all, Short question: How to select what is the behavior of --output-filename? Long q

[OMPI users] optimized cuda rdma mca-params.conf?

2019-11-12 Thread Douglas Duckworth via users
Good Morning We are running OpenMPI 4.0.2 on several CentOS 7.8 nodes with V-100s, driver version 418.87.01, persistent mode enabled, and nvidia peer memory module present. We are seeing low penalty for running jobs across multiple GPU nodes. I am relatively new to tuning OpenMPI so I wanted

Re: [OMPI users] Change behavior of --output-filename

2019-11-12 Thread Jeff Squyres (jsquyres) via users
On Nov 12, 2019, at 9:17 AM, Ralph Castain via users mailto:users@lists.open-mpi.org>> wrote: The man page is simply out of date - see https://github.com/open-mpi/ompi/issues/7095 for further thinking And https://github.com/open-mpi/ompi/issues/7133 for what might happen going forward. -- Jef

[OMPI users] qelr_alloc_context: Failed to allocate context for device.

2019-11-12 Thread Matteo Guglielmi via users
I'm trying to get openmpi over RoCE working with this setup: card: https://www.gigabyte.com/Accessory/CLNOQ42-rev-10#ov OS: CentOS 7.7 modinfo qede filename: /lib/modules/3.10.0-1062.4.1.el7.x86_64/kernel/drivers/net/ethernet/qlogic/qede/qede.ko.xz version:8.37.0.20 license: