[OMPI users] Help deciphering error message

2024-03-07 Thread Jeffrey Layton via users
Good afternoon, I'm getting an error message I'm not sure how to use to debug an issue. I'll try to give you all of the pertinent about the setup, but I didn't build the system nor install the software. It's an NVIDIA SuperPod system with Base Command Manager 10.0. I'm building IOR but I'm really

[OMPI users] Help running OpenMPI in prrte

2023-01-11 Thread Jonathon Anderson via users
I am getting an error and crash when trying to use PRRTE to run a containerized instance of OSU Micro-Benchmarks built against OpenMPI. The same container works using PMI2 support in Slurm. Full details are available at https://github.com/openpmix/prrte/issues/1635, but they suggested I reach out t

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-26 Thread Angel de Vicente via users
Hello, thanks for your help and suggestions. At the end it was no issue with OpenMPI or with any other system stuff, but rather a single line in our code. I thought I was doing the tests with the -fbounds-check option, but it turns out I was not, arrrghh!! At some point I was writing outside one

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-23 Thread Protze, Joachim via users
achim From: users on behalf of Angel de Vicente via users Sent: Friday, April 22, 2022 10:31:38 PM To: Keller, Rainer Cc: Angel de Vicente ; Open MPI Users Subject: Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bi

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-22 Thread George Bosilca via users
gt; Jeff Squyres > jsquy...@cisco.com > > > From: users on behalf of Cici Feng via > users > Sent: Friday, April 22, 2022 5:30 AM > To: Open MPI Users > Cc: Cici Feng > Subject: Re: [OMPI users] help with M1 chip macOS openMPI install

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Angel de Vicente via users
Hello, "Keller, Rainer" writes: > You’re using MPI_Probe() with Threads; that’s not safe. > Please consider using MPI_Mprobe() together with MPI_Mrecv(). many thanks for the suggestion. I will try with the M variants, though I was under the impression that mpi_probe() was OK as far as one made

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Angel de Vicente via users
Hello Jeff, "Jeff Squyres (jsquyres)" writes: > With THREAD_FUNNELED, it means that there can only be one thread in > MPI at a time -- and it needs to be the same thread as the one that > called MPI_INIT_THREAD. > > Is that the case in your app? the master rank (i.e. 0) never creates threads,

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Keller, Rainer via users
Dear Angel, You’re using MPI_Probe() with Threads; that’s not safe. Please consider using MPI_Mprobe() together with MPI_Mrecv(). However, you mention running with only one Thread — setting OMP_NUM_THREADS=1, assuming you didn’t set using omp_set_num_threads() again, or use num_threads() clause…

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-22 Thread Jeff Squyres (jsquyres) via users
22 5:30 AM To: Open MPI Users Cc: Cici Feng Subject: Re: [OMPI users] help with M1 chip macOS openMPI installation Hi George, Thanks so much with the tips and I have installed Rosetta in order for my computer to run the Intel software. However, the same error appears as I tried to make the fi

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Jeff Squyres (jsquyres) via users
o.com From: users on behalf of Angel de Vicente via users Sent: Friday, April 22, 2022 10:54 AM To: Gilles Gouaillardet via users Cc: Angel de Vicente Subject: Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none Thanks Gilles, G

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Angel de Vicente via users
Thanks Gilles, Gilles Gouaillardet via users writes: > You can first double check you > MPI_Init_thread(..., MPI_THREAD_MULTIPLE, ...) my code uses "mpi_thread_funneled" and OpenMPI was compiled with MPI_THREAD_MULTIPLE support: , | ompi_info | grep -i thread | Thread support: p

Re: [OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Gilles Gouaillardet via users
You can first double check you MPI_Init_thread(..., MPI_THREAD_MULTIPLE, ...) And the provided level is MPI_THREAD_MULTIPLE as you requested. Cheers, Gilles On Fri, Apr 22, 2022, 21:45 Angel de Vicente via users < users@lists.open-mpi.org> wrote: > Hello, > > I'm running out of ideas, and wonde

[OMPI users] Help diagnosing MPI+OpenMP application segmentation fault only when run with --bind-to none

2022-04-22 Thread Angel de Vicente via users
Hello, I'm running out of ideas, and wonder if someone here could have some tips on how to debug a segmentation fault I'm having with my application [due to the nature of the problem I'm wondering if the problem is with OpenMPI itself rather than my app, though at this point I'm not leaning strong

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-22 Thread Cici Feng via users
org> wrote: > >> A little more color on Gilles' answer: I believe that we had some Open >> MPI community members work on adding M1 support to Open MPI, but Gilles is >> absolutely correct: the underlying compiler has to support the M1, or you >> won't get an

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-21 Thread George Bosilca via users
2 6:11 AM > To: Open MPI Users > Cc: Cici Feng > Subject: Re: [OMPI users] help with M1 chip macOS openMPI installation > > Gilles, > > Thank you so much for the quick response! > openMPI installed by brew is compiled on gcc and gfortran using the > original compil

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-21 Thread Jeff Squyres (jsquyres) via users
@cisco.com From: users on behalf of Cici Feng via users Sent: Thursday, April 21, 2022 6:11 AM To: Open MPI Users Cc: Cici Feng Subject: Re: [OMPI users] help with M1 chip macOS openMPI installation Gilles, Thank you so much for the quick response! openMPI installed

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-21 Thread Cici Feng via users
Gilles, Thank you so much for the quick response! openMPI installed by brew is compiled on gcc and gfortran using the original compilers by Apple. Now I haven't figured out how to use this gcc openMPI for the inversion software :( Given by your answer, I think I'll pause for now with the M1-intel

Re: [OMPI users] help with M1 chip macOS openMPI installation

2022-04-21 Thread Gilles Gouaillardet via users
Cici, I do not think the Intel C compiler is able to generate native code for the M1 (aarch64). The best case scenario is it would generate code for x86_64 and then Rosetta would be used to translate it to aarch64 code, and this is a very downgraded solution. So if you really want to stick to the

[OMPI users] help with M1 chip macOS openMPI installation

2022-04-21 Thread Cici Feng via users
Hi there, I am trying to install an electromagnetic inversion software (MARE2DEM) of which the intel C compilers and open-MPI are considered as the prerequisite. However, since I am completely new to computer science and coding, together with some of the technical issues of the computer I am build

Re: [OMPI users] [Help] Must orted exit after all spawned proecesses exit

2021-05-19 Thread Ralph Castain via users
To answer your specific questions: The backend daemons (orted) will not exit until all locally spawned procs exit. This is not configurable - for one thing, OMPI procs will suicide if they see the daemon depart, so it makes no sense to have the daemon fail if a proc terminates. The logic behind

Re: [OMPI users] [Help] Must orted exit after all spawned proecesses exit

2021-05-17 Thread Jeff Squyres (jsquyres) via users
FYI: general Open MPI questions are better sent to the user's mailing list. Up through the v4.1.x series, the "orted" is a general helper process that Open MPI uses on the back-end. It will not quit until all of its children have died. Open MPI's run time is designed with the intent that some

[OMPI users] Help with MPI and macOS Firewall

2021-03-18 Thread Matt Thompson via users
All, This isn't specifically an Open MPI issue, but as that is the MPI stack I use on my laptop, I'm hoping someone here might have a possible solution. (I am pretty sure something like MPICH would trigger this as well.) Namely, my employer recently did something somewhere so that now *any* MPI a

Re: [OMPI users] help

2020-12-14 Thread Lesiano 16 via users
Thanks for the answer On Mon, Dec 14, 2020 at 4:20 PM Jeff Squyres (jsquyres) wrote: > On Dec 12, 2020, at 4:58 AM, Lesiano 16 via users < > users@lists.open-mpi.org> wrote: > > > > My question is, can I assume that when skipping the beginning of the > file that MPI will fill up with zeros? Or

Re: [OMPI users] help

2020-12-14 Thread Jeff Squyres (jsquyres) via users
On Dec 12, 2020, at 4:58 AM, Lesiano 16 via users wrote: > > My question is, can I assume that when skipping the beginning of the file > that MPI will fill up with zeros? Or is it implementation dependent? > > I have read the standard, but I could not found anything meaningful expected > for:

[OMPI users] help

2020-12-12 Thread Lesiano 16 via users
Hi, for the following code: #include int main() { MPI_Init(NULL, NULL); int world_rank, world_size; MPI_Comm_rank(MPI_COMM_WORLD, &world_rank); MPI_Comm_size(MPI_COMM_WORLD, &world_size); MPI_Offset offset; MPI_Status status; MPI_File fh; MPI_File_open(MPI_COMM

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-25 Thread Adam Simpson via users
Best, Adam From: Matt Thompson Sent: Tuesday, February 25, 2020 5:54 AM To: Adam Simpson Cc: Open MPI Users Subject: Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI External email: Use caution opening links or attachments Adam, A couple questions. First,

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-25 Thread Matt Thompson via users
-- > *From:* users on behalf of Matt > Thompson via users > *Sent:* Monday, February 24, 2020 5:15 PM > *To:* Open MPI Users > *Cc:* Matt Thompson > *Subject:* Re: [OMPI users] Help with One-Sided Communication: Works in > Intel MPI, Fails in O

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-24 Thread Adam Simpson via users
le it with "sysctl -w kernel.yama.ptrace_scope=0". Adam From: users on behalf of Matt Thompson via users Sent: Monday, February 24, 2020 5:15 PM To: Open MPI Users Cc: Matt Thompson Subject: Re: [OMPI users] Help with One-Sided Communication: Works i

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-24 Thread Matt Thompson via users
> “Read -1, expected 48, errno = 1” > > does not stem from MPI I/O, at least not from the ompio library. What file > system did you use for these tests? > > > > Thanks > > Edgar > > > > *From:* users *On Behalf Of *Matt > Thompson via users > *Sent:* Mo

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-24 Thread Matt Thompson via users
On Mon, Feb 24, 2020 at 4:57 PM Gabriel, Edgar wrote: > I am not an expert for the one-sided code in Open MPI, I wanted to comment > briefly on the potential MPI -IO related item. As far as I can see, the > error message > > > > “Read -1, expected 48, errno = 1” > > does not stem from MPI I/O, at

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-24 Thread Nathan Hjelm via users
s not stem from MPI I/O, at least not from the ompio library. What file > system did you use for these tests? > > Thanks > Edgar > > From: users On Behalf Of Matt Thompson via > users > Sent: Monday, February 24, 2020 1:20 PM > To: users@lists.open-mpi.org >

Re: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-24 Thread Gabriel, Edgar via users
these tests? Thanks Edgar From: users On Behalf Of Matt Thompson via users Sent: Monday, February 24, 2020 1:20 PM To: users@lists.open-mpi.org Cc: Matt Thompson Subject: [OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI All, My guess is this is a "I

[OMPI users] Help with One-Sided Communication: Works in Intel MPI, Fails in Open MPI

2020-02-24 Thread Matt Thompson via users
All, My guess is this is a "I built Open MPI incorrectly" sort of issue, but I'm not sure how to fix it. Namely, I'm currently trying to get an MPI project's CI working on CircleCI using Open MPI to run some unit tests (on a single node, so need some oversubscribe). I can build everything just fin

Re: [OMPI users] HELP: openmpi is not using the specified infiniband interface !!

2020-01-14 Thread George Bosilca via users
According to the error message you are using MPICH not Open MPI. George. On Tue, Jan 14, 2020 at 5:53 PM SOPORTE MODEMAT via users < users@lists.open-mpi.org> wrote: > Hello everyone. > > > > I would like somebody help me to figure out how can I make that the > openmpi use the infiniband inte

Re: [OMPI users] HELP: openmpi is not using the specified infiniband interface !!

2020-01-14 Thread Gilles Gouaillardet via users
Soporte, The error message is from MPICH! If you intend to use Open MPI, fix your environment first Cheers, Gilles Sent from my iPod > On Jan 15, 2020, at 7:53, SOPORTE MODEMAT via users > wrote: > > Hello everyone. > > I would like somebody help me to figure out how can I make that the

[OMPI users] HELP: openmpi is not using the specified infiniband interface !!

2020-01-14 Thread SOPORTE MODEMAT via users
Hello everyone. I would like somebody help me to figure out how can I make that the openmpi use the infiniband interface that I specify with the command: /opt/mpi/openmpi_intel-2.1.1/bin/mpirun --mca btl self,openib python mpi_hola.py But when I print the hostname and the ip address of the int

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-23 Thread Matt Thompson
gt; > > _MAC > > > > *From:* users [mailto:users-boun...@lists.open-mpi.org] *On Behalf Of *Matt > Thompson > *Sent:* Tuesday, January 22, 2019 6:04 AM > *To:* Open MPI Users > *Subject:* Re: [OMPI users] Help Getting Started with Open MPI and PMIx > and UCX > >

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-22 Thread Cabral, Matias A
: Open MPI Users Subject: Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX Well, By turning off UCX compilation per Howard, things get a bit better in that something happens! It's not a good something, as it seems to die with an infiniband error. As this is an Omnipath s

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-22 Thread Matt Thompson
Well, By turning off UCX compilation per Howard, things get a bit better in that something happens! It's not a good something, as it seems to die with an infiniband error. As this is an Omnipath system, is OpenMPI perhaps seeing libverbs somewhere and compiling it in? To wit: (1006)(master) $ mpi

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-20 Thread Howard Pritchard
Hi Matt Definitely do not include the ucx option for an omnipath cluster. Actually if you accidentally installed ucx in it’s default location use on the system Switch to this config option —with-ucx=no Otherwise you will hit https://github.com/openucx/ucx/issues/750 Howard Gilles Gouaillard

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-19 Thread Gilles Gouaillardet
Matt, There are two ways of using PMIx - if you use mpirun, then the MPI app (e.g. the PMIx client) will talk to mpirun and orted daemons (e.g. the PMIx server) - if you use SLURM srun, then the MPI app will directly talk to the PMIx server provided by SLURM. (note you might have to srun --mpi=pm

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-18 Thread Cabral, Matias A
(vader?). So, if the job is not starting this seems to be a runtime issue rather than transport…. Pmix? slurm? Thanks _MAC From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of Matt Thompson Sent: Friday, January 18, 2019 10:27 AM To: Open MPI Users Subject: Re: [OMPI users] Help

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-18 Thread Matt Thompson
On Fri, Jan 18, 2019 at 1:13 PM Jeff Squyres (jsquyres) via users < users@lists.open-mpi.org> wrote: > On Jan 18, 2019, at 12:43 PM, Matt Thompson wrote: > > > > With some help, I managed to build an Open MPI 4.0.0 with: > > We can discuss each of these params to let you know what they are. > > >

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-18 Thread Jeff Squyres (jsquyres) via users
On Jan 18, 2019, at 12:43 PM, Matt Thompson wrote: > > With some help, I managed to build an Open MPI 4.0.0 with: We can discuss each of these params to let you know what they are. > ./configure --disable-wrapper-rpath --disable-wrapper-runpath Did you have a reason for disabling these? They'

Re: [OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-18 Thread Matt Thompson
All, With some help, I managed to build an Open MPI 4.0.0 with: ./configure --disable-wrapper-rpath --disable-wrapper-runpath --with-psm2 --with-slurm --enable-mpi1-compatibility --with-ucx --with-pmix=/usr/nlocal/pmix/2.1 --with-libevent=/usr CC=icc CXX=icpc FC=ifort The MPI 1 is because I need

[OMPI users] Help Getting Started with Open MPI and PMIx and UCX

2019-01-17 Thread Matt Thompson
Dear Open MPI Gurus, A cluster I use recently updated their SLURM to have support for UCX and PMIx. These are names I've seen and heard often at SC BoFs and posters, but now is my first time to play with them. So, my first question is how exactly should I build Open MPI to try these features out.

Re: [OMPI users] help installing openmpi 3.0 in ubuntu 16.04

2018-03-16 Thread Jeff Squyres (jsquyres)
(Sending this to the users list, not to just the owner of the users list) It looks like you might have installed Open MPI correctly. But you have to give some command line options to mpirun to tell it what to do -- you're basically getting an error saying "you didn't tell me what to do, so I di

Re: [OMPI users] Help debugging invalid read

2018-02-19 Thread Florian Lindner
Ok, I think I have found the problem During std::vector::push_back or emplace_back a realloc happens and thus memory locations that I gave to MPI_Isend become invalid. My loop now reads: std::vector eventSendBuf(eventsSize); // Buffer to hold the MPI_EventData object for (int i = 0; i < e

[OMPI users] Help debugging invalid read

2018-02-19 Thread Florian Lindner
Hello, I am having problems understanding an error valgrind gives me. I tried to bog down the program as much as possible. The original program as well as the test example both work fine, but when I link the created library to another application I get segfaults. I think that this piece of code

Re: [OMPI users] Help with binding processes correctly in Hybrid code (Openmpi +openmp)

2017-11-14 Thread Gilles Gouaillardet
Hi, per https://www2.cisl.ucar.edu/resources/computational-systems/cheyenne/running-jobs/pbs-pro-job-script-examples, you can try #PBS -l select=2:ncpus=16:mpiprocs=2:ompthreads=8 Cheers, Gilles On Tue, Nov 14, 2017 at 4:32 PM, Anil K. Dasanna wrote: > Hello all, > > I am relatively new to

[OMPI users] Help with binding processes correctly in Hybrid code (Openmpi +openmp)

2017-11-14 Thread Anil K. Dasanna
Hello all, I am relatively new to mpi computing. I am doing particle simulations. So far, I only used pure mpi and I never had a problem. But for my system, its the best if one uses hybrid programming. But I always fail to correctly bind all processes and receive binding errors from cluster. Could

Re: [OMPI users] Help

2017-04-27 Thread Gus Correa
On 04/27/2017 06:21 AM, Corina Jeni Tudorache wrote: Hello, I am trying to install Open MPI on Centos and I got stuck. I have installed an GNU compiler and after that I run the command: _yum install openmpi-devel.x86_64. _But when I run command mpi selector –- list I receive this error “mpi: c

Re: [OMPI users] Help

2017-04-27 Thread gilles
PM To: Open MPI Users Subject: Re: [OMPI users] Help by the way, are you running CentOS 5 ? it seems mpi-selector is no more available from CentOS 6 Cheers, Gilles - Original Message - Yes, I write it wrong the previous e-mail

Re: [OMPI users] Help

2017-04-27 Thread Corina Jeni Tudorache
Users Subject: Re: [OMPI users] Help by the way, are you running CentOS 5 ? it seems mpi-selector is no more available from CentOS 6 Cheers, Gilles - Original Message - Yes, I write it wrong the previous e-mail, but actually it does not work. Gives the error message: mpi: command not

Re: [OMPI users] Help

2017-04-27 Thread Corina Jeni Tudorache
It says mpi-selector is not installed. And yes for the mpi-selector command, the error message is command not found. C. From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of gil...@rist.or.jp Sent: Thursday, April 27, 2017 12:32 PM To: Open MPI Users Subject: Re: [OMPI users

Re: [OMPI users] Help

2017-04-27 Thread gilles
From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of gil...@rist.or.jp Sent: Thursday, April 27, 2017 11:34 AM To: Open MPI Users Subject: Re: [OMPI users] Help Hi, that looks like a typo, the command is mpi-selector --list

Re: [OMPI users] Help

2017-04-27 Thread gilles
Subject: Re: [OMPI users] Help Hi, that looks like a typo, the command is mpi-selector --list Cheers, Gilles - Original Message - Hello, I am trying to install Open MPI on Centos and I got

Re: [OMPI users] Help

2017-04-27 Thread Corina Jeni Tudorache
users] Help Hi, that looks like a typo, the command is mpi-selector --list Cheers, Gilles - Original Message - Hello, I am trying to install Open MPI on Centos and I got stuck. I have installed an GNU compiler and after that I run the command: yum install openmpi-devel.x86_64

Re: [OMPI users] Help

2017-04-27 Thread gilles
Hi, that looks like a typo, the command is mpi-selector --list Cheers, Gilles - Original Message - Hello, I am trying to install Open MPI on Centos and I got stuck. I have installed an GNU compiler and after that I run the command: yum install openmpi-devel.x86_64.

[OMPI users] Help

2017-04-27 Thread Corina Jeni Tudorache
Hello, I am trying to install Open MPI on Centos and I got stuck. I have installed an GNU compiler and after that I run the command: yum install openmpi-devel.x86_64. But when I run command mpi selector -- list I receive this error "mpi: command not found" I am following the instruction from

Re: [OMPI users] Help with Open MPI 2.1.0 and PGI 16.10: Configure and C++

2017-03-24 Thread Matt Thompson
Gilles, The library I have having issues linking is ESMF and it is a C++/Fortran application. From http://www.earthsystemmodeling.org/esmf_releases/non_public/ESMF_7_0_0/ESMF_usrdoc/node9.html#SECTION00092000 : The following compilers and utilities *are required* for compiling, linkin

Re: [OMPI users] Help with Open MPI 2.1.0 and PGI 16.10: Configure and C++

2017-03-23 Thread Gilles Gouaillardet
Matt, a C++ compiler is required to configure Open MPI. That being said, C++ compiler is only used if you build the C++ bindings (That were removed from MPI-3) And unless you plan to use the mpic++ wrapper (with or without the C++ bindings), a valid C++ compiler is not required at all. /* configur

Re: [OMPI users] Help with Open MPI 2.1.0 and PGI 16.10: Configure and C++

2017-03-23 Thread Reuti
Hi, Am 22.03.2017 um 20:12 schrieb Matt Thompson: > […] > > Ah. PGI 16.9+ now use pgc++ to do C++ compiling, not pgcpp. So, I hacked > configure so that references to pgCC (nonexistent on macOS) are gone and all > pgcpp became pgc++, but: This is not unique to macOS. pgCC used STLPort STL and

[OMPI users] Help with Open MPI 2.1.0 and PGI 16.10: Configure and C++

2017-03-22 Thread Matt Thompson
All, I'm hoping one of you knows what I might be doing wrong here. I'm trying to use Open MPI 2.1.0 for PGI 16.10 (Community Edition) on macOS. Now, I built it a la: http://www.pgroup.com/userforum/viewtopic.php?p=21105#21105 and found that it built, but the resulting mpifort, etc were just not

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-10-21 Thread Gilles Gouaillardet
Matus, This has very likely been fixed by https://github.com/open-mpi/ompi/pull/2259 Can you download the patch at https://github.com/open-mpi/ompi/pull/2259.patch and apply it manually on v1.10 ? Cheers, Gilles On Monday, August 29, 2016, M. D. wrote: > > Hi, > > I would like to ask - are

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread Gilles Gouaillardet
my bad for the confusion, I misread you and miswrote my reply. I will investigate this again. strictly speaking, the clients can only start after the server first write the port info to a file. if you start the client right after the server start, they might use incorrect/outdated info and cause

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread M. D.
Yes I understand it, but I think, this is exactly that situation you are talking about. In my opinion, the test is doing exactly what you said - when a new player is willing to join, other players must invoke MPI_Comm_accept(). All *other* players must invoke MPI_Comm_accept(). Only the last client

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread Gilles Gouaillardet
here is what the client is doing printf("CLIENT: after merging, new comm: size=%d rank=%d\n", size, rank) ; for (i = rank ; i < num_clients ; i++) { /* client performs a collective accept */ CHK(MPI_Comm_accept(server_port_name, MPI_INFO_NULL, 0, intracomm, &intercomm)

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread M. D.
2016-07-19 10:06 GMT+02:00 Gilles Gouaillardet : > MPI_Comm_accept must be called by all the tasks of the local communicator. > Yes, that's how I understand it. In the source code of the test, all the tasks call MPI_Comm_accept - server and also relevant clients. > so if you > > 1) mpirun -np 1

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread Gilles Gouaillardet
MPI_Comm_accept must be called by all the tasks of the local communicator. so if you 1) mpirun -np 1 ./singleton_client_server 2 1 2) mpirun -np 1 ./singleton_client_server 2 0 3) mpirun -np 1 ./singleton_client_server 2 0 then 3) starts after 2) has exited, so on 1), intracomm is made of 1)

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread M. D.
Hi, thank you for your interest in this topic. So, I normally run the test as follows: Firstly, I run "server" (second parameter is 1): *mpirun -np 1 ./singleton_client_server number_of_clients 1* Secondly, I run corresponding number of "clients" via following command: *mpirun -np 1 ./singleton_c

Re: [OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-19 Thread Gilles Gouaillardet
How do you run the test ? you should have the same number of clients in each mpirun instance, the following simple shell starts the test as i think it is supposed to note the test itself is arguable since MPI_Comm_disconnect() is never invoked (and you will observe some related dpm_base_dis

[OMPI users] Help - Client / server - app hangs in connect/accept by the second or next client that wants to connect to server

2016-07-15 Thread M. D.
Hello, I have a problem with basic client - server application. I tried to run C program from this website https://github.com/hpc/cce-mpi-openmpi-1.7.1/blob/master/orte/test/mpi/singleton_client_server.c I saw this program mentioned in many discussions in your website, so I expected that it should

Re: [OMPI users] Help on Windows

2016-02-23 Thread Walt Brainerd
Thank you, Gilles! It's amazing to get such help. It seems to work when I unplugged the ethernet and have the wireless on, but I will check it out further (including the firewall situation) to pin it down. time mpirun -np 4 ./a Hello from 1 out of 4 images. Hello from

Re: [OMPI users] Help on Windows

2016-02-23 Thread Gilles Gouaillardet
Walt, generally speaking, that kind of things happen when you are using a wireless network and/or a firewall. so i recommend you first try to disconnect all your networks and see how things get improved Cheers, Gilles On 2/24/2016 5:08 AM, Walt Brainerd wrote: I am running up-to-date cyg

[OMPI users] Help on Windows

2016-02-23 Thread Walt Brainerd
I am running up-to-date cygwin on W10 on a 4x i5 processor, including gcc 5.3. I built libcaf_mpi.a provided by OpenCoarrays. $ cat hello.f90 program hello implicit none print *, "Hello from", this_image(), & "out of", num_images(), "images." end program hello I compiled the

Re: [OMPI users] Help with OpenMPI and Univa Grid Engine

2016-02-09 Thread Rahul Pisharody
Hello Ralph, Dave, Thank you for your suggestions. Let me check on the nfs mounts. The problem is I am not the grid administrator. I'm working with the grid administrator to get it resolved. If I had my way, I would be probably using Sun Grid. Thank you Dave for pointing out something that I had

Re: [OMPI users] Help with OpenMPI and Univa Grid Engine

2016-02-09 Thread Dave Love
Rahul Pisharody writes: > Hello all, > > I'm trying to get a simple program (print the hostname of the executing > machine) compiled with openmpi run across multiple machines on Univa Grid > Engine. > > This particular configuration has many of the ports blocked. My run command > has the mca opti

Re: [OMPI users] Help with OpenMPI and Univa Grid Engine

2016-02-08 Thread Ralph Castain
Is your OMPI installed on an NFS partition? If so, is it in the same mount point on all nodes? Most likely problem is that the required libraries were not found on the remote node > On Feb 8, 2016, at 10:45 AM, Rahul Pisharody wrote: > > Hello all, > > I'm trying to get a simple program (pr

[OMPI users] Help with OpenMPI and Univa Grid Engine

2016-02-08 Thread Rahul Pisharody
Hello all, I'm trying to get a simple program (print the hostname of the executing machine) compiled with openmpi run across multiple machines on Univa Grid Engine. This particular configuration has many of the ports blocked. My run command has the mca options necessary to limit the ports to the

Re: [OMPI users] Help with Binding in 1.8.8: Use only second socket

2015-12-21 Thread Saliya Ekanayake
I tried the following with OpenMPI 1.8.1 and 1.10.1. The both worked. In my case a node has 2 sockets like yours, but each socket has 12 cores and lstopo showed core numbers for the second socket are from 12 to 23. * mpirun --report-bindings --bind-to core --cpu-set 12,13,14,15,16,17,18,19 -np 8 j

Re: [OMPI users] Help with Binding in 1.8.8: Use only second socket

2015-12-21 Thread Matt Thompson
Ralph, Huh. That isn't in the Open MPI 1.8.8 mpirun man page. It is in Open MPI 1.10, so I'm guessing someone noticed it wasn't there. Explains why I didn't try it out. I'm assuming this option is respected on all nodes? Note: a SmarterManThanI™ here at Goddard thought up this: #!/bin/bash rank=

Re: [OMPI users] Help with Binding in 1.8.8: Use only second socket

2015-12-21 Thread Ralph Castain
Try adding —cpu-set a,b,c,… where the a,b,c… are the core id’s of your second socket. I’m working on a cleaner option as this has come up before. > On Dec 21, 2015, at 5:29 AM, Matt Thompson > wrote: > > Dear Open MPI Gurus, > > I'm currently trying to do something

[OMPI users] Help with Binding in 1.8.8: Use only second socket

2015-12-21 Thread Matt Thompson
Dear Open MPI Gurus, I'm currently trying to do something with Open MPI 1.8.8 that I'm pretty sure is possible, but I'm just not smart enough to figure out. Namely, I'm seeing some odd GPU timings and I think it's because I was dumb and assumed the GPU was on the PCI bus next to Socket #0 as some

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-30 Thread Jeff Squyres (jsquyres)
On Nov 24, 2015, at 9:31 AM, Dave Love wrote: > >> btw, we already use the force, thanks to the ob1 pml and the yoda spml > > I think that's assuming familiarity with something which leaves out some > people... FWIW, I agree: we use unhelpful names for components in Open MPI. What Gilles is s

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-24 Thread Dave Love
Gilles Gouaillardet writes: > Currently, ompi create a file in the temporary directory and then mmap it. > an obvious requirement is the temporary directory must have enough free > space for that file. > (this might be an issue on some disk less nodes) > > > a simple alternative could be to try /

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-20 Thread Gilles Gouaillardet
Currently, ompi create a file in the temporary directory and then mmap it. an obvious requirement is the temporary directory must have enough free space for that file. (this might be an issue on some disk less nodes) a simple alternative could be to try /tmp, and if there is not enough space, try

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-20 Thread Dave Love
Jeff Hammond writes: >> Doesn't mpich have the option to use sysv memory? You may want to try that >> >> > MPICH? Look, I may have earned my way onto Santa's naughty list more than > a few times, but at least I have the decency not to post MPICH questions to > the Open-MPI list ;-) > > If there

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-20 Thread Dave Love
[There must be someone better to answer this, but since I've seen it:] Jeff Hammond writes: > I have no idea what this is trying to tell me. Help? > > jhammond@nid00024:~/MPI/qoit/collectives> mpirun -n 2 ./driver.x 64 > [nid00024:00482] [[46168,0],0] ORTE_ERROR_LOG: Not found in file > ../../.

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Jeff Hammond
On Thu, Nov 19, 2015 at 4:11 PM, Howard Pritchard wrote: > Hi Jeff H. > > Why don't you just try configuring with > > ./configure --prefix=my_favorite_install_dir > --with-libfabric=install_dir_for_libfabric > make -j 8 install > > and see what happens? > > That was the first thing I tried. Howe

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Howard Pritchard
Hi Jeff, I finally got an allocation on cori - its one busy machine. Anyway, using the ompi i'd built on edison with the above recommended configure options I was able to run using either srun or mpirun on cori provided that in the later case I used mpirun -np X -N Y --mca plm slurm ./my_favorit

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Howard Pritchard
Hi Jeff H. Why don't you just try configuring with ./configure --prefix=my_favorite_install_dir --with-libfabric=install_dir_for_libfabric make -j 8 install and see what happens? Make sure before you configure that you have PrgEnv-gnu or PrgEnv-intel module loaded. Those were the configure/com

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Jeff Hammond
> > > How did you configure for Cori? You need to be using the slurm plm > component for that system. I know this sounds like gibberish. > > ../configure --with-libfabric=$HOME/OFI/install-ofi-gcc-gni-cori \ --enable-mca-static=mtl-ofi \ --enable-mca-no-build=btl-openib,

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Howard
Hi Jeff How did you configure for Cori? You need to be using the slurm plm component for that system. I know this sounds like gibberish. There should be a with-slurm configure option to pick up this component. Doesn't mpich have the option to use sysv memory? You may want to try that Oh

Re: [OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Martin Siegert
Hi Jeff, On Thu 19.11.2015 09:44:20 Jeff Hammond wrote: > I have no idea what this is trying to tell me. Help? > > jhammond@nid00024:~/MPI/qoit/collectives> mpirun -n 2 ./driver.x 64 > [nid00024:00482] [[46168,0],0] ORTE_ERROR_LOG: Not found in file > ../../../../../orte/mca/plm/alps/plm_alps_mo

[OMPI users] help understand unhelpful ORTE error message

2015-11-19 Thread Jeff Hammond
I have no idea what this is trying to tell me. Help? jhammond@nid00024:~/MPI/qoit/collectives> mpirun -n 2 ./driver.x 64 [nid00024:00482] [[46168,0],0] ORTE_ERROR_LOG: Not found in file ../../../../../orte/mca/plm/alps/plm_alps_module.c at line 418 I can run the same job with srun without incide

Re: [OMPI users] Help with Specific Binding

2015-09-13 Thread Ralph Castain
Checkout the man page “OMPI_Affinity_str” for an MPI extension that might help > On Sep 13, 2015, at 7:28 AM, Saliya Ekanayake wrote: > > Thank you, I'll try this. Also, is there a way to know which core a process > is bound to within the program other than executing something like taskset >

Re: [OMPI users] Help with Specific Binding

2015-09-13 Thread Gilles Gouaillardet
on linux, you can look at /proc/self/status and search allowed_cpus_list or you can use the sched_getaffinity system call note that in some (hopefully rare)cases, this will return different results than hwloc On Sunday, September 13, 2015, Saliya Ekanayake wrote: > Thank you, I'll try this. Als

Re: [OMPI users] Help with Specific Binding

2015-09-13 Thread Saliya Ekanayake
Thank you, I'll try this. Also, is there a way to know which core a process is bound to within the program other than executing something like taskset from program? On Sun, Sep 13, 2015 at 10:05 AM, Ralph Castain wrote: > Actually, the error was correct - it was me that was incorrect. The > cor

Re: [OMPI users] Help with Specific Binding

2015-09-13 Thread Ralph Castain
Actually, the error was correct - it was me that was incorrect. The correct set of options would be: —map-by ppr:12_node —bind-to core —cpu-set=0,2,4,… Sorry about the confusion > On Sep 13, 2015, at 2:43 AM, Ralph Castain wrote: > > The rankfile will certainly do it, but that error is a bug

  1   2   3   4   >