[OMPI users] Signal code: Non-existant physical address (2)

2020-07-02 Thread Prentice Bisbal via users
I manage a very heterogeneous cluster. I have nodes of different ages with different processors, different amounts of RAM, etc. One user is reporting that on certain nodes, his jobs keep crashing with the errors below. His application is using OpenMPI 1.10.3, which I know is an ancient version

Re: [OMPI users] Unable to run MPI application

2020-07-02 Thread Peter Kjellström via users
On Thu, 2 Jul 2020 10:27:51 + "CHESTER, DEAN \(PGR\) via users" wrote: > The permissions were incorrect! > > For our old installation of OMPI 1.10.6 it didn’t complain which is > strange. Then that did not use PSM and as such had horrible performance :-( /Peter K

Re: [OMPI users] Unable to run MPI application

2020-07-02 Thread CHESTER, DEAN (PGR) via users
The permissions were incorrect! For our old installation of OMPI 1.10.6 it didn’t complain which is strange. Thanks for the help. Dean > On 2 Jul 2020, at 11:01, Peter Kjellström wrote: > > On Thu, 2 Jul 2020 08:38:51 + > "CHESTER, DEAN \(PGR\) via users" wrote: > >> I tried this ag

Re: [OMPI users] Unable to run MPI application

2020-07-02 Thread Peter Kjellström via users
On Thu, 2 Jul 2020 08:38:51 + "CHESTER, DEAN \(PGR\) via users" wrote: > I tried this again and it resulted in the same error: > nymph3.29935PSM can't open /dev/ipath for reading and writing (err=23) > nymph3.29937PSM can't open /dev/ipath for reading and writing (err=23) > nymph3.29936PSM c

Re: [OMPI users] Unable to run MPI application

2020-07-02 Thread CHESTER, DEAN (PGR) via users
I tried this again and it resulted in the same error: nymph3.29935PSM can't open /dev/ipath for reading and writing (err=23) nymph3.29937PSM can't open /dev/ipath for reading and writing (err=23) nymph3.29936PSM can't open /dev/ipath for reading and writing (err=23) ---