I manage a very heterogeneous cluster. I have nodes of different ages
with different processors, different amounts of RAM, etc. One user is
reporting that on certain nodes, his jobs keep crashing with the errors
below. His application is using OpenMPI 1.10.3, which I know is an
ancient version
On Thu, 2 Jul 2020 10:27:51 +
"CHESTER, DEAN \(PGR\) via users" wrote:
> The permissions were incorrect!
>
> For our old installation of OMPI 1.10.6 it didn’t complain which is
> strange.
Then that did not use PSM and as such had horrible performance :-(
/Peter K
The permissions were incorrect!
For our old installation of OMPI 1.10.6 it didn’t complain which is strange.
Thanks for the help.
Dean
> On 2 Jul 2020, at 11:01, Peter Kjellström wrote:
>
> On Thu, 2 Jul 2020 08:38:51 +
> "CHESTER, DEAN \(PGR\) via users" wrote:
>
>> I tried this ag
On Thu, 2 Jul 2020 08:38:51 +
"CHESTER, DEAN \(PGR\) via users" wrote:
> I tried this again and it resulted in the same error:
> nymph3.29935PSM can't open /dev/ipath for reading and writing (err=23)
> nymph3.29937PSM can't open /dev/ipath for reading and writing (err=23)
> nymph3.29936PSM c
I tried this again and it resulted in the same error:
nymph3.29935PSM can't open /dev/ipath for reading and writing (err=23)
nymph3.29937PSM can't open /dev/ipath for reading and writing (err=23)
nymph3.29936PSM can't open /dev/ipath for reading and writing (err=23)
---