On 10/30/2015 02:26 AM, Jeff Squyres (jsquyres) wrote:
Your Open MPI build looks good -- it seems to have found all the right libnuma
stuff during configured, etc.
The error message you're seeing indicates that the embedded hwloc is telling
Open MPI that it doesn't have binding support, which
Yes, I think that test is valid. 1.10.0 is a good version to test.
Ralph: it all looks good to me, but that message is coming from ORTE. Can you
have a look?
Sent from my phone. No type good.
> On Oct 30, 2015, at 12:33 AM, Fabian Wein wrote:
>
>> On 10/30/2015 02:26 AM, Jeff Squyres (jsquy
Fabian Wein writes:
> Is this a valid test?
>
>
> /opt/openmpi-1.10.0-gcc/bin/mpiexec -n 4 hostname
> leo
> leo
> leo
> leo
So, unless you turned off the default binding -- to socket? check the
mpirun man page -- it worked, but the "numa" level failed. I don't know
if that level has to exist, a
I think Dave has probably hit the problem - that node may well not have a
“numa” object. You also might check that node “leo” has libnuma on it
> On Oct 30, 2015, at 6:48 AM, Dave Love wrote:
>
> Fabian Wein writes:
>
>> Is this a valid test?
>>
>>
>> /opt/openmpi-1.10.0-gcc/bin/mpiexec -n
On 10/30/2015 02:48 PM, Dave Love wrote:
Fabian Wein writes:
Is this a valid test?
/opt/openmpi-1.10.0-gcc/bin/mpiexec -n 4 hostname
leo
leo
leo
leo
So, unless you turned off the default binding -- to socket? check the
mpirun man page -- it worked, but the "numa" level failed. I don't kno
On 10/30/2015 04:09 PM, Ralph Castain wrote:
I think Dave has probably hit the problem - that node may well not have a
“numa” object. You also might check that node “leo” has libnuma on it
I have a shared memory system, I run on the machine I build the stuff.
Fabian
He's building and running on a single server (leo). From the configure output,
all the numa libs and headers are available on this leo server.
> On Oct 30, 2015, at 11:09 AM, Ralph Castain wrote:
>
> I think Dave has probably hit the problem - that node may well not have a
> “numa” object. Y
I honestly have no ideas…best I can see, it looks like hwloc feels that it
cannot perform that operation and returns an error.
> On Oct 30, 2015, at 1:31 PM, Jeff Squyres (jsquyres)
> wrote:
>
> He's building and running on a single server (leo). From the configure
> output, all the numa li
Brice -- I know this started on the hwloc list and then bounced over here, but
we're running out of ideas.
Got any clue what is happening here? From the OMPI config logs that Fabian
sent, it looks like hwloc built with libnuma support properly...?
> On Oct 30, 2015, at 4:34 PM, Ralph Castain
Oh, that's an interesting idea: perhaps the "bind to numa" is failing -- but
perhaps "bind to socket" would work.
Can you try:
/opt/openmpi-1.10.0-gcc/bin/mpiexec -bind-to numa -n 4 hostname
and
/opt/openmpi-1.10.0-gcc/bin/mpiexec -bind-to socket -n 4 hostname
> On Oct 30, 2015, at 12:02 P
10 matches
Mail list logo