Re: [OMPI users] Segfault in mca_odls_default.so with > ~100 process.

2010-02-27 Thread Ralph Castain
I modified the patch (it missed a few places, needed some minor changes in implementation, etc.) and committed it to the developer's trunk. I'll check with the release managers to see if this is something they want in the 1.4 series, or if they would rather defer it to the 1.5 series due out soon. Than

Re: [OMPI users] Segfault in mca_odls_default.so with > ~100 process.

2010-02-27 Thread Oliver Ford
Ralph Castain wrote: Yeah, the system won't like this. Your approach makes it look like you are launching 136 app_contexts. We currently only support up to 128 app_contexts. I don't think anyone anticipated somebody trying to use the system this way. I can expand the number to something larger

Re: [OMPI users] Segfault in mca_odls_default.so with > ~100 process.

2010-02-27 Thread Ralph Castain
Yeah, the system won't like this. Your approach makes it look like you are launching 136 app_contexts. We currently only support up to 128 app_contexts. I don't think anyone anticipated somebody trying to use the system this way. I can expand the number to something larger. Will have to see how

[OMPI users] Segfault in mca_odls_default.so with > ~100 process.

2010-02-26 Thread Oliver Ford
I am trying to run an MPI code across 136 processes using an appfile (attached), since every process needs to be run with a host/process-dependent parameter. This whole system works wonderfully for up to around 100 processes but usually fails with a segfault, apparently in mca_odls_default.s
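
For readers hitting the same problem, here is a minimal sketch of how one might check an appfile against the 128 app_context limit quoted in the replies. It assumes each non-blank, non-comment line of the appfile counts as one app_context, which matches the description in this thread. The attached appfile is not reproduced here, so the program name `./my_mpi_code`, the host names, and the helper functions are all hypothetical.

```python
# Limit quoted in this thread for the Open MPI version under discussion.
# Other versions may differ, so treat this value as an assumption.
MAX_APP_CONTEXTS = 128


def count_app_contexts(appfile_text):
    """Count appfile lines that are neither blank nor '#' comments."""
    count = 0
    for line in appfile_text.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith("#"):
            count += 1
    return count


def build_appfile(hosts, program="./my_mpi_code"):
    """Build a hypothetical appfile: one process per line, each given
    its own host and a per-process parameter (here just its index)."""
    lines = [
        f"-np 1 --host {host} {program} {index}"
        for index, host in enumerate(hosts)
    ]
    return "\n".join(lines)


# 136 hypothetical hosts, mirroring the process count in the report.
appfile = build_appfile([f"node{i:03d}" for i in range(136)])
n_contexts = count_app_contexts(appfile)

if n_contexts > MAX_APP_CONTEXTS:
    print(f"{n_contexts} app_contexts exceeds the limit of {MAX_APP_CONTEXTS}")
else:
    print(f"{n_contexts} app_contexts is within the limit")
```

A check like this only flags the condition described in the replies. It does not work around it; the fix discussed in the thread is a change to Open MPI itself.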