n, but it seems to be
> something triggered by your specific setup.
>
>
> On Jun 12, 2014, at 8:48 AM, Dan Dietz wrote:
>
>> Unfortunately, the nightly tarball appears to be crashing in a similar
>> fashion. :-( I used the latest snapshot 1.8.2a1r31981.
>>
>>
Dan
>>>
>>> On Wed, Jun 11, 2014 at 9:37 AM, Ralph Castain wrote:
>>>> Afraid I'm a little confused now - are you saying it works fine under
>>>> Torque, but segfaults under rsh? Could you please clarify your current
>>>> situation?
>>>
rsh? Could you please clarify your current situation?
>
>
> On Jun 11, 2014, at 6:27 AM, Dan Dietz wrote:
>
>> It looks like it is still segfaulting with the rsh launcher:
>>
>> ddietz@conte-a084:/scratch/conte/d/ddietz/hello$ mpirun -mca plm rsh
>> -np 4 -m
ill allow it, this will let me see if the problem is somewhere in the Torque
> launcher or elsewhere in OMPI.
>
> Thanks
> Ralph
>
> On Jun 6, 2014, at 12:48 PM, Dan Dietz wrote:
>
>> No problem -
>>
>> These are model name : Intel(R) Xeon(R) CPU E5-2670 0 @
Ack - that was my fault. Too early on a Monday morning. This seems to
work perfectly when I correctly submit a job! Thanks!
Dan
On Mon, Jun 9, 2014 at 9:34 AM, Dan Dietz wrote:
> Yes, you're exactly right - this system has 2 Phi cards per node. I
> believe the "PCI 8086"
ere in the Torque
> launcher or elsewhere in OMPI.
>
> Thanks
> Ralph
>
> On Jun 6, 2014, at 12:48 PM, Dan Dietz wrote:
>
>> No problem -
>>
>> These are model name : Intel(R) Xeon(R) CPU E5-2670 0 @ 2.60GHz chips.
>> 2 per node, 8 cores each. No thre
ntain of output.
>
> I'll look into the segfault - hard to understand offhand, but could be an
> uninitialized variable. If you have a chance, could you rerun that test with
> "-mca plm_base_verbose 10" on the cmd line?
>
> Thanks again
> Ralph
>
> On Jun 6
main.c:13
ddietz@conte-a009:/scratch/conte/d/ddietz/hello$ cat nodes
conte-a009
conte-a009
conte-a055
conte-a055
ddietz@conte-a009:/scratch/conte/d/ddietz/hello$ uname -r
2.6.32-358.14.1.el6.x86_64
On Thu, Jun 5, 2014 at 7:54 PM, Ralph Castain wrote:
>
> On Jun 5, 2014, at 2:13 PM, Dan Diet
ile simply contains the first two lines of my original
$PBS_NODEFILE provided by Torque. See above for why I modified it. Works fine
if I use the full file.
Thanks in advance for any pointers you all may have!
Dan
--
Dan Dietz
Scientific Applications Analyst
ITaP Research Computing, Purdue University