Hi,
Would you please recommend a debugger that can debug parallel
processes on Open MPI systems?
I hope that it can be installed without root privileges, because I am not a
root user on our MPI cluster.
Any help is appreciated.
Thanks
Jack
Oct. 28 2010
Hello,
>
> I am still trying to understand the parallelized version of the 2D heat
> equation solver that we saw at school. In order to explain my problem, I
> need to list the main code:
>
> 9 program heat
> 10
> !
Hi,
I tried to build Open MPI 1.5 on Solaris X86 and x86_64 with Oracle
Studio 12.2. I can compile Open MPI with thread support, but I can
only partly install it because "libtool" will not find "f95" although
it is available. "make check" shows no failures.
tyr openmpi-1.5-SunOS.x86_64.32_cc 188
Hello. I'm trying to build orcm; as a dependency it needs the Open MPI trunk
with some options enabled.
The install fails with this message:
Creating orte-migrate.1 man page...
x86_64-pc-linux-gnu-gcc -DHAVE_CONFIG_H -I. -I../../../opal/include
-I../../../orte/include
-I../../../opal/mca/paffinity/hwloc/
Sorry, but can you give us the config line, the config.log and the
full output of make, preferably with make V=1?
--td
On 10/29/2010 04:30 AM, Siegmar Gross wrote:
Hi,
I tried to build Open MPI 1.5 on Solaris X86 and x86_64 with Oracle
Studio 12.2. I can compile Open MPI with thread support,
I'd suggest looking into TotalView (http://www.totalviewtech.com) and/or DDT
(http://www.allinea.com/). I've used TotalView pretty extensively and found
it to be pretty easy to use. They are both commercial, however, and not
cheap.
As far as I know, there isn't a whole lot of open source support
Couple of things stand out:
1. you definitely don't want to use a copy of the trunk beyond r23924. The
developer's trunk is undergoing some major change and orcm no longer is in-sync
with it. I probably won't update orcm to match until later this year (will
freeze integration at r23924).
2. th
I find that using mpirun to launch multiple instances of a serial
debugger is fairly usable (but not perfect) for jobs with fewer than
about four processes.
A description of how to do this is here:
http://www.open-mpi.org/faq/?category=debugging
The biggest drawbacks to this approach are that
a) se
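For readers skimming this thread, the mpirun-plus-serial-debugger approach mentioned above can be sketched roughly as follows. This is a minimal sketch, not the poster's exact setup: it assumes `xterm` and `gdb` are installed on the nodes and that X forwarding works, and `debug_cmd`, the process count, and `./my_mpi_app` are hypothetical placeholders. The script only prints the command line so it can be reviewed before being run by hand.

```shell
#!/bin/sh
# Build the mpirun command line that starts one gdb per rank,
# each inside its own xterm window.
# debug_cmd is a hypothetical helper; $1 is the number of processes,
# $2 is the path to the MPI executable (both placeholders).
debug_cmd() {
    np="$1"
    app="$2"
    echo "mpirun -np $np xterm -e gdb $app"
}

# Print the command for a 4-process job. Run the printed line manually
# once DISPLAY is set and xterm/gdb are available on the compute nodes.
debug_cmd 4 ./my_mpi_app
```

With more than a handful of ranks the number of windows becomes hard to manage, which matches the "fewer than about four processes" remark above.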
Hi list,
I'm sorry to bother you with a stupid question.
We intend to have, for a short period of time, some nodes with 1.4.3 and
others with 1.4.1 (before upgrading everyone to 1.4.3).
I made various tests and found both versions to be running togethe
I'd have to go check 1.4.3 and 1.4.1 to be sure, but I would generally *NOT*
assume that different versions like this are compatible.
Open MPI makes an ABI promise (that started with version 1.3.2) that all the
releases in a given feature series and its corresponding super-stable series
(i.e.,
christophe petit wrote:
I am still trying to understand the parallelized version of the 2D heat
equation solver that we saw at school.
I am confused between the shift of the values near to the bounds done
by the "updateBound" routine and the main loop (at line 161 in main
code) which c
Hi,
Am 29.10.2010 um 18:27 schrieb Jeff Squyres:
> I'd have to go check 1.4.3 and 1.4.1 to be sure, but I would generally *NOT*
> assume that different versions like this are compatible.
I'm getting confused, as these versions are exactly fitting "x.(y+1).*" which
you mention below. So they sh
On Oct 29, 2010, at 12:40 PM, Reuti wrote:
>> I'd have to go check 1.4.3 and 1.4.1 to be sure, but I would generally *NOT*
>> assume that different versions like this are compatible.
>
> I'm getting confused, as these versions are exactly fitting "x.(y+1).*" which
> you mention below. So they s
I guess we will play it safe and upgrade every cluster at once so that
we won't get bad surprises.
thank you Jeff.
On 10/29/2010 06:40 PM, Reuti wrote:
> Hi,
>
> Am 29.10.2010 um 18:27 schrieb Jeff Squyres:
>
>> I'd have to go check 1.4.3 and 1.4.1
Am 29.10.2010 um 18:47 schrieb Jeff Squyres:
> On Oct 29, 2010, at 12:40 PM, Reuti wrote:
>
>>> I'd have to go check 1.4.3 and 1.4.1 to be sure, but I would generally
>>> *NOT* assume that different versions like this are compatible.
>>
>> I'm getting confused, as these versions are exactly fit
On 29 Oct 2010, at 12:06, Jeremy Roberts wrote:
> I'd suggest looking into TotalView (http://www.totalviewtech.com) and/or DDT
> (http://www.allinea.com/). I've used TotalView pretty extensively and found
> it to be pretty easy to use. They are both commercial, however, and not
> cheap.
>
thanks
I have run padb (the new one with your patch) on my system and got:
-bash-3.2$ padb -Ormgr=pbs -Q 48516.cluster
$VAR1 = {};
Job 48516.cluster is not active
Actually, the job is running.
How to check whether my system has pbs_pro?
Any help is appreciated.
thanks
Jinxu Ding
Oct. 29 2010
> Fro
Can you try the following and send me the output.
qstat -n -u `whoami` @clusterName
The output sent before implies that your cluster is called "clusterName" rather
than "cluster" which is a little surprising but let's see what it gives us if
we query on that basis.
Ashley.
On 29 Oct 2010, at
Hi,
this is what I got :
-bash-3.2$ qstat -n -u myName
clsuter:
                                                     Req'd   Req'd     Elap
Job ID  Username  Queue  Jobname  SessID  NDS  TSK  Memory  Time   S  Time
No, without a list of hostnames it's not any use; if you can get that, then
I have something to work with. From looking around on Google, -n might help
here. Once I have this info you'll need to verify that you are able to ssh to
these nodes without a password, that pdsh is installed and gi