Not without a list of hostnames it's not any use no, if you can get that, then I have something to work with. From looking around on google -n might help here. Once I have this info you'll need to verify that you are able to ssh to these nodes without a password, that pdsh is installed and give me the names of an environment variable that pbs sets for ranks within a job.
I'm sure we can get something working but it might be better to take this off-list or to the padb-users list to avoid spamming the Open-MPI users list. Ashley. On 29 Oct 2010, at 18:44, Jack Bryan wrote: > Hi, > > this is what I got : > > -bash-3.2$ qstat -n -u myName > > clsuter: > > Req'd Req'd Elap > Job ID Username Queue Jobname SessID NDS TSK > Memory Time S Time > -------------------- -------- -------- ---------------- ------ ----- --- > ------ ----- - ----- > 48933.cluster.e myName devel myJob 107835 1 -- -- > 00:02 C 00:00 > n20/0 > > Any help is appreciated. > > thanks > > > From: ash...@pittman.co.uk > > Date: Fri, 29 Oct 2010 18:38:25 +0100 > > To: us...@open-mpi.org > > Subject: Re: [OMPI users] open MPI please recommend a debugger for open MPI > > > > > > Can you try the following and send me the output. > > > > qstat -n -u `whoami` @clusterName > > > > The output sent before implies that your cluster is called "clusterName" > > rather than "cluster" which is a little surprising but let's see what it > > gives us if we query on that basis. > > > > Ashley. > > > > On 29 Oct 2010, at 18:29, Jack Bryan wrote: > > > > > thanks > > > > > > I have run padb (the new one with your patch) on my system and got : > > > > > > -bash-3.2$ padb -Ormgr=pbs -Q 48516.cluster > > > $VAR1 = {}; > > > Job 48516.cluster is not active > > > > > > Actually, the job is running. > > > > > > How to check whether my system has pbs_pro ? > > > > > > Any help is appreciated. > > > > > > thanks > > > Jinxu Ding > > > > > > Oct. 29 2010 > > > > > > > > > > From: ash...@pittman.co.uk > > > > Date: Fri, 29 Oct 2010 18:21:46 +0100 > > > > To: us...@open-mpi.org > > > > Subject: Re: [OMPI users] open MPI please recommend a debugger for open > > > > MPI > > > > > > > > > > > > On 29 Oct 2010, at 12:06, Jeremy Roberts wrote: > > > > > > > > > I'd suggest looking into TotalView (http://www.totalviewtech.com) > > > > > and/or DDT (http://www.allinea.com/). I've used TotalView pretty > > > > > extensively and found it to be pretty easy to use. They are both > > > > > commercial, however, and not cheap. > > > > > > > > > > As far as I know, there isn't a whole lot of open source support for > > > > > parallel debugging. The Parallel Tools Platform of Eclipse claims to > > > > > provide a parallel debugger, though I have yet to try it > > > > > (http://www.eclipse.org/ptp/). > > > > > > > > Jeremy has covered the graphical parallel debuggers that I'm aware of, > > > > for a different approach there is padb which isn't a "parallel > > > > debugger" in the traditional model but is able to show you the same > > > > type of information, it won't allow you to point-and-click through the > > > > source or single step through the code but it is lightweight and will > > > > show you the information which you need to know. > > > > > > > > Padb needs to integrate with the resource manager, I know it works with > > > > pbs_pro but it seems there are a few issues on your system which is pbs > > > > (without the pro). I can help you with this and work through the > > > > problems but only if you work with me and provide details of the > > > > integration, in particular I've sent you a version which has a small > > > > patch and some debug printfs added, if you could send me the output > > > > from this I'd be able to tell you if it was likely to work and how to > > > > go about making it do so. > > > > > > > > Ashley. > > > > > > > > -- > > > > > > > > Ashley Pittman, Bath, UK. > > > > > > > > Padb - A parallel job inspection tool for cluster computing > > > > http://padb.pittman.org.uk > > > > > > > > > > > > _______________________________________________ > > > > users mailing list > > > > us...@open-mpi.org > > > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > _______________________________________________ > > > users mailing list > > > us...@open-mpi.org > > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > -- > > > > Ashley Pittman, Bath, UK. > > > > Padb - A parallel job inspection tool for cluster computing > > http://padb.pittman.org.uk > > > > > > _______________________________________________ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Ashley Pittman, Bath, UK. Padb - A parallel job inspection tool for cluster computing http://padb.pittman.org.uk