Re: [OMPI users] Distribute app over open mpi

2009-11-06 Thread Ralph Castain
I'm afraid we don't position binaries for you, though we have talked about providing that capability. I have some code in the system that - will- do it for certain special circumstances, but not in the general case, and much of the required infrastructure exists in the code today. I'm not su

Re: [OMPI users] an environment variable with same meaning than the -x option of mpiexec

2009-11-06 Thread Ralph Castain
Not at the moment - though I imagine we could create one. It is a tad tricky in that we allow multiple -x options on the cmd line, but we obviously can't do that with an envar. The most likely solution would be to specify multiple "-x" equivalents by separating them with a comma in the enva

Re: [OMPI users] Programming Help needed

2009-11-06 Thread amjad ali
Thanks Dear, Jonathan seems almost perfect; I percieve the same. On Fri, Nov 6, 2009 at 6:17 PM, Tom Rosmond wrote: > AMJAD > > On your first question, the answer is probably, if everything else is > done correctly. The first test is to not try to do the overlapping > communication and computa

Re: [OMPI users] Programming Help needed

2009-11-06 Thread Tom Rosmond
AMJAD On your first question, the answer is probably, if everything else is done correctly. The first test is to not try to do the overlapping communication and computation, but do them sequentially and make sure the answers are correct. Have you done this test? Debugging your original approach

Re: [OMPI users] Programming Help needed

2009-11-06 Thread Jonathan Dursi
Hi, Amjad: [...] What I do is that I start non blocking MPI communication at the partition boundary faces (faces shared between any two processors) , and then start computing values on the internal/non-shared faces. When I complete this computation, I put WAITALL to ensure MPI communication

[OMPI users] Programming Help needed

2009-11-06 Thread amjad ali
Hi all, I need/request some help from those who have some experience in debugging/profiling/tuning parallel scientific codes, specially for PDEs/CFD. I have parallelized a Fortran CFD code to run on Ethernet-based-Linux-Cluster. Regarding MPI communication what I do is that: Suppose that the gri

Re: [OMPI users] mpirun example program fail on multiple nodes- unable to launch specified application on client node

2009-11-06 Thread qing.pang
This sounds like what I need exactly. But when I tried it, I got the following problem - I'm working on my Desktop, and trying to preload the executable to my laptop. -- gordon@gordon-desktop:~/Desktop/openmpi-1.3.3/examples$

[OMPI users] Distribute app over open mpi

2009-11-06 Thread Arnaud Westenberg
Hi all, Sorry for the newbie question, but I'm having a hard time finding the answer, as I'm not even familiar with the terminology... I've setup a small cluster on Ubuntu (hardy) and everything is working great, including slurm etc. If I run the well known 'Pi' program I get the proper result

[OMPI users] mpirun noticed that process rank 1 ... exited on signal 13 (Broken pipe).

2009-11-06 Thread Kritiraj Sajadah
Hi Everyone, I have install openmpi 1.3 and blcr 0.81 on my laptop (single processor). I am trying to checkpoint a small test application: ### #include #include #include #include #include int main(int argc, char **argv) { int rank,size; MPI_Init(&argc, &argv); MPI_Comm

[OMPI users] an environment variable with same meaning than the -x option of mpiexec

2009-11-06 Thread Paul Kapinos
Dear OpenMPI developer, with the -x option of mpiexec there is a way to distribute environmnet variables: -x Export the specified environment variables to the remote nodes before executing the program. Is there an environment variable ( OMPI_) with the same mean

Re: [OMPI users] Segmentation fault whilst running RaXML-MPI

2009-11-06 Thread Nick Holway
Hi, Thank you for the information, I'm going to try the new Intel Compilers which I'm downloading now, but as they're taking so long to download I don't think I'm going to be able to look into this again until after the weekend. BTW using their java-based downloader is a bit less painful than thei

Re: [OMPI users] Question about checkpoint/restart protocol

2009-11-06 Thread Josh Hursey
On Nov 5, 2009, at 4:46 AM, Mohamed Adel wrote: Dear Sergio, Thank you for your reply. I've inserted the modules into the kernel and it all worked fine. But there is still a weired issue. I use the command "mpirun -n 2 -am ft-enable-cr -H comp001 checkpoint-restart- test" to start the an

Re: [OMPI users] checkpoint opempi-1.3.3+sge62

2009-11-06 Thread Josh Hursey
On Oct 28, 2009, at 7:41 AM, Sergio Díaz wrote: Hello, I have achieved the checkpoint of an easy program without SGE. Now, I'm trying to do the integration openmpi+sge but I have some problems... When I try to do checkpoint of the mpirun PID, I got an error similar to the error gotten wh

Re: [OMPI users] problems with checkpointing an mpi job

2009-11-06 Thread Josh Hursey
On Oct 30, 2009, at 1:35 PM, Hui Jin wrote: Hi All, I got a problem when trying to checkpoint a mpi job. I will really appreciate if you can help me fix the problem. the blcr package was installed successfully on the cluster. I configure the ompenmpi with flags, ./configure --with-ft=cr --enabl

Re: [OMPI users] problem using openmpi with DMTCP

2009-11-06 Thread Josh Hursey
(Sorry for the excessive delay in replying) I do not have any experience with the DMTCP project, so I can only speculate on what might be going on here. If you are using DMTCP to transparently checkpoint Open MPI you will need to make sure that you are not using any other interconnect other

Re: [OMPI users] Changing location where checkpoints are saved

2009-11-06 Thread Josh Hursey
(Sorry for the excessive delay in replying) On Sep 30, 2009, at 11:02 AM, Constantinos Makassikis wrote: Thanks for the reply! Concerning the mca options for checkpointing: - are verbosity options (e.g.: crs_base_verbose) limited to 0 and 1 values ? - in priority options (e.g.: crs_blcr_prio

Re: [OMPI users] mpirun example program fail on multiple nodes- unable to launch specified application on client node

2009-11-06 Thread Josh Hursey
As an alternative technique for distributing the binary, you could ask Open MPI's runtime to do it for you (made available in the v1.3 series). You still need to make sure that the same version of Open is installed on all nodes, but if you pass the --preload-binary option to mpirun the runt

Re: [OMPI users] Help: Firewall problems

2009-11-06 Thread Jeff Squyres
On Nov 6, 2009, at 5:49 AM, Lee Amy wrote: Thanks. And actually I don't know if I need to disable iptables to run MPI programs properly. Obviously from your words Open MPI will use random ports so how do I set up in iptables then let trusted machines open their random ports? I'm afraid I'm n

Re: [OMPI users] Help: Firewall problems

2009-11-06 Thread Lee Amy
On Fri, Nov 6, 2009 at 2:39 AM, Jeff Squyres wrote: > On Nov 5, 2009, at 11:28 AM, Lee Amy wrote: > >> I remembered MPI does not count on TCP/IP but why default iptables >> will prevent the MPI programs from running? After I stop iptables then >> programs run well. I use Ethernet as connection. >>

[OMPI users] nonblocking MPI_File_iwrite() does block?

2009-11-06 Thread Christoph Rackwitz
Hi, I was trying to use non-blocking MPI I/O and to my surprise, MPI_File_iwrite() is *blocking*. Please see the attached iwrite-test.c and run it (mpiexec -n 1 or standalone). This is what I get: MPI_File_iwrite: 10.706 s MPI_Wait: 0.000 s I take that to mean that MPI_File_iwrite()

Re: [OMPI users] using specific algorithm for collective communication and knowing the root cpu?

2009-11-06 Thread George Bosilca
On Nov 4, 2009, at 12:46 , George Markomanolis wrote: I have some questions, because I am using some programs for profiling, when you say that the cost of allreduce raise you mean about the time only or also and the flops of this command? Is there some additional work added at the allreduce