Re: [OMPI users] Apllication level checkpointing tools.

2009-06-30 Thread neeraj
Dear Mohamed, Is there some checkpointing software for interconnect other than tcp say IB or Myrinet? Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary of TATA SONS Ltd) B-101, ICC Trade Towers, Senapati Bapat Road Pune 411016 (Mah) INDIA

[OMPI users] selectively bind MPI to one HCA out of available ones

2009-07-15 Thread neeraj
during mpi launch time to select mthca0 or mthca1? Any help is appreciated. Btw i just checked Mvapich and feature is there inside. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary of TATA SONS Ltd) B-101, ICC Trade Towers, Senapati Bapat

Re: [OMPI users] selectively bind MPI to one HCA out of available ones

2009-07-15 Thread neeraj
Thanks Ralph, i found the mca parameter. It is btl_openib_max_btls which controls the available HCAs. Thanks for helping. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary of TATA SONS Ltd) B-101, ICC Trade Towers, Senapati

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread neeraj
. Since collectives depend heavily on your network architecture and message size, i would like you to first fine tune your collectives on your network fabric before running any scientific application. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread neeraj
Hi Terry, I feel hierarchical collectives are slower compare to tuned one. I had done some benchmark in the past specific to collectives, and this is what i feel based on my observation. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread neeraj
Hi Craig, How was the nodefile selected for execution? Whether it was provided by scheduler say LSF/SGE/PBS or you manually gave it? With WRF, we observed giving sequential nodes (Blades which are in the same order as in enclosure) gave us some performance benefit. Regards Neeraj

Re: [OMPI users] Performance question about OpenMPI and MVAPICH2 on IB

2009-08-07 Thread neeraj
infiniband clos network. Results were tested on 12-16 nodes with 8 mpi process each node. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidiary of TATA SONS Ltd) B-101, ICC Trade Towers, Senapati Bapat Road Pune 411016 (Mah) INDIA (O) +91-20

Re: [OMPI users] Multi-threading with OpenMPI ?

2009-09-18 Thread neeraj
using supported thread library on your platform(selected by default during configure or use --with-threads). You can't use OPAL library as it is not exported to outside MPI programming world. Regards Neeraj Chourasia (MTS) Computational Research Laboratories Ltd. (A wholly Owned Subsidia

[OMPI users] Checkpointing hangs with OpenMPI-1.3.1

2009-04-10 Thread neeraj
MPI_Barrier(MPI_COMM_WORLD); } MPI_Finalize(); } Let me know, what could be the error. I feel there is the error in MPI process coordination. Regards Neeraj Chourasia Member of Technical Staff Computational Research Laboratories Limited (A wholly Owned Subsidiary of TAT

Re: [OMPI users] OpenMPI MPI_Bcast Algorithms

2009-04-28 Thread neeraj
may differ from one network topology to another. In that case, i would suggest you to run benchmark programs with 2. option and fine tune the MPI Collectives suited for your cluster architecture. Regards Neeraj Chourasia Member of Technical Staff Computational Research Laboratories

Re: [OMPI users] Checkpointing hangs with OpenMPI-1.3.1

2009-04-28 Thread neeraj
Soon, this error goes away, if i force mpirun to use tcp for communication using mca parameters and then error a) starts coming which is related with some datatype handling during checkpoint. Regards Neeraj Chourasia Member of Technical Staff Computational Research Laboratories Limited (A wholly

Re: [OMPI users] Slightly off topic: Ethernet and InfiniBand speed evolution

2009-05-07 Thread neeraj
Thanks Pasha for sharing IB Roadmaps with us. But i am more interested in to find out latency figures since they often matter more than bit rate. Could there be rough if not accurate the latency figures being targeted in IB World? Regards Neeraj Chourasia Member of Technical Staff

Re: [OMPI users] another mpirun + xgrid question

2007-09-10 Thread Neeraj Chourasia
If you are using scheduler like PBS or SGE over MPI, there is an option called prolog and epilog, where you can give scripts which does copy operation. This script is called before and after job execution as the name suggests. Without it, in mpi itself, i have to see, if it can be done. T

[OMPI users] libnbc compilation

2007-10-01 Thread Neeraj Chourasia
Hello Everyone,    I was checking the development version from svn and found that support for libnbc is going to come in next release. I thought of compiling it, but failed to do.Could some one suggest me how to get it compiled.When i made changes to configure script(Basically added some flags)

[OMPI users] Query regarding GPR

2007-10-09 Thread Neeraj Chourasia
, then how does compute node gets the information of the same during execution ? Does it use OOB for it ?-Neeraj

[OMPI users] Tuning Openmpi with IB Interconnect

2007-10-11 Thread Neeraj Chourasia
communication.Any help in this direction would be appreciated.-Neeraj

[OMPI users] Re :Re: Tuning Openmpi with IB Interconnect

2007-10-11 Thread Neeraj Chourasia
done it for Ethernet/Giga-bit Ethernet and IPoIB ofcourse in experimental stage. Actually i want to contribute for it in OpenMPI and need the help for the same.-NeerajOn Thu, 11 Oct 2007 12:01:39 +0200 Open MPI Users wrote Hi Neeraj, > Could anyone tell me the important tun

[OMPI users] Re :Re: Re :Re: Tuning Openmpi with IB Interconnect

2007-10-12 Thread Neeraj Chourasia
Yes, the buffer was being re-used. No we didnt try to benchmark it with netpipe and other stuffs. But the program was pretty simple. Do you think, I need to test it with bigger chunks (>8MB) for communication.?We also tried manipulating eager_limit and min_rdma_sze, but no success.NeerajOn Fri,

[OMPI users] Compile test programs

2007-10-18 Thread Neeraj Chourasia
having Makefile created on running configure script, but few of them like runtime doesn\'t have the Makefile.Please help me compiling it.-Neeraj

[OMPI users] OpenMPI 1.2.4 vs 1.2

2007-10-24 Thread Neeraj Chourasia
, and to my surprise, old version performs better in both scenarios.Could anyone give me the reason for the same?I repeated the above point to point  tests between all set of nodes, but the result were same :(-Neeraj

[OMPI users] Re :Re: Process 0 with different time executing the same code

2007-10-26 Thread Neeraj Chourasia
Hi,    Please ensure if following things are correct1) The array bounds are equal. Means \"my_x\" and \"size_y\" has the same value on all nodes.2) Nodes are homogenous. To check that, you could decide root to be some different node and run the program-NeerajOn Fri, 26 Oct 2007 10:13:15 +0500 (

[OMPI users] MPI_Send issues with openib btl

2007-10-26 Thread Neeraj Chourasia
messages like local protocol error, flush error, invalid request error, local length error kind of messages.Any help would be appreciated.-Neeraj

[OMPI users] OpenMP and OpenMPI Issue

2007-10-30 Thread Neeraj Chourasia
of reference, program works fine, if we force openmpi to select TCP interconnect using --mca btl tcp,self.-Neeraj #include #include #include #include #include #include "time.h" #include #define MAX 100 int main(int argc, char *argv[]) { int required = MPI_THREAD_MULTIP

[OMPI users] Re :Re: OpenMP and OpenMPI Issue

2007-11-01 Thread Neeraj Chourasia
, at 12:17 AM, Neeraj Chourasia wrote:> Hi folks, > > I have been seeing some nasty behaviour in MPI_Send/Recv> with large dataset(8 MB), when used with OpenMP and Openmpi> together with IB Interconnect. Attached is a program. > >

[OMPI users] Adding new API

2007-11-05 Thread Neeraj Chourasia
should be considerate about?-Neeraj

[OMPI users] version 1.3

2007-11-28 Thread Neeraj Chourasia
ms of checkpointing. But i am pretty sure, once v1.3 will come, it will help a lot to HPC community. I can find the development trunk version, but i am more interested in production release version. -Neeraj

Re: [OMPI users] OpenIB problems

2007-11-29 Thread Neeraj Chourasia
. Problem comes when data size increases and OpenMPI starts splitting it. I think even with Bigger sizes, Program works if interconnect is TCP, but fails to work on IB. So on IB, you can run your program if you set mca paramter mpi_leave_pinned to 1. Cheers Neeraj On Thu, 29 Nov 2007 Brock

[OMPI users] what is MPI_IN_PLACE

2007-12-11 Thread Neeraj Chourasia
Hello everyone,    While going through collective algorithms, I came across preprocessor directive MPI_IN_PLACE which is (void *)1. Its always being compared against source buffer(sbuf). My question is when MPI_IN_PLACE == sbuf condition would be true. As far as i understand, sbuf is the address

[OMPI users] Re :Re: what is MPI_IN_PLACE

2007-12-11 Thread Neeraj Chourasia
Neeraj,MPI_IN_PLACE is defined by the MPI standard in order to allow theusers to specify that the input and output buffers for the collectivesare the same. Moreover, not all collectives support MPI_IN_PLACE andfor those that support it some strict rules apply. Please read the

[OMPI users] orte in persistent mode

2007-12-31 Thread Neeraj Chourasia
i am looking at is in large clusters, mpirun takes lot of time starting orted (by ssh) on remote nodes. If orte is already running, hopefully we can save considerable time. Any comments is appreciated. -Neeraj

[OMPI users] Openmpi with SGE

2008-02-20 Thread Neeraj Chourasia
When i do ssh to n101, there is no orted and qrsh_starter running. While checking its spool file, i came across following message---Execd spool Error Message-----|execd|n101|E|n

[OMPI users] RDMA-CM

2008-06-17 Thread Neeraj Chourasia
Hello everyone,    I downloaded openmpi-1.3 version from night tarballs to check RDMA-CM support. I am able to compile and install it, but dont know how to run it as there is no documentation provided. Did someone try running it with OpenMPI?My another question is Does OpenMPI1.3 has progress-t

[OMPI users] Re :Re: Linpack Benchmark and File Descriptor Limits

2008-09-19 Thread Neeraj Chourasia
Hello,    With openmpi-1.3,  new mca feature is introduced namely --mca routed binomial. This ensures out of band communication to happen in binomial fashion and reduces the net socket opening and hence solves file open issues.-NeerajOn Thu, 18 Sep 2008 16:46:23 -0700 Open MPI Users wrote I'm