Re: [OMPI users] --bynode vs --byslot

2008-06-04 Thread Cally K
Thanks, that was actually a lot of help, I had very little understanding of the bynode and byslot thingy, thanks On 6/5/08, Jeff Squyres wrote: > > On May 23, 2008, at 9:07 PM, Cally K wrote: > > > Hi, I have a question about --bynode and --byslot that i would like > > to clarify > > > > Say, for

Re: [OMPI users] OpenMPI scaling > 512 cores

2008-06-04 Thread Pavel Shamis (Pasha)
Scott Shaw wrote: Hi, I hope this is the right forum for my questions. I am running into a problem when scaling >512 cores on a infiniband cluster which has 14,336 cores. I am new to openmpi and trying to figure out the right -mca options to pass to avoid the "mca_oob_tcp_peer_complete_connect:

Re: [OMPI users] Problem with X forwarding

2008-06-04 Thread Allen Barnett
If you are using a recent version of Linux (as machine A), the X server is probably started with its TCP network connection turned off. For example, if you do: $ ps auxw | grep X /usr/bin/Xorg :0 -br -audit 0 -auth /var/gdm/:0.Xauth -nolisten tcp vt7 The "-nolisten tcp" option turns off the X ser

Re: [OMPI users] OpenMPI scaling > 512 cores

2008-06-04 Thread Jeff Squyres
One other parameter that I neglected to mention (and Scott pointed out to me is *not* documented in the FAQ) is the mpi_preconnect_oob MCA param. This parameter will cause all the OOB connections to be created during MPI_INIT, and *may* help such kind of issues. You *do* need to have eno

[OMPI users] libibverbs

2008-06-04 Thread Brock Palen
We have two installs of openmpi-1.2.3 One with the pgi compilers the other with gcc/Nagf90 One the pgi compilers does not link against libibverbs, but ompi_info shows the openib btl and we see traffic on the fabric. The other built with Nagware links against libibverbs. It also shoes in om

Re: [OMPI users] OpenMPI scaling > 512 cores

2008-06-04 Thread Åke Sandgren
On Wed, 2008-06-04 at 11:43 -0700, Scott Shaw wrote: > Hi, I was wondering if anyone had any comments with regarding to my > posting of questions. Am I off base with my questions or is this the > wrong forum for these types of questions? > > > > > Hi, I hope this is the right forum for my que

Re: [OMPI users] OpenMPI scaling > 512 cores

2008-06-04 Thread Jeff Squyres
First and foremost: is it possible to upgrade your version of Open MPI? The version you are using (1.2.2) is rather ancient -- many bug fixes have occurred since then (including TCP wireup issues). Note that oob_tcp_in|exclude were renamed to be oob_tcp_if_in|exclude in 1.2.3 to be symmet

Re: [OMPI users] Proper way to throw an error to all nodes?

2008-06-04 Thread Jeff Squyres
Yes -- MPI_Abort is the simplest way to get them all to die. But you'll also get error message(s) from OMPI. So you have [at least] 2 options: 1. Exit with MPI error - if (rank == process_who_does_the_checking && !exists(filename)) { print("bad!"); MPI_Abort(MPI_COMM_WORLD)

Re: [OMPI users] OpenMPI scaling > 512 cores

2008-06-04 Thread Scott Shaw
Hi, I was wondering if anyone had any comments with regarding to my posting of questions. Am I off base with my questions or is this the wrong forum for these types of questions? > > Hi, I hope this is the right forum for my questions. I am running into a > problem when scaling >512 cores on

Re: [OMPI users] Problem with X forwarding

2008-06-04 Thread Jeff Squyres
In general, Open MPI doesn't have anything to do with X forwarding. However, if you're using ssh to startup your processes, ssh may configure X forwarding for you (depending on your local system setup). But OMPI closes down ssh channels once applications have launched (there's no need to

Re: [OMPI users] OpenIB problem: error polling HP CQ...

2008-06-04 Thread Jeff Squyres
We have made a *lot* of changes to the run-time support for spawn and some changes to the FLUSH support in the openib BTL for the upcoming v1.3 series. Would it be possible for you to try a trunk nightly tarball snapshot, perchance? http://www.open-mpi.org/nightly/trunk/ On May 29,

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Brock Palen
I really think just having them in a flash container works well, youtube style. I do this with both snapZpro (mac only) and jing (windows & mac), if you want to have a higher quality downloadable traditional video though, they prob wont work. Also these are screen+audio/voice capture tool

Re: [OMPI users] HPMPI versus OpenMPI performance

2008-06-04 Thread Jeff Squyres
Thanks for all the detailed information! It is quite likely that our bsend performance has never been tuned; we simply implemented it, verified that it works, and then moved on -- we hadn't considered that real applications would actually use it. :-\ But that being said, 60% difference is

Re: [OMPI users] disabling tcp altogether

2008-06-04 Thread Jeff Squyres
On Jun 4, 2008, at 10:39 AM, tayfun sen wrote: I would like to run an OpenMPI application on one node and since I think it would be better performance wise I want it to use shared memory for communication and not tcp. Is it possible to use shared memory not only for MPI communication but al

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Jeff Squyres
Thanks for the tip! I downloaded and tried the qt_tools but all conversions that I did to the MP4 format looked absolutely horrid -- the resulting videos had "jagged" images and all kinds of weird artifacts that would appear and disappear. The slides were quite readable, but they just look

Re: [OMPI users] tg3 module

2008-06-04 Thread Patrick Geoffray
Hi Leonardo, Leonardo Fialho wrote: NETDEV WATCHDOG: eth0: transmit timed out tg3: eth0: transmit timed out, resetting tg3: tg3_stop_block timed out, ofs=2c00 enable_bit=2 tg3: tg3_stop_block timed out, ofs=4800 enable_bit=2 tg3: eth0: Link is down. tg3: eth0: Link is up at 1000 Mbps, full duple

Re: [OMPI users] Infinite loop when tcp free list max reached

2008-06-04 Thread Jeff Squyres
On May 26, 2008, at 5:17 PM, Matt Hughes wrote: With the TCP btl, when free list items are exhausted, OMPI 1.2.6 falls into an infinite loop: #3981 0x002a98b4e23f in opal_condition_wait (c=0x2a98c541d0, m=0x2a98c54180) at ../../../../opal/threads/condition.h:81 [snip] Yoinks. The ca

Re: [OMPI users] --bynode vs --byslot

2008-06-04 Thread Jeff Squyres
On May 23, 2008, at 9:07 PM, Cally K wrote: Hi, I have a question about --bynode and --byslot that i would like to clarify Say, for example, I have a hostfile #Hostfile __ node0 node1 slots=2 max_slots=2 node2 slots=2 max_slots=2 node3 slots=4 max_slots=4 ___

Re: [OMPI users] ORTE_ERROR_LOG Timeout

2008-06-04 Thread Jeff Squyres
James -- Sorry for the delay in replying. Do you have any firewall software running on your nodes (e.g., iptables)? OMPI uses random TCP ports to connect between nodes for control messages. If they can't reach each other because TCP ports are blocked, Bad Things will happen (potentially

Re: [OMPI users] openmpi 32-bit g++ compilation issue

2008-06-04 Thread Jeff Squyres
Sorry for the delay in replying. This looks like a problem on your system -- I think Doug is right: your system seems to be picking the wrong libraries when you specify - m32. Can you compile any C++ libraries/binaries with -m32 successfully? On May 19, 2008, at 5:48 PM, Arif Ali wrote:

[OMPI users] tg3 module

2008-06-04 Thread Leonardo Fialho
Hi All, I´m experimenting a strange problem. I don´t know if it was reported, but, thats is: when I run Open MPI in a specific cluster the network card module (tg3) goes down... and in some minutes go up again. Of course its results in "[nodo22][[56833,1],3][btl_tcp_frag.c:216:mca_btl_tcp_fr

[OMPI users] disabling tcp altogether

2008-06-04 Thread tayfun sen
Hello, I would like to run an OpenMPI application on one node and since I think it would be better performance wise I want it to use shared memory for communication and not tcp. Is it possible to use shared memory not only for MPI communication but also for control messages and other similar

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Scott Atchley
Jeff, If I remember correctly, Microsoft dropped support for .AVI 3-4 years ago so it can no longer be played by their media player. It is also not native to QT, so you will have to download a plugin (I have it somewhere if you want me to look for it). I do not know if there is a containe

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Jeff Squyres
On Jun 4, 2008, at 3:54 AM, Andreas Schäfer wrote: I consider him as a authority on this subject: ;-) http://isites.harvard.edu/fs/html/icb.topic58703/winston1.html Thanks -- I'll have a look! -- Jeff Squyres Cisco Systems

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Jeff Squyres
FWIW: I tried the http://adi.loris.tv/ompi-optimized.avi URL on my Mac and got redirected to the Quicktime plugin page. I had no idea which plugin would make it play AVI files, so I skipped it. I tried the URL on a Windows machine and Windows Media Player (i.e., what came up by default) s

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Adrian Knoth
On Wed, Jun 04, 2008 at 11:19:48AM +0200, Adrian Knoth wrote: > People usually recommend ffmpegX for OSX. You might give it a whirl to > transcode your mov to something else, let's say H.264 in an AVI > container. (MP4/AVC, DivX, xvid, there are so many names for it) I've checked your files, they

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Adrian Knoth
On Tue, Jun 03, 2008 at 04:48:50PM -0400, Jeff Squyres wrote: > As for .mov, yes, this is definitely a compromise. I tried uploading > the videos to YouTube and Google Video and a few others, but a) most QT sucks. Youtube (Flash) sucks. > slides look crappy and/or unreadable. So I had to

Re: [OMPI users] eigenvalue problem

2008-06-04 Thread Adrian Knoth
On Fri, May 30, 2008 at 10:22:42PM +0200, Radovan Herchel wrote: > Unfortunately, Arpack is suitable only to calculate a few eigenvalues, > not all. I don't know much about this math stuff, but people over here like SAGE: http://www.sagemath.org It has an MPI binding, programming can be done

Re: [OMPI users] GCC extendability to OpenMPI Specification

2008-06-04 Thread Joe Landman
Mukesh K Srivastava wrote: Hi OMPI Community. Is there any thought process to extend GCC support to OpenMPI or implementation of OpenMPI specification in GCC for C, C++ & Fortran and making it generally available for platforms which supports POSIX. Hi Mukesh: Open MPI is already written in

Re: [OMPI users] Open MPI instructional videos

2008-06-04 Thread Andreas Schäfer
On 16:48 Tue 03 Jun , Jeff Squyres wrote: > - more importantly, however, the audience likes to take the slides > away and when they actually look at them 6 weeks after the lecture, > they might actually remember the content better because they received > the same information via two forms

Re: [OMPI users] GCC extendability to OpenMPI Specification

2008-06-04 Thread Andreas Schäfer
Hi Mukesh, Open MPI is an implementation of the MPI standard. Its API is thus the one of a library, which is contrary to, say OpenMP, which requires changes to the compiler. Open MPI already supports C, C++ and Fortran for virtually any compiler and platform. For what it's worth, there is littl

[OMPI users] GCC extendability to OpenMPI Specification

2008-06-04 Thread Mukesh K Srivastava
Hi OMPI Community. Is there any thought process to extend GCC support to OpenMPI or implementation of OpenMPI specification in GCC for C, C++ & Fortran and making it generally available for platforms which supports POSIX. Can GCC community think extending a support library for OpenMPI in it's re

Re: [OMPI users] HPMPI versus OpenMPI performance

2008-06-04 Thread Mukesh K Srivastava
Hi Could you specify few things - (a) The version of PG Compilers being used both for HPMPI & OMPI. Is the compilers same or the version same? (b) Could you share the configure command given for OMPI with PG compilers? (c) Could you compare the threads enable/disable for both cases needed? (d)

Re: [OMPI users] HPMPI versus OpenMPI performance

2008-06-04 Thread Mukesh K Srivastava
Hi Could you specify few things - (a) The version of PG Compilers being used both for HPMPI & OMPI. Is the compilers same or the version same? (b) Could you share the configure command given for OMPI with PG compilers? (c) Could you compare the threads enable/disable for both cases needed? (d)