Very strange. I cannot reproduce it as I’m able to run any number of nodes and
procs, pushing over 100Mbytes thru without any problem.
Which leads me to suspect that the issue here is with the tty interface. Can
you tell me what shell and OS you are running?
> On Aug 23, 2016, at 3:25 PM, Jing
I’ve never heard of that, and cannot imagine what it has to do with the
resource manager. Can you point to where you heard that one?
FWIW: we don’t ship OMPI with anything in the default mca params file, so
somebody must have put it in there for you.
> On Aug 23, 2016, at 4:48 PM, Andy Riebs
I gleaned from the web that I need to comment out
"opal_event_include=epoll" in /etc/openmpi-mca-params.conf
in order to use Open MPI with PBS Pro.
Can we also disable that in other cases, like Slurm, or is this
something specific to PBS Pro?
Andy
--
Andy Riebs
andy.ri...@hpe.com
Hewlett-Pa
Everything stuck at MPI_Init. For a test job with 2 nodes and 10 cores each
node, I got the following
$ mpirun ./a.out < test.in
Rank 2 has cleared MPI_Init
Rank 4 has cleared MPI_Init
Rank 7 has cleared MPI_Init
Rank 8 has cleared MPI_Init
Rank 0 has cleared MPI_Init
Rank 5 has cleared MPI_Init
The IO forwarding messages all flow over the Ethernet, so the type of fabric is
irrelevant. The number of procs involved would definitely have an impact, but
that might not be due to the IO forwarding subsystem. We know we have flow
control issues with collectives like Bcast that don’t have buil
Hi Ralph,
I tested v2.0.1rc1 with your code but has the same issue. I also installed
v2.0.1rc1 on a different cluster which has Mellanox QDR Infiniband and get the
same result. For the tests you have done, how many cores and nodes did you use?
I can trigger the problem by using multiple nodes
Problem solved! I had to remove a macports directory from my path that had
the improper MPI library, and at last the code is working. Thanks so much
to everyone for your friendly and prompt suggestions - I appreciate it very
much.
Dick
On Tue, Aug 23, 2016 at 4:51 PM, Richard G French
wrote:
>
2415 rfrench@cosmos> mpicc -show
gcc -I/usr/local/include -L/usr/local/lib -lmpi
2416 rfrench@cosmos> which mpicc
/usr/local/bin/mpicc
I'll get rid of the mpich-mp stuff and give it another try!
On Tue, Aug 23, 2016 at 4:40 PM, Reuti wrote:
> Hi,
>
> Am 23.08.2016 um 21:43 schrieb Richard
It sounds like I need to understand what flavor of MPI I need for this
project. I'll do some homework to try to understand the differences.
Thanks for the leads.
Dick
On Tue, Aug 23, 2016 at 4:40 PM, Douglas L Reeder
wrote:
> Dick,
>
> In the openmpi library directory you can do
>
> nm library_
Hi,
Am 23.08.2016 um 21:43 schrieb Richard G French:
> Hi, all -
> I'm trying to build the SPH code Gadget2
> (http://wwwmpa.mpa-garching.mpg.de/gadget/) under OS X 10.11.6 and I am
> getting the following type of error:
>
> 222 rfrench@cosmos> make
>
> mpicc main.o run.o predict.o begrun.o
Dick,
In the openmpi library directory you can do
nm library_file_name|grep ompi_mpi_byte
And that wil tell you if that library contains ompi_mpi_byte
Doug
> On Aug 23, 2016, at 2:30 PM, Richard G French wrote:
>
> Thanks for the suggestion, Doug - but I can't seem to find the missing
> func
I’m confused - you keep talking about MPICH, but the symbol you are looking for
is from OMPI. You cannot mix the two MPI libraries - is that what you are
trying to do?
> On Aug 23, 2016, at 1:30 PM, Richard G French wrote:
>
> Thanks for the suggestion, Doug - but I can't seem to find the miss
Thanks for the suggestion, Doug - but I can't seem to find the missing
function ompi_mpi_byte in any of those other libraries. I'll keep looking!
I wonder if I failed to configure mpich properly when I built it.
Dick
On Tue, Aug 23, 2016 at 4:01 PM, Douglas L Reeder
wrote:
> Richard,
>
> It loo
Richard,
It looks like you need to add some -l arguments to the the specific openmpi
libraries hat you need (e.g., -lmpi -lmpi_cxx)
Doug
> On Aug 23, 2016, at 1:43 PM, Richard G French wrote:
>
> Hi, all -
> I'm trying to build the SPH code Gadget2
> (http://wwwmpa.mpa-garching.mpg.de/gad
Hi, all -
I'm trying to build the SPH code Gadget2 (
http://wwwmpa.mpa-garching.mpg.de/gadget/) under OS X 10.11.6 and I am
getting the following type of error:
222 rfrench@cosmos> make
mpicc main.o run.o predict.o begrun.o endrun.o global.o timestep.o
init.o restart.o io.o accel.o read_ic.o
Might be worth trying with --mca btl_openib_cpc_include udcm and see if that
works.
-Nathan
On Aug 23, 2016, at 02:41 AM, "Juan A. Cordero Varelaq"
wrote:
Hi Gilles,
If I run it like this:
mpirun --mca btl ^openib,usnic --mca pml ob1 --mca btl_sm_use_knem 0 -np 5
myscript.sh
it works fine
Juan,
You are not using infiniband with these options.
Have your infiniband fabric fixed, and you will be able to mpirun without any
options, and with native infiniband.
Cheers,
Gilles
"Juan A. Cordero Varelaq" wrote:
>Hi Gilles,
>
>If I run it like this:
>
>mpirun --mca btl ^openib,usnic --m
Hi Gilles,
If I run it like this:
mpirun --mca btl ^openib,usnic --mca pml ob1 --mca btl_sm_use_knem 0 -np
5 myscript.sh
it works fine. Am I using infiniband in this way? However, if I remove
*openib*, I get the *librdmacm: Fatal: unable to open RDMA device*
error. So what would be the most
Juan,
if you want to use infiniband with the openib/btl (i am assuming MXM is
not available on your platform, and you to not want
to use infiniband via usnic/libfabric), you can
mpirun --mca pml ob1 --mca btl ^usnic ...
/* i am pretty sure mpirun ... would do the trick too */
if you get th
Hi Gilles,
so if I use rthe option --mca pml ob1, I use infiniband and it will be
as fast as normal, right?
Thanks
On 22/08/16 14:22, Gilles Gouaillardet wrote:
Juan,
to keep things simple, --mca pml ob1 ensures you are not using mxm
(yet an other way to use infiniband)
IPoIB is unlikely
20 matches
Mail list logo