Re: [OMPI users] [EXT] Re: [EXT] Re: Error handling

2023-07-19 Thread Alexander Stadik via users
depending on exactly when Open MPI kills them. -- $ echo $? 79 From: users mailto:users-boun...@lists.open-mpi.org>> on behalf of Alexander Stadik via users mailto:users@lists.open-mpi.org&

Re: [OMPI users] [EXT] Re: [EXT] Re: Error handling

2023-07-19 Thread Alexander Stadik via users
are your values "random" if you provide correct values ? Even for negative values you could use MIN to pick one value and return it. What is the problem with `MPI_Abort` ? it does seem to do what you want. George. On Tue, Jul 18, 2023 at 4:38 AM Alexander Stadik via users mailto:user

Re: [OMPI users] [EXT] Re: Error handling

2023-07-18 Thread Alexander Stadik via users
MIN to pick one value and return it. What is the problem with `MPI_Abort` ? it does seem to do what you want. George. On Tue, Jul 18, 2023 at 4:38 AM Alexander Stadik via users mailto:users@lists.open-mpi.org>> wrote: Hey everyone, I am working for longer time now with cuda-aware Open

[OMPI users] Error handling

2023-07-18 Thread Alexander Stadik via users
Hey everyone, I am working for longer time now with cuda-aware OpenMPI, and developed longer time back a small exceptions handling framework including MPI and CUDA exceptions. Currently I am using MPI_Abort with costum error numbers, to terminate everything elegantly, which works well, by just

Re: [OMPI users] cuda-aware OpenMPI - high number of small asynch sent messages create invalid write

2022-02-22 Thread Alexander Stadik via users
I work. here is the valgrind output https://pastebin.com/Wwvn8Pa7 here the ompi_info –all output https://pastebin.com/FW0fazZH here the gdb output https://pastebin.com/4fNsxUd1 From: users On Behalf Of Alexander Stadik via users Sent: Thursday, 3 February 2022 14:06 To: users@lists.op

Re: [OMPI users] cuda-aware OpenMPI - high number of small asynch sent messages create invalid write

2022-02-03 Thread Alexander Stadik via users
://pastebin.com/4fNsxUd1 From: users On Behalf Of Alexander Stadik via users Sent: Thursday, 3 February 2022 14:06 To: users@lists.open-mpi.org Cc: Alexander Stadik Subject: [OMPI users] cuda-aware OpenMPI - high number of small asynch sent messages create invalid write CAUTION: This email

[OMPI users] cuda-aware OpenMPI - high number of small asynch sent messages create invalid write

2022-02-03 Thread Alexander Stadik via users
Hello whoever reads this, I am running my code using CUDA aware OpenMPI (see ompi_info –all attached). First I will explain the problem, further down I will give additional info about versions, hardware and debugging. The Problem: My application solves multiple mathematical equations on GPU via