I filed https://svn.open-mpi.org/trac/ompi/ticket/4856 to apply these ROMIO 
patches.

That probably won't happen until Open MPI 1.8.3.

On Aug 6, 2014, at 2:54 PM, Rob Latham <r...@mcs.anl.gov> wrote:

> 
> 
> On 08/06/2014 11:50 AM, Mohamad Chaarawi wrote:
> 
>> To replicate, run the program with 2 or more procs:
>> 
>> mpirun -np 2 ./hindexed_io mpi_test_file
>> 
>> [jam:15566] *** Process received signal ***
>> [jam:15566] Signal: Segmentation fault (11)
>> [jam:15566] Signal code: Address not mapped (1)
>> [jam:15566] Failing at address: (nil)
>> [jam:15566] [ 0] [0xfcd440]
>> [jam:15566] [ 1]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(ADIOI_Flatten_datatype+0x17a)[0xc80f2a]
> 
> I bet Open MPI needs to pick up a few patches for this fault:
> 
> - http://git.mpich.org/mpich.git/commit/50f3d5806
> - http://git.mpich.org/mpich.git/commit/97114ec5b
> - http://git.mpich.org/mpich.git/commit/90e15e9b0
> - http://git.mpich.org/mpich.git/commit/76a079c7c
> 
> 
> ... and two more patches that are sitting in my tree awaiting review.
> 
> 
> ==rob
> 
> 
> 
>> 
>> [jam:15566] [ 2]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(ADIO_Set_view+0x1c1)[0xc72a6d]
>> [jam:15566] [ 3]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(mca_io_romio_dist_MPI_File_set_view+0x69b)[0xc8d11b]
>> 
>> [jam:15566] [ 4]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(mca_io_romio_file_set_view+0x7c)[0xc4f7c5]
>> 
>> [jam:15566] [ 5]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(PMPI_File_set_view+0x1e6)[0xb32f7e]
>> 
>> [jam:15566] [ 6] ./hindexed_io[0x8048aa6]
>> [jam:15566] [ 7] /lib/libc.so.6(__libc_start_main+0xdc)[0x7d5ebc]
>> [jam:15566] [ 8] ./hindexed_io[0x80487e1]
>> [jam:15566] *** End of error message ***
>> 
>> If I use --mca io ompio with 2 or more procs, the program segfaults in
>> write_at_all (regardless of which routine is used to construct the
>> zero-sized datatype):
>> 
>> [jam:15687] *** Process received signal ***
>> [jam:15687] Signal: Floating point exception (8)
>> [jam:15687] Signal code: Integer divide-by-zero (1)
>> [jam:15687] Failing at address: 0x3e29b7
>> [jam:15687] [ 0] [0xe56440]
>> [jam:15687] [ 1]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(ompi_io_ompio_set_explicit_offset+0x9d)[0x3513bc]
>> 
>> [jam:15687] [ 2]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(ompio_io_ompio_file_write_at_all+0x3e)[0x35869a]
>> 
>> [jam:15687] [ 3]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(mca_io_ompio_file_write_at_all+0x66)[0x358650]
>> 
>> [jam:15687] [ 4]
>> /scr/chaarawi/install/ompi/lib/libmpi.so.1(MPI_File_write_at_all+0x1b3)[0x1f46f3]
>> 
>> [jam:15687] [ 5] ./hindexed_io[0x8048b07]
>> [jam:15687] [ 6] /lib/libc.so.6(__libc_start_main+0xdc)[0x7d5ebc]
>> [jam:15687] [ 7] ./hindexed_io[0x80487e1]
>> [jam:15687] *** End of error message ***
>> 
>> If I use MPICH 3.1.2, I don't see those issues.
>> 
>> Thanks,
>> Mohamad
>> 
>> 
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/users/2014/08/24931.php
>> 
> 
> -- 
> Rob Latham
> Mathematics and Computer Science Division
> Argonne National Lab, IL USA
> _______________________________________________
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2014/08/24934.php


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/
