On Mar 8, 2006, at 4:46 AM, Pierre Valiron wrote:
Sorry for the interruption. I back on mpi tracks again.
I have rebuilt openmpi-1.0.2a9 with -g and the error is unchanged.
I have also discovered that I don't need to run any openmpi
application
to show up the error.
mpirun --help or mpirun show up the same error:
valiron@icare ~ > mpirun
*Segmentation fault (core dumped)
and
valiron@icare ~ > pstack core
core 'core' of 13842: mpirun
fffffd7ffee9dfe0 strlen () + 20
fffffd7ffeef6ab3 vsprintf () + 33
fffffd7fff180fd1 opal_vasprintf () + 41
fffffd7fff180f88 opal_asprintf () + 98
00000000004098a3 orterun () + 63
0000000000407214 main () + 34
000000000040708c ???????? ()
Seems very basic !
It turns out this was an error in our compatibility code for asprintf
(). We were doing something with va_list structures that Solaris
didn't like. I'm actually surprised that it worked on the UltraSparc
version of Solaris, but it has been for some time for us.
Anyway, I committed a fix at r9223 on the subversion trunk - it
should make tonight's nightly tarball for the trunk. I've also asked
the release managers for v1.0.2 to push the fix into that release.
Thanks for reporting the issue and for the account. Let me know if
you have any further problems.
Brian
--
Brian Barrett
Open MPI developer
http://www.open-mpi.org/