On Mon, 18 Mar 2024, Pierre Jolivet wrote:

>
> > On 18 Mar 2024, at 5:13 PM, Satish Balay via petsc-users
> > <[email protected]> wrote:
> >
> > Ah - the compiler did flag code bugs.
> >
> >> (current version is 0.3.26 but we can’t update because there is a huge
> >> performance regression which makes the pipeline timeout)
> >
> > maybe we should retry - updating to the latest snapshot and see if this
> > issue persists.
>
> Well, that’s easy to see it is _still_ broken:
> https://gitlab.com/petsc/petsc/-/jobs/6419779589
>
> The infamous gcc segfault that can’t let us run the pipeline, but that builds
> fine when it’s you that connect to the machine (I bothered you about this a
> couple of months ago in case you don’t remember, see
> https://gitlab.com/petsc/petsc/-/merge_requests/7143
> ).
> make[2]: *** [../../Makefile.tail:46: libs] Bus error (core dumped)

Ah - ok - that's a strange error. I'm not sure how to debug it. [it fails when the build is invoked from configure - but not when it's invoked directly from bash/shell.]

Satish

> Thanks,
> Pierre
>
> >
> > Satish
> >
> > On Mon, 18 Mar 2024, Zongze Yang wrote:
> >
> >> The issue of openblas was resolved by this pr
> >> https://github.com/OpenMathLib/OpenBLAS/pull/4565
> >>
> >> Best wishes,
> >> Zongze
> >>
> >>> On 18 Mar 2024, at 00:50, Zongze Yang <[email protected]> wrote:
> >>>
> >>> It can be resolved by adding CFLAGS=-Wno-int-conversion. Perhaps the
> >>> default behaviour of the new version compiler has been changed?
> >>>
> >>> Best wishes,
> >>> Zongze
> >>>> On 18 Mar 2024, at 00:23, Satish Balay <[email protected]> wrote:
> >>>>
> >>>> Hm - I just tried a build with balay/xcode15-mpich - and that goes
> >>>> through fine for me. So don't know what the difference here is.
> >>>>
> >>>> One difference is - I have a slightly older xcode. However your compiler
> >>>> appears to behave as using -Werror. Perhaps CFLAGS=-Wno-int-conversion
> >>>> will help here?
> >>>>
> >>>> Satish
> >>>>
> >>>> ----
> >>>> Executing: gcc --version
> >>>> stdout:
> >>>> Apple clang version 15.0.0 (clang-1500.3.9.4)
> >>>>
> >>>> Executing:
> >>>> /Users/zzyang/workspace/repos/petsc/arch-darwin-c-debug/bin/mpicc -show
> >>>> stdout: gcc -fPIC -fno-stack-check -Qunused-arguments -g -O0
> >>>> -Wno-implicit-function-declaration -fno-common
> >>>> -I/Users/zzyang/workspace/repos/petsc/arch-darwin-c-debug/include
> >>>> -L/Users/zzyang/workspace/repos/petsc/arch-darwin-c-debug/lib -lmpi
> >>>> -lpmpi
> >>>>
> >>>> /Users/zzyang/workspace/repos/petsc/arch-darwin-c-debug/bin/mpicc -O2
> >>>> -DMAX_STACK_ALLOC=2048 -Wall -DF_INTERFACE_GFORT -fPIC -DNO_WARMUP
> >>>> -DMAX_CPU_NUMBER=12 -DMAX_PARALLEL_NUMBER=1 -DBUILD_SINGLE=1
> >>>> -DBUILD_DOUBLE=1 -DBUILD_COMPLEX=1 -DBUILD_COMPLEX16=1
> >>>> -DVERSION=\"0.3.21\" -march=armv8-a -UASMNAME -UASMFNAME -UNAME -UCNAME
> >>>> -UCHAR_NAME -UCHAR_CNAME -DASMNAME=_lapack_wrappers
> >>>> -DASMFNAME=_lapack_wrappers_ -DNAME=lapack_wrappers_
> >>>> -DCNAME=lapack_wrappers -DCHAR_NAME=\"lapack_wrappers_\"
> >>>> -DCHAR_CNAME=\"lapack_wrappers\" -DNO_AFFINITY -I.. -c
> >>>> src/lapack_wrappers.c -o src/lapack_wrappers.o
> >>>> src/lapack_wrappers.c:570:81: error: incompatible integer to pointer
> >>>> conversion passing 'blasint' (aka 'int') to parameter of type 'const
> >>>> blasint *' (aka 'const int *'); take the address with &
> >>>> [-Wint-conversion]
> >>>> RELAPACK_sgemmt(uplo, transA, transB, n, k, alpha, A, ldA, B, ldB,
> >>>> beta, C, info);
> >>>> ^~~~
> >>>> &
> >>>>
> >>>> vs:
> >>>> Executing: gcc --version
> >>>> stdout:
> >>>> Apple clang version 15.0.0 (clang-1500.1.0.2.5)
> >>>>
> >>>> Executing: /Users/balay/petsc/arch-darwin-c-debug/bin/mpicc -show
> >>>> stdout: gcc -fPIC -fno-stack-check -Qunused-arguments -g -O0
> >>>> -Wno-implicit-function-declaration -fno-common
> >>>> -I/Users/balay/petsc/arch-darwin-c-debug/include
> >>>> -L/Users/balay/petsc/arch-darwin-c-debug/lib -lmpi -lpmpi
> >>>>
> >>>> /Users/balay/petsc/arch-darwin-c-debug/bin/mpicc -O2
> >>>> -DMAX_STACK_ALLOC=2048 -Wall -DF_INTERFACE_GFORT -fPIC -DNO_WARMUP
> >>>> -DMAX_CPU_NUMBER=24 -DMAX_PARALLEL_NUMBER=1 -DBUILD_SINGLE=1
> >>>> -DBUILD_DOUBLE=1 -DBUILD_COMPLEX=1 -DBUILD_COMPLEX16=1
> >>>> -DVERSION=\"0.3.21\" -march=armv8-a -UASMNAME -UASMFNAME -UNAME -UCNAME
> >>>> -UCHAR_NAME -UCHAR_CNAME -DASMNAME=_lapack_wrappers
> >>>> -DASMFNAME=_lapack_wrappers_ -DNAME=lapack_wrappers_
> >>>> -DCNAME=lapack_wrappers -DCHAR_NAME=\"lapack_wrappers_\"
> >>>> -DCHAR_CNAME=\"lapack_wrappers\" -DNO_AFFINITY -I.. -c
> >>>> src/lapack_wrappers.c -o src/lapack_wrappers.o
> >>>> src/lapack_wrappers.c:570:81: warning: incompatible integer to pointer
> >>>> conversion passing 'blasint' (aka 'int') to parameter of type 'const
> >>>> blasint *' (aka 'const int *'); take the address with &
> >>>> [-Wint-conversion]
> >>>> RELAPACK_sgemmt(uplo, transA, transB, n, k, alpha, A, ldA, B, ldB,
> >>>> beta, C, info);
> >>>> ^~~~
> >>>> &
> >>>>
> >>>> On Sun, 17 Mar 2024, Pierre Jolivet wrote:
> >>>>
> >>>>> Ah, my bad, I misread linux-opt-arm as a macOS runner, no wonder the
> >>>>> option is not helping…
> >>>>> Take Barry’s advice.
> >>>>> Furthermore, it looks like OpenBLAS people are steering in the opposite
> >>>>> direction as us, by forcing the use of ld-classic
> >>>>> https://github.com/OpenMathLib/OpenBLAS/commit/103d6f4e42fbe532ae4ea48e8d90d7d792bc93d2
> >>>>> , so that’s another good argument in favor of -framework Accelerate.
> >>>>>
> >>>>> Thanks,
> >>>>> Pierre
> >>>>>
> >>>>> PS: anyone benchmarked those
> >>>>> https://developer.apple.com/documentation/accelerate/sparse_solvers
> >>>>> ? I didn’t even know they existed.
> >>>>>
> >>>>>> On 17 Mar 2024, at 3:06 PM, Zongze Yang <[email protected]> wrote:
> >>>>>>
> >>>>>> Understood. Thank you for your advice.
> >>>>>>
> >>>>>> Best wishes,
> >>>>>> Zongze
> >>>>>>
> >>>>>>> On 17 Mar 2024, at 22:04, Barry Smith <[email protected]> wrote:
> >>>>>>>
> >>>>>>> I would just avoid the --download-openblas option.
> >>>>>>> The BLAS/LAPACK provided by Apple should perform fine, perhaps even
> >>>>>>> better than OpenBLAS on your system.
> >>>>>>>
> >>>>>>>> On Mar 17, 2024, at 9:58 AM, Zongze Yang <[email protected]> wrote:
> >>>>>>>>
> >>>>>>>> Adding the flag `--download-openblas-make-options=TARGET=GENERIC`
> >>>>>>>> did not resolve the issue. The same error persisted.
> >>>>>>>>
> >>>>>>>> Best wishes,
> >>>>>>>> Zongze
> >>>>>>>>
> >>>>>>>>> On 17 Mar 2024, at 20:58, Pierre Jolivet <[email protected]> wrote:
> >>>>>>>>>
> >>>>>>>>>> On 17 Mar 2024, at 1:04 PM, Zongze Yang <[email protected]> wrote:
> >>>>>>>>>>
> >>>>>>>>>> Thank you for providing the instructions. I tried the first option.
> >>>>>>>>>>
> >>>>>>>>>> Now, the error of the configuration is related to OpenBLAS.
> >>>>>>>>>> Adding `--CFLAGS=-Wno-int-conversion` to the configure command
> >>>>>>>>>> resolves this. Should this be reported to OpenBLAS? Or does the
> >>>>>>>>>> configure in petsc need to be fixed?
> >>>>>>>>>
> >>>>>>>>> I see our linux-opt-arm runner is using the additional flag
> >>>>>>>>> '--download-openblas-make-options=TARGET=GENERIC', could you maybe
> >>>>>>>>> try to add that as well?
> >>>>>>>>> I don’t think there is much to fix on our end, OpenBLAS has been
> >>>>>>>>> very broken lately on arm (current version is 0.3.26 but we can’t
> >>>>>>>>> update because there is a huge performance regression which makes
> >>>>>>>>> the pipeline timeout).
> >>>>>>>>>
> >>>>>>>>> Thanks,
> >>>>>>>>> Pierre
> >>>>>>>>>
> >>>>>>>>>>
> >>>>>>>>>> The configure.log is attached.
> >>>>>>>>>> The errors are shown below:
> >>>>>>>>>> ```
> >>>>>>>>>> src/lapack_wrappers.c:570:81: error: incompatible integer to
> >>>>>>>>>> pointer conversion passing 'blasint' (aka 'int') to parameter of
> >>>>>>>>>> type 'const blasint *' (aka 'const int *'); take the address with
> >>>>>>>>>> & [-Wint-conversion]
> >>>>>>>>>> RELAPACK_sgemmt(uplo, transA, transB, n, k, alpha, A, ldA,
> >>>>>>>>>> B, ldB, beta, C, info);
> >>>>>>>>>> ^~~~
> >>>>>>>>>> &
> >>>>>>>>>> src/../inc/relapack.h:74:216: note: passing argument to
> >>>>>>>>>> parameter here
> >>>>>>>>>> void RELAPACK_sgemmt(const char *, const char *, const char *,
> >>>>>>>>>> const blasint *, const blasint *, const float *, const float *,
> >>>>>>>>>> const blasint *, const float *, const blasint *, const float *,
> >>>>>>>>>> float *, const blasint *);
> >>>>>>>>>> ^
> >>>>>>>>>> src/lapack_wrappers.c:583:81: error: incompatible integer to
> >>>>>>>>>> pointer conversion passing 'blasint' (aka 'int') to parameter of
> >>>>>>>>>> type 'const blasint *' (aka 'const int *'); take the address with
> >>>>>>>>>> & [-Wint-conversion]
> >>>>>>>>>> RELAPACK_dgemmt(uplo, transA, transB, n, k, alpha, A, ldA,
> >>>>>>>>>> B, ldB, beta, C, info);
> >>>>>>>>>> ^~~~
> >>>>>>>>>> &
> >>>>>>>>>> src/../inc/relapack.h:75:221: note: passing argument to
> >>>>>>>>>> parameter here
> >>>>>>>>>> void RELAPACK_dgemmt(const char *, const char *, const char *,
> >>>>>>>>>> const blasint *, const blasint *, const double *, const double *,
> >>>>>>>>>> const blasint *, const double *, const blasint *, const double *,
> >>>>>>>>>> double *, const blasint *);
> >>>>>>>>>> ^
> >>>>>>>>>> src/lapack_wrappers.c:596:81: error: incompatible integer to
> >>>>>>>>>> pointer conversion passing 'blasint' (aka 'int') to parameter of
> >>>>>>>>>> type 'const blasint *' (aka 'const int *'); take the address with
> >>>>>>>>>> & [-Wint-conversion]
> >>>>>>>>>> RELAPACK_cgemmt(uplo, transA, transB, n, k, alpha, A, ldA,
> >>>>>>>>>> B, ldB, beta, C, info);
> >>>>>>>>>> ^~~~
> >>>>>>>>>> &
> >>>>>>>>>> src/../inc/relapack.h:76:216: note: passing argument to
> >>>>>>>>>> parameter here
> >>>>>>>>>> void RELAPACK_cgemmt(const char *, const char *, const char *,
> >>>>>>>>>> const blasint *, const blasint *, const float *, const float *,
> >>>>>>>>>> const blasint *, const float *, const blasint *, const float *,
> >>>>>>>>>> float *, const blasint *);
> >>>>>>>>>> ^
> >>>>>>>>>> src/lapack_wrappers.c:609:81: error: incompatible integer to
> >>>>>>>>>> pointer conversion passing 'blasint' (aka 'int') to parameter of
> >>>>>>>>>> type 'const blasint *' (aka 'const int *'); take the address with
> >>>>>>>>>> & [-Wint-conversion]
> >>>>>>>>>> RELAPACK_zgemmt(uplo, transA, transB, n, k, alpha, A, ldA,
> >>>>>>>>>> B, ldB, beta, C, info);
> >>>>>>>>>> ^~~~
> >>>>>>>>>> &
> >>>>>>>>>> src/../inc/relapack.h:77:221: note: passing argument to
> >>>>>>>>>> parameter here
> >>>>>>>>>> void RELAPACK_zgemmt(const char *, const char *, const char *,
> >>>>>>>>>> const blasint *, const blasint *, const double *, const double *,
> >>>>>>>>>> const blasint *, const double *, const blasint *, const double *,
> >>>>>>>>>> double *, const blasint *);
> >>>>>>>>>> ^
> >>>>>>>>>> 4 errors generated.
> >>>>>>>>>> ```
> >>>>>>>>>>
> >>>>>>>>>> Best wishes,
> >>>>>>>>>> Zongze
> >>>>>>>>>>
> >>>>>>>>>> <configure.log.tar.gz>
> >>>>>>>>>>
> >>>>>>>>>>> On 17 Mar 2024, at 18:48, Pierre Jolivet <[email protected]> wrote:
> >>>>>>>>>>>
> >>>>>>>>>>> You need this MR
> >>>>>>>>>>> https://gitlab.com/petsc/petsc/-/merge_requests/7365
> >>>>>>>>>>> main has been broken for macOS since
> >>>>>>>>>>> https://gitlab.com/petsc/petsc/-/merge_requests/7341,
> >>>>>>>>>>> so the alternative is to revert to the commit prior.
> >>>>>>>>>>> It should work either way.
> >>>>>>>>>>>
> >>>>>>>>>>> Thanks,
> >>>>>>>>>>> Pierre
> >>>>>>>>>>>
> >>>>>>>>>>>> On 17 Mar 2024, at 11:31 AM, Zongze Yang <[email protected]>
> >>>>>>>>>>>> wrote:
> >>>>>>>>>>>>
> >>>>>>>>>>>> Hi, PETSc Team,
> >>>>>>>>>>>>
> >>>>>>>>>>>> I am trying to install petsc with the following configuration
> >>>>>>>>>>>> ```
> >>>>>>>>>>>> ./configure \
> >>>>>>>>>>>> --download-bison \
> >>>>>>>>>>>> --download-mpich \
> >>>>>>>>>>>> --download-mpich-configure-arguments=--disable-opencl \
> >>>>>>>>>>>> --download-hwloc \
> >>>>>>>>>>>> --download-hwloc-configure-arguments=--disable-opencl \
> >>>>>>>>>>>> --download-openblas \
> >>>>>>>>>>>> --download-openblas-make-options="'USE_THREAD=0 USE_LOCKING=1 USE_OPENMP=0'" \
> >>>>>>>>>>>> --with-shared-libraries=1 \
> >>>>>>>>>>>> --with-fortran-bindings=0 \
> >>>>>>>>>>>> --with-zlib \
> >>>>>>>>>>>> LDFLAGS=-Wl,-ld_classic
> >>>>>>>>>>>> ```
> >>>>>>>>>>>>
> >>>>>>>>>>>> The log shows that
> >>>>>>>>>>>> ```
> >>>>>>>>>>>> Exhausted all shared linker guesses. Could not determine how to
> >>>>>>>>>>>> create a shared library!
> >>>>>>>>>>>> ```
> >>>>>>>>>>>>
> >>>>>>>>>>>> I recently updated the system and Xcode, as well as homebrew.
> >>>>>>>>>>>>
> >>>>>>>>>>>> The configure.log is attached.
> >>>>>>>>>>>>
> >>>>>>>>>>>> Thanks for your attention to this matter.
> >>>>>>>>>>>>
> >>>>>>>>>>>> Best wishes,
> >>>>>>>>>>>> Zongze
> >>>>>>>>>>>> <configure.log.tar.gz>
> >>>>>
> >>>> <configure.log.gz>
> >>>
> >>
> >
