[PATCH] PR target/117449: Restrict vector rotate match and split to pre-reload

2024-11-05 Thread Kyrylo Tkachov via Gcc
Hi all, the vector rotate splitter has some logic to deal with post-reload splitting, but not all cases in aarch64_emit_opt_vec_rotate are post-reload-safe. In particular, the ROTATE+XOR expansion for TARGET_SHA3 can create RTL that can later be simplified to a simple ROTATE post-reload, which would

RFC: IPA/LTO: Ordering functions for locality

2024-11-05 Thread Kyrylo Tkachov via Gcc
Hi all, I'd like to continue the discussion on teaching GCC to optimise code layout for locality between callees and callers. This is work that we've been doing at NVIDIA, primarily Prachi Godbole (CC'ed) and myself. This is a follow-up to the discussion we had at GNU Cauldron at the IPA/LTO BoF [

Re: [RFC] Enabling SVE with offloading to nvptx

2024-11-05 Thread Jakub Jelinek via Gcc
On Mon, Nov 04, 2024 at 10:21:58AM +, Andrew Stubbs wrote: > @@ -999,6 +1000,18 @@ omp_max_vf (void) > && OPTION_SET_P (flag_tree_loop_vectorize))) > return 1; > > + if (ENABLE_OFFLOADING && offload) > +{ > + for (const char *c = getenv ("OFFLOAD_TARGET_NAMES"); c;) > +

Re: feature request: a linker option to avoid merging variables from separate object files into shared cache lines

2024-11-05 Thread Mateusz Guzik via Gcc
On Tue, Nov 5, 2024 at 11:18 AM Florian Weimer wrote: > > * David Brown via Gcc: > > > I would have thought it would be better as part of the compiler. For > > each compilation unit, you generate one or more data sections > > depending on the variable initialisations, compiler options and target

Re: feature request: a linker option to avoid merging variables from separate object files into shared cache lines

2024-11-05 Thread Florian Weimer via Gcc
* David Brown via Gcc: > I would have thought it would be better as part of the compiler. For > each compilation unit, you generate one or more data sections > depending on the variable initialisations, compiler options and target > (.bss, .data, .rodata, .sbss, etc.). If the compiler has > "-al