https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117016

--- Comment #5 from Matthias Kretz (Vir) <mkretz at gcc dot gnu.org> ---
Regarding working on a larger data set, you might be interested in:
https://github.com/mattkretz/vir-simd?tab=readme-ov-file#simd-execution-policy-p0350

For the problem you seem to describe, I like to have a native_simd-aligned
array of scalars and then iterate over it using native_simd. If your algorithm
allows it, the simplest epilogue is to allocate a few extra padding values (this
allocation is essentially free, because of alignment and how allocators work)
and then simply process a few more inputs and ignore the outputs from the
padding.
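
A minimal sketch of the padding approach described above, assuming
std::experimental::simd from libstdc++ (GCC 11 or later, -std=c++17). The
names padded_size, make_padded_buffer, AlignedDelete and scaled are
hypothetical helpers made up for illustration; they are not part of any
library, and the scaling operation only stands in for a real algorithm.

```cpp
#include <algorithm>
#include <cstddef>
#include <experimental/simd>
#include <memory>
#include <new>
#include <vector>

namespace stdx = std::experimental;
using V = stdx::native_simd<float>;

// Hypothetical helper: round n up to a whole number of native_simd chunks.
constexpr std::size_t padded_size(std::size_t n) {
    return (n + V::size() - 1) / V::size() * V::size();
}

// Hypothetical deleter matching the aligned operator new used below.
struct AlignedDelete {
    void operator()(float* p) const noexcept {
        ::operator delete(p, std::align_val_t(stdx::memory_alignment_v<V>));
    }
};
using AlignedBuffer = std::unique_ptr<float[], AlignedDelete>;

// Hypothetical helper: native_simd-aligned array of scalars, padded with
// zeros so that the last chunk can be loaded and stored as a full vector.
AlignedBuffer make_padded_buffer(std::size_t n) {
    const std::size_t padded = padded_size(n);
    void* raw = ::operator new(padded * sizeof(float),
                               std::align_val_t(stdx::memory_alignment_v<V>));
    float* p = static_cast<float*>(raw);
    std::uninitialized_fill_n(p, padded, 0.0f);
    return AlignedBuffer(p);
}

// Multiplies every input by factor. The loop has no scalar epilogue: it also
// processes the padding, and those outputs are simply never copied out.
std::vector<float> scaled(const std::vector<float>& in, float factor) {
    const std::size_t padded = padded_size(in.size());
    AlignedBuffer buf = make_padded_buffer(in.size());
    std::copy(in.begin(), in.end(), buf.get());

    for (std::size_t i = 0; i < padded; i += V::size()) {
        V v(buf.get() + i, stdx::vector_aligned);    // aligned full-width load
        v *= V(factor);
        v.copy_to(buf.get() + i, stdx::vector_aligned);
    }
    // Ignore the outputs that belong to the padding.
    return std::vector<float>(buf.get(), buf.get() + in.size());
}
```

Calling scaled({1.0f, 2.0f, 3.0f, 4.0f, 5.0f}, 2.0f) returns five values no
matter how wide native_simd<float> is on the target. This only works if the
algorithm tolerates processing the padding values (no traps, no reductions
that the padding would change).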
