On 10/30/24 11:21 AM, Craig Blackmore wrote:
Consider that a short term problem, at least for glibc. I've got the
magic ifunc bits which introduce vector versions and also check for
fast unaligned support. Does that change the calculus in your mind?
Yes, with those bits in place it would seem less of an obvious win.
So given this has been on my TODO list for a few months, I just posted
the first patch to start lighting up vector ifuncs for glibc. Obviously
whatever feedback I get for the first patch (memset) will be folded into
the rest of the routines.
The goal (of course) would be to get them integrated in time for the
next glibc release, which would be early 2025.
This patch may still be useful in the meantime if I removed the loop
generation parts as it would still allow us to generate vector setmem
for smaller lengths than currently allowed.
Yea, which would unblock #7 of the series. Then we could circle back
on whether or not we should let setmem loop when expanded by the
compiler?
Ok, I'll follow up with a non-loop version of this patch.
Sounds good.
Thanks
jeff