Robin Dapp <[email protected]> wrote on Tue, Nov 4, 2025 at 15:36:

> > On 11/3/25 6:06 PM, KuanLin Chen wrote:
> >> I'll rename it in the next version.
> >> I'm curious why use_zero_stride_load should be 'false'. It seems to be
> >> the trigger of 'define_insn_and_split
> >> "*pred_strided_broadcast<mode>"'.
> >> I would appreciate it if you could let me know the reason.
> > To date we haven't seen a single micro-architecture where the zero
> > strided load idiom performs better than a scalar load followed by a
> > vector broadcast.
>
> At this point we might even want to reconsider the default for "generic"?
> However all the SiFive cores seem to want/prefer it.  Kito is this still
> correct?


SiFive has that optimization on some cores, such as the x280, but not on
the p470/p670. It also seems that Tenstorrent Ascalon does that
optimization? (They set it in both LLVM and GCC.)

>
> --
> Regards
>  Robin
>
>
