On 11/3/25 6:06 PM, KuanLin Chen wrote:
To date we haven't seen a single micro-architecture where the the zero strided load idiom performs better than a scalar load followed by a vector broadcast.I'll rename it in the next version. I'm curious why use_zero_stride_load should be 'false'. It seems to be the trigger of 'define_insn_and_split "*pred_strided_broadcast<mode>"'. I would appreciate it if you could let me know the reason.
Jeff
