https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117722

--- Comment #4 from JuzheZhong <juzhe.zhong at rivai dot ai> ---
(In reply to Robin Dapp from comment #3)
> First, pixel_sad_4x4 is not very hot, 8x8 and 16x16 are.
> 
> Second, we are vectorizing this, but with -mno-vector-strict-align.
> 
> IMHO we don't need to synthesize an usad pattern.

Why ? We tried implement ssad pattern, the codegen is same as Clang and better
than native codegen with -mno-vector-strict-align.
And not all RISC-V board support misalign load store.

Reply via email to