3] Patch series for TRN Intrinsics

Alan Lawrence Fri, 28 Mar 2014 08:29:58 -0700

Much like the ZIP and UZP intrinsics, the vtrn[q]_* intrinsics are implementedwith inline __asm__, which blocks compiler analysis. This series replaces thosecalls with __builtin_shuffle, which produce the same** assembler instructions.

** except for two-element vectors, where UZP, ZIP and TRN are all equivalent andthe backend chooses to output ZIP.


The first patch adds a bunch of tests, passing for the current asm 
implementation;
the second patch reimplements with __builtin_shuffle;

the third patch adds equivalent ARM tests using test bodies shared from thefirst patch.


OK for stage 1?

Cheers, Alan

[AArch64/ARM 0/3] Patch series for TRN Intrinsics

Reply via email to