From: Simon Guo <wei.guo.si...@gmail.com>

There is some room to optimize memcmp() on powerpc in the following 2 cases:
(1) Even if the src/dst addresses are not 8-byte aligned at the beginning,
memcmp() can align them and proceed in .Llong comparison mode, without
falling back to .Lshort comparison mode, which compares the buffers byte by
byte.
(2) VMX instructions can be used to speed up comparisons of large sizes.
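
As a rough illustration of case (1), the idea can be sketched in portable C.
This is not the code in memcmp_64.S; the names memcmp_sketch() and
cmp_bytes() are hypothetical. The sketch assumes it is acceptable to compare
a short head byte by byte until the first pointer reaches an 8-byte boundary,
then compare 8 bytes per step, and to use the byte loop only for the head,
the tail, and for locating a mismatch.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: byte-by-byte compare, analogous to the .Lshort path. */
static int cmp_bytes(const unsigned char *a, const unsigned char *b, size_t n)
{
	for (size_t i = 0; i < n; i++) {
		if (a[i] != b[i])
			return a[i] < b[i] ? -1 : 1;
	}
	return 0;
}

/* Hypothetical sketch of "align first, then compare 8 bytes at a time". */
int memcmp_sketch(const void *s1, const void *s2, size_t n)
{
	const unsigned char *a = s1;
	const unsigned char *b = s2;

	/* Head: number of bytes until 'a' reaches an 8-byte boundary. */
	size_t head = (size_t)(-(uintptr_t)a & 7);
	if (head > n)
		head = n;

	int r = cmp_bytes(a, b, head);
	if (r)
		return r;
	a += head;
	b += head;
	n -= head;

	/* Body: 8 bytes per iteration, analogous to the .Llong path.
	 * memcpy() keeps the loads valid even if 'b' is not aligned. */
	while (n >= 8) {
		uint64_t x, y;

		memcpy(&x, a, 8);
		memcpy(&y, b, 8);
		if (x != y)
			return cmp_bytes(a, b, 8); /* locate the differing byte */
		a += 8;
		b += 8;
		n -= 8;
	}

	/* Tail: fewer than 8 bytes remain. */
	return cmp_bytes(a, b, n);
}
```

The sketch returns only the sign of the difference (-1, 0 or 1), which is all
that callers of memcmp() may rely on.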

This patch set also updates the selftest case so that it compiles.

Simon Guo (3):
  powerpc: Align bytes before fall back to .Lshort in powerpc memcmp
  powerpc: enhance memcmp() with VMX instruction for long bytes comparision
  powerpc:selftest update memcmp selftest according to kernel change

 arch/powerpc/include/asm/asm-prototypes.h          |   2 +-
 arch/powerpc/lib/copypage_power7.S                 |   2 +-
 arch/powerpc/lib/memcmp_64.S                       | 165 ++++++++++++++++++++-
 arch/powerpc/lib/memcpy_power7.S                   |   2 +-
 arch/powerpc/lib/vmx-helper.c                      |   2 +-
 .../selftests/powerpc/copyloops/asm/ppc_asm.h      |   2 +-
 .../selftests/powerpc/stringloops/asm/ppc_asm.h    |  31 ++++
 7 files changed, 197 insertions(+), 9 deletions(-)

-- 
1.8.3.1