[llvm-branch-commits] [libc] Use UMAXV.4S to reduce bcmp result. (PR #99260)

2024-08-05 Thread via llvm-branch-commits
https://github.com/pcc closed https://github.com/llvm/llvm-project/pull/99260 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] [libc] Use UMAXV.4S to reduce bcmp result. (PR #99260)

2024-08-02 Thread Schrodinger ZHU Yifan via llvm-branch-commits
https://github.com/SchrodingerZhu requested changes to this pull request. See previous comment https://github.com/llvm/llvm-project/pull/99260 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailma

[llvm-branch-commits] [libc] Use UMAXV.4S to reduce bcmp result. (PR #99260)

2024-08-02 Thread Schrodinger ZHU Yifan via llvm-branch-commits
SchrodingerZhu wrote: Hi, Thank you for the patch. Unfortunately, I think the proposed change is causing failures in tests: ``` Ran 5 tests. PASS: 5 FAIL: 0 [4171/5229] Running unit test libc.test.src.stdio.snprintf_test.__unit__ FAILED: libc/test/src/stdio/CMakeFiles/libc.test.src.stdio.s

[llvm-branch-commits] [libc] Use UMAXV.4S to reduce bcmp result. (PR #99260)

2024-07-16 Thread via llvm-branch-commits
https://github.com/lntue edited https://github.com/llvm/llvm-project/pull/99260 ___ llvm-branch-commits mailing list llvm-branch-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits

[llvm-branch-commits] libc: Use UMAXV.4S to reduce bcmp result. (PR #99260)

2024-07-16 Thread via llvm-branch-commits
llvmbot wrote: @llvm/pr-subscribers-libc Author: None (pcc) Changes We can use UMAXV.4S to reduce the comparison result in a single instruction. This improves performance by roughly 4% on Apple M1: Summary bin/libc.src.string.bcmp_benchmark3 --study-name="new bcmp" --sweep-mode --sweep

[llvm-branch-commits] libc: Use UMAXV.4S to reduce bcmp result. (PR #99260)

2024-07-16 Thread via llvm-branch-commits
https://github.com/pcc created https://github.com/llvm/llvm-project/pull/99260 We can use UMAXV.4S to reduce the comparison result in a single instruction. This improves performance by roughly 4% on Apple M1: Summary bin/libc.src.string.bcmp_benchmark3 --study-name="new bcmp" --sweep-mode --s