On Wed, 29 Oct 2025 00:01:57 GMT, Igor Rudenko <[email protected]> wrote:

> The manual byte-replication expression replaced with the multiplicative 
> expansion trick - a more compact and efficient equivalent.

I ran some benchmarks and was unable to see any significant performance 
improvements. I ran on Linux (a64/x64), macOS a64, and Windows x64. Can you 
provide benchmarks that support this is a performance improvement?


| Benchmark | Linux aarch64 | Linux x64 | MacOSX aarch64 |  Windows x64 |
| --- | --- | --- | --- | --- |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:2
 | -0.05% | -4.60% | -0.12% | -0.22% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:3
 | -0.01% | -0.17% | -1.18% | 0.04% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:4
 | -0.01% | 0.00% | -0.01% | 0.28% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:5
 | 0.35% | -0.00% | 0.04% | 0.18% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:512
 | -0.17% | -0.84% | 1.57% | -0.05% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:6
 | 0.00% | 0.02% | -0.01% | 0.36% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:64
 | -0.03% | -0.75% | -5.88% | -2.61% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:7
 | -0.11% | -0.25% | -0.02% | 0.16% |
| 
openjdk.bench.java.lang.foreign.SegmentBulkFill.nativeSegmentFillJava-ELEM_SIZE:8
 | -0.01% | -0.20% | -0.07% |  0.15% |

-------------

PR Comment: https://git.openjdk.org/jdk/pull/28031#issuecomment-3472046372

Reply via email to