On Mon, 26 Aug 2024 12:50:53 GMT, Francesco Nigro <d...@openjdk.org> wrote:

>> Per Minborg has updated the pull request incrementally with one additional 
>> commit since the last revision:
>> 
>>   Fix typo
>
> src/java.base/share/classes/jdk/internal/foreign/AbstractMemorySegmentImpl.java
>  line 200:
> 
>> 198:             switch ((int) length) {
>> 199:                 case 0 : checkReadOnly(false); checkValidState(); 
>> break; // Explicit tests
>> 200:                 case 1 : set(JAVA_BYTE, 0, value); break;
> 
> beware using a switch, because if this code if is too big to be inlined (or 
> we're unlucky) will die due to branch-mispredict in case the different "small 
> fills" are unstable/unpredictable.
> Having a test which feed different fill sizes per each iteration + counting 
> branch misses, will reveal if the improvement is worthy even with such cases

It is true, that this is a compromise where we give up inline space, code-cache 
space, and added complexity against the prospect of better small-size 
performance. Depending on the workload, this may or may not pay off. In the 
(presumably common) case where we allocate/fill small segments of constant 
sizes, this is likely a win. Writing a dynamic performance test sounds like a 
good idea.

> test/micro/org/openjdk/bench/java/lang/foreign/TestFill.java line 86:
> 
>> 84:     @Benchmark
>> 85:     public void buffer_fill() {
>> 86:         // Hopefully, the creation of the intermediate array will be 
>> optimized away.
> 
> This maybe won't....why not making the byte array a static final?

The size of the array varies with the length of the array. It seams that escape 
analysis works here though.

-------------

PR Review Comment: https://git.openjdk.org/jdk/pull/20712#discussion_r1731249923
PR Review Comment: https://git.openjdk.org/jdk/pull/20712#discussion_r1731252199

Reply via email to