On Fri, 12 May 2023 20:41:32 GMT, Chris Hennick <d...@openjdk.org> wrote:

>> This PR improves both the worst-case performance of `nextExponential` and 
>> `nextGaussian` and the distribution of output at the tails. It fixes the 
>> following imperfections:
>> 
>> * Repeatedly adding DoubleZigguratTables.exponentialX0 to extra causes a 
>> rounding error to accumulate at the tail of the distribution (probably 
>> starting around `2*exponentialX0 == 0x1.e46eff20739afp3 ~ 15.1`); this PR 
>> fixes that by tracking the multiple of exponentialX0 as a long. (This 
>> distortion is worst when `x > 0x1.0p56` since in that case, a rounding error 
>> means `extra + x == extra`.
>> * Reduces several equations using `Math.fma`. (This will almost certainly 
>> improve performance, and may or may not improve output distribution.)
>> * Uses the newly-extracted `computeWinsorizedNextExponential` function to 
>> prevent `nextGaussian` from going into the `nextExponential` tail twice.
>
> Chris Hennick has updated the pull request incrementally with two additional 
> commits since the last revision:
> 
>  - Merge remote-tracking branch 'origin/patch-1' into patch-1
>  - Optimize: move some code out of the fast path

Latest change looks good. I noted the improvement to the comment beginning "We 
didn't use the upper part of U1 after all".

-------------

PR Comment: https://git.openjdk.org/jdk/pull/8131#issuecomment-1546292204

Reply via email to