> This change allows use of the AVX512_VBMI instruction set to further optimize
> decompression/parsing of polynomial coefficients for ML-KEM. The speedup
> gained in the ML-KEM benchmarks is 0.4 to 0.5% for key generation,
> 0.2 to 1.7% for encapsulation, and 0.3 to 2.0% for decapsulation.
> 
> Thank you to @sviswa7 and @ferakocz for their help in working through the 
> early stages of this code with me.

Shawn M Emery has updated the pull request incrementally with one additional 
commit since the last revision:

  Update to use OptoLoopAlignment for VBMILoop

-------------

Changes:
  - all: https://git.openjdk.org/jdk/pull/28815/files
  - new: https://git.openjdk.org/jdk/pull/28815/files/373b1339..f278a63f

Webrevs:
 - full: https://webrevs.openjdk.org/?repo=jdk&pr=28815&range=04
 - incr: https://webrevs.openjdk.org/?repo=jdk&pr=28815&range=03-04

  Stats: 1 line in 1 file changed: 1 ins; 0 del; 0 mod
  Patch: https://git.openjdk.org/jdk/pull/28815.diff
  Fetch: git fetch https://git.openjdk.org/jdk.git pull/28815/head:pull/28815

PR: https://git.openjdk.org/jdk/pull/28815
