Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v14]

2025-04-09 Thread duke
On Tue, 8 Apr 2025 21:27:08 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v14]

2025-04-09 Thread Ferenc Rakoczi
On Wed, 9 Apr 2025 17:09:09 GMT, Ferenc Rakoczi wrote: >> Overall very clean and nicely done PR. Thanks a lot for considering my >> inputs. > >> Overall very clean and nicely done PR. Thanks a lot for considering my >> inputs. > > That is in no small part thanks to the reviewers, especially to

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v14]

2025-04-09 Thread Sandhya Viswanathan
On Wed, 9 Apr 2025 17:09:09 GMT, Ferenc Rakoczi wrote: >> Overall very clean and nicely done PR. Thanks a lot for considering my >> inputs. > >> Overall very clean and nicely done PR. Thanks a lot for considering my >> inputs. > > That is in no small part thanks to the reviewers, especially to

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v14]

2025-04-09 Thread Ferenc Rakoczi
On Tue, 8 Apr 2025 21:58:57 GMT, Sandhya Viswanathan wrote: > Overall very clean and nicely done PR. Thanks a lot for considering my inputs. That is in no small part thanks to the reviewers, especially to Volodymyr! @lmesnik, @jatin-bhateja, @sviswa7 would one of you /sponsor me with the integ

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v14]

2025-04-08 Thread Sandhya Viswanathan
On Tue, 8 Apr 2025 21:27:08 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v13]

2025-04-08 Thread Ferenc Rakoczi
On Sat, 5 Apr 2025 00:27:05 GMT, Sandhya Viswanathan wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Reacting to comment by Sandhya. > > src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 345: > >> 343

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v14]

2025-04-08 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v13]

2025-04-07 Thread Sandhya Viswanathan
On Wed, 2 Apr 2025 07:38:34 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v9]

2025-04-05 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v12]

2025-04-05 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v12]

2025-04-05 Thread Ferenc Rakoczi
On Tue, 1 Apr 2025 22:43:36 GMT, Sandhya Viswanathan wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Reacting to comments by Volodymyr. > > src/hotspot/cpu/x86/stubGenerator_x86_64_sha3.cpp line 359: > >> 357:

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v10]

2025-04-05 Thread Ferenc Rakoczi
On Sat, 22 Mar 2025 16:36:08 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Fix windows build > > src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 121: > >> 119: static void

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-04-05 Thread Jatin Bhateja
On Mon, 31 Mar 2025 11:11:54 GMT, Ferenc Rakoczi wrote: >> src/hotspot/cpu/x86/vm_version_x86.cpp line 1252: >> >>> 1250: // Currently we only have them for AVX512 >>> 1251: #ifdef _LP64 >>> 1252: if (supports_evex() && supports_avx512bw()) { >> >> supports_evex check looks redundant. > > T

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-04-05 Thread Volodymyr Paprotski
On Sat, 22 Mar 2025 20:02:31 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v13]

2025-04-04 Thread Sandhya Viswanathan
On Wed, 2 Apr 2025 07:38:34 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v5]

2025-04-04 Thread Ferenc Rakoczi
On Thu, 6 Mar 2025 19:26:14 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Accepted review comments. > > src/hotspot/cpu/x86/stubGenerator_x86_64_sha3.cpp line 409: > >> 407: __ evmo

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v13]

2025-04-02 Thread Jatin Bhateja
On Wed, 2 Apr 2025 07:38:34 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-04-02 Thread Ferenc Rakoczi
On Thu, 27 Mar 2025 21:42:08 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with two >> additional commits since the last revision: >> >> - Further readability improvements. >> - Added asserts for array sizes > > src/hotspot/cpu/x86/stubGenerator

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v13]

2025-04-02 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v12]

2025-04-01 Thread Sandhya Viswanathan
On Mon, 31 Mar 2025 14:40:56 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v12]

2025-04-01 Thread Volodymyr Paprotski
On Mon, 31 Mar 2025 14:40:56 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-31 Thread Ferenc Rakoczi
On Sat, 22 Mar 2025 16:11:02 GMT, Volodymyr Paprotski wrote: >> These functions will not be used anywhere else and in ML_DSA.java all of the >> arrays passed to inrinsics are of the correct size. > > Works for me; just thought I would point it out, so its a 'premeditated' > decision. Well, I

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v10]

2025-03-31 Thread Ferenc Rakoczi
On Sun, 23 Mar 2025 00:26:20 GMT, Volodymyr Paprotski wrote: >> src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 216: >> >>> 214: // Zmm8-Zmm23 used as scratch registers >>> 215: // result goes to Zmm0-Zmm7 >>> 216: static void montMulByConst128(MacroAssembler *_masm) { >> >> wish t

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-31 Thread Ferenc Rakoczi
On Mon, 17 Mar 2025 19:22:41 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Made the intrinsics test separate from the pure java test. > > src/hotspot/cpu/x86/stubGenerator_x86_64_dilit

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-31 Thread Ferenc Rakoczi
On Mon, 24 Mar 2025 15:16:20 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with two >> additional commits since the last revision: >> >> - Further readability improvements. >> - Added asserts for array sizes > > I still need to have a look at th

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-31 Thread Ferenc Rakoczi
On Sun, 23 Mar 2025 00:21:18 GMT, Volodymyr Paprotski wrote: >> src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 119: >> >>> 117: static address dilithiumAvx512PermsAddr() { >>> 118: return (address) dilithiumAvx512Perms; >>> 119: } >> >> Hear me out.. ... >> enums!! >> >> enum

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-31 Thread Ferenc Rakoczi
On Mon, 24 Mar 2025 02:38:37 GMT, Jatin Bhateja wrote: >> Ferenc Rakoczi has updated the pull request incrementally with two >> additional commits since the last revision: >> >> - Further readability improvements. >> - Added asserts for array sizes > > src/hotspot/cpu/x86/vm_version_x86.cpp l

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v10]

2025-03-24 Thread Volodymyr Paprotski
On Sat, 22 Mar 2025 16:45:31 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Fix windows build > > src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 216: > >> 214: // Zmm8-Zmm

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-24 Thread Volodymyr Paprotski
On Sat, 22 Mar 2025 20:02:31 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-24 Thread Volodymyr Paprotski
On Sat, 22 Mar 2025 20:38:19 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with two >> additional commits since the last revision: >> >> - Further readability improvements. >> - Added asserts for array sizes > > src/hotspot/cpu/x86/stubGenerator

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-23 Thread Jatin Bhateja
On Sat, 22 Mar 2025 20:02:31 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-22 Thread Volodymyr Paprotski
On Thu, 20 Mar 2025 21:06:30 GMT, Ferenc Rakoczi wrote: >> src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 58: >> >>> 56: >>> 57: ATTRIBUTE_ALIGNED(64) static const uint32_t dilithiumAvx512Perms[] = { >>> 58: // collect montmul results into the destination register >> >> same

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-22 Thread Volodymyr Paprotski
On Sat, 22 Mar 2025 20:02:31 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v10]

2025-03-22 Thread Volodymyr Paprotski
On Thu, 20 Mar 2025 20:37:25 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v11]

2025-03-22 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with two additional commits since the last revisio

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v5]

2025-03-20 Thread Ferenc Rakoczi
On Thu, 6 Mar 2025 19:27:12 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Accepted review comments. > > src/hotspot/cpu/x86/stubGenerator_x86_64_sha3.cpp line 426: > >> 424: __ subl

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-20 Thread Ferenc Rakoczi
On Mon, 17 Mar 2025 19:24:52 GMT, Volodymyr Paprotski wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Made the intrinsics test separate from the pure java test. > > src/hotspot/cpu/x86/stubGenerator_x86_64_dilit

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v10]

2025-03-20 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v8]

2025-03-20 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v5]

2025-03-17 Thread Volodymyr Paprotski
On Thu, 6 Mar 2025 17:37:33 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-17 Thread Volodymyr Paprotski
On Wed, 12 Mar 2025 19:19:08 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-17 Thread Leonid Mesnik
On Wed, 12 Mar 2025 19:19:08 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v6]

2025-03-17 Thread Ferenc Rakoczi
On Wed, 12 Mar 2025 15:34:18 GMT, Leonid Mesnik wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Added validity test for the intrinsics. > > test/jdk/sun/security/provider/acvp/Launcher.java line 43: > >> 41: *

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v4]

2025-03-15 Thread Jatin Bhateja
On Wed, 5 Mar 2025 13:10:34 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v7]

2025-03-12 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v6]

2025-03-12 Thread Leonid Mesnik
On Wed, 12 Mar 2025 13:42:33 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request i

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v4]

2025-03-12 Thread Ferenc Rakoczi
On Thu, 6 Mar 2025 14:30:35 GMT, Jatin Bhateja wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Added alignment to loop entries. > > src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 2: > >> 1: /* >> 2:

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-12 Thread Ferenc Rakoczi
On Thu, 6 Mar 2025 09:32:19 GMT, Jatin Bhateja wrote: >> I think the easiest is to put a for (int i = 0; i < 1000; i++) loop around >> the switch statement in the run() method of the ML_DSA_Test class >> (test/jdk/sun/security/provider/acvp/ML_DSA_Test.java). (This is because the >> intrinsi

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v5]

2025-03-12 Thread Ferenc Rakoczi
On Mon, 10 Mar 2025 03:00:09 GMT, Leonid Mesnik wrote: > There are no any new tests in the PR. How fix has been tested by openjdk > tests? I have just added one. - PR Comment: https://git.openjdk.org/jdk/pull/23860#issuecomment-2717950685

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v6]

2025-03-12 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-11 Thread Jatin Bhateja
On Wed, 5 Mar 2025 18:27:44 GMT, Ferenc Rakoczi wrote: >> Hi @ferakocz , >> >> Thanks!, for efficient utilization of Decode ICache (please refer to Intel >> SDM section 3.4.2.5), code blocks should be aligned to 32-byte boundaries; a >> 64-byte aligned code is a superset of both 16 and 32 byt

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v5]

2025-03-09 Thread Leonid Mesnik
On Thu, 6 Mar 2025 17:37:33 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v4]

2025-03-06 Thread Jatin Bhateja
On Wed, 5 Mar 2025 13:10:34 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v5]

2025-03-06 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v4]

2025-03-06 Thread Ferenc Rakoczi
On Thu, 6 Mar 2025 08:37:57 GMT, Jatin Bhateja wrote: >> Ferenc Rakoczi has updated the pull request incrementally with one >> additional commit since the last revision: >> >> Added alignment to loop entries. > > src/hotspot/cpu/x86/stubGenerator_x86_64_sha3.cpp line 85: > >> 83: if (UseSH

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-05 Thread Ferenc Rakoczi
On Wed, 5 Mar 2025 14:03:00 GMT, Jatin Bhateja wrote: >> Hi, @jatin-bhateja, thanks for the suggestion. I have added __ >> align(OptoLoopAlignment); before all loop entries. > > Hi @ferakocz , > > Thanks!, for efficient utilization of Decode ICache (please refer to Intel > SDM section 3.4.2.5

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-05 Thread Jatin Bhateja
On Wed, 5 Mar 2025 11:33:06 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request wi

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-05 Thread Jatin Bhateja
On Wed, 5 Mar 2025 13:07:54 GMT, Ferenc Rakoczi wrote: >> src/hotspot/cpu/x86/stubGenerator_x86_64_dilithium.cpp line 292: >> >>> 290: __ movl(iterations, 2); >>> 291: >>> 292: __ BIND(L_loop); >> >> Hi @ferakocz , Kindly align loop entry address using __align64() here and at >> all the p

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-05 Thread Ferenc Rakoczi
On Wed, 5 Mar 2025 11:39:05 GMT, Jatin Bhateja wrote: >> Ferenc Rakoczi has updated the pull request with a new target base due to a >> merge or a rebase. The pull request now contains three commits: >> >> - Merged master. >> - Added comments, removed debugging printfs >> - JDK-8351034 Add A

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v4]

2025-03-05 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v2]

2025-03-05 Thread Jatin Bhateja
On Mon, 3 Mar 2025 19:00:59 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v3]

2025-03-05 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request with a new target base due to a merge or a rebase. The pull requ

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v2]

2025-03-04 Thread Mark Powers
On Mon, 3 Mar 2025 19:00:59 GMT, Ferenc Rakoczi wrote: >> By using the AVX-512 vector registers the speed of the computation of the >> ML-DSA algorithms (key generation, document signing, signature verification) >> can be approximately doubled. > > Ferenc Rakoczi has updated the pull request in

Re: RFR: 8351034: Add AVX-512 intrinsics for ML-DSA [v2]

2025-03-03 Thread Ferenc Rakoczi
> By using the AVX-512 vector registers the speed of the computation of the > ML-DSA algorithms (key generation, document signing, signature verification) > can be approximately doubled. Ferenc Rakoczi has updated the pull request incrementally with one additional commit since the last revision

RFR: 8351034: Add AVX-512 intrinsics for ML-DSA

2025-03-03 Thread Ferenc Rakoczi
By using the AVX-512 vector registers the speed of the computation of the ML-DSA algorithms (key generation, document signing, signature verification) can be approximately doubled. - Commit messages: - JDK-8351034 Add AVX-512 intrinsics for ML-DSA Changes: https://git.openjdk.org/