Re: [fpc-devel] Current work

2024-08-24 Thread J. Gareth Moreton via fpc-devel
Thanks Denis, I have to be slightly careful though because the example I gave with "addps" is not actually correct, because (%r8) could be unaligned (it works fine with "vaddps" though due to a paradigm change in the design of AVX compared to SSE).  In this situation a more correct example wou

Re: [fpc-devel] Current work

2024-08-23 Thread denisgolovan via fpc-devel
Nice work, Gareth. Vectorization is definitely a nice improvement for the compiler. -- Regards, Denis Golovan ___ fpc-devel maillist - fpc-devel@lists.freepascal.org https://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel

Re: [fpc-devel] Current work

2024-08-23 Thread J. Gareth Moreton via fpc-devel
To show current progress, the .section .text.n_system.math.vectors$_$tvector3d_$__$$_$plus$tvector3d$tvector3d$$tvector3d,"ax" routine ("class operator TVector3D.+(const aVector1, aVector2: TVector3D): TVector3D;") on x86_64-win64 - before (this performs a component-wise addition of two 4-compo

Re: [fpc-devel] Current work

2024-08-23 Thread J. Gareth Moreton via fpc-devel
Hi everyone, So I'm getting ready to showcase my current vector work to others.  I do have a question though... Currently the feature is locked behind "-Sv", since this is specificially "support vector processing" and the feature is still experimental and inefficient in places, but is this t

[fpc-devel] Current work

2024-08-21 Thread J. Gareth Moreton via fpc-devel
Hi everyone, Just thought I'd give a heads-up on what's happening with me and the compiler improvements.  Also, I've been busy with contract work and have just had some minor surgery, so I'm not running on all cylinders currently. * Still waiting on administrator comments and feedback on my