[fpc-devel] Optimisation and memory alignment question

J. Gareth Moreton via fpc-devel Sun, 28 Feb 2021 02:11:24 -0800

Hi everyone,

So to get to the point, I've spotted another potential peephole optimisation specifically on x86_64:


    movq    (%rdx),%rax
    shrq    $32,%rax

Is it acceptable to change this to the following?

    movl    4(%rdx),%eax

Logically it's equivalent thanks to the guarantee that the upper 32 bits of the destination register will be zeroed, but I know sometimes there might be a penalty for reading from memory that isn't aligned to a 16-byte boundary, say.
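As a quick sanity check of the equivalence, here is a minimal Python sketch that models both sequences on a little-endian byte buffer. It is not FPC code and says nothing about timing; the helper names are hypothetical and the buffer simply stands in for the memory at (%rdx).

```python
import struct

def load64_then_shr32(mem: bytes, addr: int = 0) -> int:
    """Model of: movq (%rdx),%rax ; shrq $32,%rax"""
    (value,) = struct.unpack_from("<Q", mem, addr)  # little-endian 64-bit load
    return value >> 32

def load32_at_offset4(mem: bytes, addr: int = 0) -> int:
    """Model of: movl 4(%rdx),%eax (upper 32 bits of %rax end up zero)"""
    (value,) = struct.unpack_from("<I", mem, addr + 4)  # little-endian 32-bit load
    return value

# Hypothetical test value standing in for the quadword at (%rdx).
sample = struct.pack("<Q", 0x1122334455667788)
print(hex(load64_then_shr32(sample)), hex(load32_at_offset4(sample)))
# prints: 0x11223344 0x11223344
```

The narrower load only touches bytes 4..7 of the original 8, so it reads a subset of what the movq would have read.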

A "movl; shrl $16" version may be possible with movzx, but I'm not certain if that will be even more inefficient due to the offset now being 2 rather than 4.
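The 16-bit variant can be modelled the same way. The sketch below also checks whether each narrowed load stays naturally aligned (address divisible by operand size). This assumes the base pointer is aligned for the original, wider load, and it treats natural alignment only as a rough proxy; actual misalignment penalties depend on the microarchitecture. All names and addresses here are hypothetical.

```python
import struct

def load32_then_shr16(mem: bytes, addr: int = 0) -> int:
    """Model of: movl (%rdx),%eax ; shrl $16,%eax"""
    (value,) = struct.unpack_from("<I", mem, addr)  # little-endian 32-bit load
    return value >> 16

def movzx16_at_offset2(mem: bytes, addr: int = 0) -> int:
    """Model of a zero-extending 16-bit load from 2(%rdx)"""
    (value,) = struct.unpack_from("<H", mem, addr + 2)  # little-endian 16-bit load
    return value

def is_naturally_aligned(addr: int, size: int) -> bool:
    """True if addr is a multiple of the operand size in bytes."""
    return addr % size == 0

sample = struct.pack("<I", 0xAABBCCDD)
print(hex(load32_then_shr16(sample)), hex(movzx16_at_offset2(sample)))
# prints: 0xaabb 0xaabb

base = 0x1000  # assumed aligned base address, for illustration only
print(is_naturally_aligned(base + 4, 4))  # 32-bit load at offset 4: True
print(is_naturally_aligned(base + 2, 2))  # 16-bit load at offset 2: True
```

Under that assumption, an offset of 2 with a 2-byte operand is no less naturally aligned than an offset of 4 with a 4-byte operand.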


Gareth aka. Kit



