Re: [fpc-devel] Register renaming and false dependency question

Stefan Glienke via fpc-devel Sun, 17 Oct 2021 15:53:53 -0700

According to Compiler Explorer, clang, gcc and MSVC compile this to the same code with -O3 as FPC does. So I would assume that is fine.


On 17.10.2021 at 13:25, J. Gareth Moreton via fpc-devel wrote:

Hi everyone,
While reading up on some algorithms, I came across a recommendation of using a shorter arithmetic function to change the value of a constant in a register rather than loading the new value directly. However, the algorithm assumes a RISC-like processor, so I'm not sure if it applies to an Intel x86-64 processor. Consider the following:
movq    $0xaaaaaaaaaaaaaaab,%rax
imulq   %rax,%rcx
movq    $0x5555555555555555,%rax
cmpq    %rax,%rcx
setle   %al
This algorithm sets %al to 1 if %rcx is divisible by 3, and 0 if it's not, and was compiled from the following Pascal code (under -O3, but -O1 produces almost exactly the same):
function IsDivisible3(Numerator: QWord): Boolean;
begin
  Result := (Numerator * $AAAAAAAAAAAAAAAB) <= $5555555555555555;
end;
(One of my merge requests produces this code from "Result := (x mod 3) = 0")
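For readers who want to check the arithmetic behind the multiply-and-compare test, here is a minimal sketch in C (not FPC's implementation). The names INV3, MAX_QUOTIENT3, is_divisible3 and count_mismatches are made up for illustration. It relies on 3 * 0xAAAAAAAAAAAAAAAB wrapping around to 1 modulo 2^64, so multiples of 3 map onto the quotients 0..0x5555555555555555 and everything else lands above that range.

```c
#include <stdbool.h>
#include <stdint.h>

/* Multiplicative inverse of 3 modulo 2^64: 3 * INV3 wraps around to 1. */
#define INV3 UINT64_C(0xAAAAAAAAAAAAAAAB)
/* Largest possible quotient: floor((2^64 - 1) / 3). */
#define MAX_QUOTIENT3 UINT64_C(0x5555555555555555)

/* Same test as IsDivisible3 in the Pascal source, using an
   unsigned multiply (mod 2^64) and an unsigned comparison. */
bool is_divisible3(uint64_t numerator)
{
    return numerator * INV3 <= MAX_QUOTIENT3;
}

/* Hypothetical helper: counts how often the trick disagrees with the
   plain "% 3" test for the values start, start + 1, ..., start + count - 1. */
uint64_t count_mismatches(uint64_t start, uint64_t count)
{
    uint64_t mismatches = 0;
    for (uint64_t i = 0; i < count; i++) {
        uint64_t x = start + i;
        if (is_divisible3(x) != (x % 3 == 0))
            mismatches++;
    }
    return mismatches;
}
```

Calling count_mismatches over a range near 0 and another near UINT64_MAX should return 0 in both cases, which is a quick sanity check rather than a proof.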
My question is this: can "movq $0x5555555555555555,%rax" be replaced with "shrq $0x1,%rax" without incurring an additional pipeline stall? The MOV instruction takes 10 bytes to store, while "SHR 1" takes only 3. Given that %rax is used beforehand and the CMP instruction has to wait until the IMUL instruction has finished executing, logic tells me that I can get away with it here, but I'm not sure if the metric to go by is the execution speed of IMUL (i.e. the IMUL instruction is the limiting factor before CMP can be executed), or the simple fact that the previous value of %rax was used and will be loaded with $AAAAAAAAAAAAAAAB by the time it comes to load it with a new value.
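The proposed replacement works arithmetically because shifting $AAAAAAAAAAAAAAAB right by one bit yields exactly $5555555555555555. The C sketch below mirrors the instruction sequence step by step; the function name is_divisible3_shifted is made up for illustration. An optimizing C compiler will most likely fold the shift back into a constant, so this only demonstrates that the two forms compute the same result and says nothing about pipeline stalls or dependency chains.

```c
#include <stdbool.h>
#include <stdint.h>

/* Mirrors the proposed sequence: the register holding the multiplier
   is shifted right by one bit instead of being reloaded. */
bool is_divisible3_shifted(uint64_t numerator)
{
    uint64_t k = UINT64_C(0xAAAAAAAAAAAAAAAB); /* movq  $0xaaaaaaaaaaaaaaab,%rax */
    uint64_t product = numerator * k;          /* imulq %rax,%rcx */
    k >>= 1;                                   /* shrq  $0x1,%rax: k is now 0x5555555555555555 */
    return product <= k;                       /* unsigned compare of the product with k */
}
```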
Gareth aka. Kit

