Re: [fpc-devel] Experimentation: "Branch stitching"

2022-11-28 Thread Martin Frb via fpc-devel
On 28/11/2022 16:37, Martin Frb via fpc-devel wrote: "11.3 μop cache" Apart from the μop cache there is the normal loading into the cache. I must admit I am not sure of the exact workings, but wasn't there something like loading entire cache lines? If that is so (not sure), then of course mo

Re: [fpc-devel] Experimentation: "Branch stitching"

2022-11-28 Thread Martin Frb via fpc-devel
On 28/11/2022 16:19, J. Gareth Moreton via fpc-devel wrote: I admit I can be disorganised sometimes and lose documents, so I apologise if you have sent them already and I mislaid them in my mess of a directory tree.  Believe me though, I want to swallow all of this up if it means squeezing out

Re: [fpc-devel] Experimentation: "Branch stitching"

2022-11-28 Thread J. Gareth Moreton via fpc-devel
I admit I can be disorganised sometimes and lose documents, so I apologise if you have sent them already and I mislaid them in my mess of a directory tree.  Believe me though, I want to swallow all of this up if it means squeezing out every cycle I can out of the generated machine code! Curio

Re: [fpc-devel] Experimentation: "Branch stitching"

2022-11-28 Thread Martin Frb via fpc-devel
On 28/11/2022 14:32, J. Gareth Moreton via fpc-devel wrote: On 28/11/2022 12:59, Martin Frb via fpc-devel wrote: Well first of all, you didn't move the balign in front of .Lj732 I do move the alignment hints, but if the label becomes dead (due to the zero-distance jump being 'collapsed'), the

Re: [fpc-devel] Experimentation: "Branch stitching"

2022-11-28 Thread J. Gareth Moreton via fpc-devel
On 28/11/2022 12:59, Martin Frb via fpc-devel wrote:
On 28/11/2022 07:22, J. Gareth Moreton via fpc-devel wrote:
...
    testb   %al,%al
    je      .Lj733
    subb    $1,%al
    je      .Lj734
    jmp     .Lj732
    .balign 16,0x90
.Lj733:
    ...
    jmp     .Lj718
    .balign 16,0x90
.Lj732:

Re: [fpc-devel] Experimentation: "Branch stitching"

2022-11-28 Thread Martin Frb via fpc-devel
On 28/11/2022 07:22, J. Gareth Moreton via fpc-devel wrote:
...
    testb   %al,%al
    je      .Lj733
    subb    $1,%al
    je      .Lj734
    jmp     .Lj732
    .balign 16,0x90
.Lj733:
    ...
    jmp     .Lj718
    .balign 16,0x90
.Lj732:
    movl    $2019050530,%ecx
    call    VERBOSE_$$_INTERN