ping
From: Wilco Dijkstra
Sent: 03 June 2016 11:51
To: GCC Patches
Cc: nd; philipp.toms...@theobroma-systems.com; pins...@gmail.com;
jim.wil...@linaro.org; benedikt.hu...@theobroma-systems.com; Evandro Menezes
Subject: [PATCH][AArch64] Increase code alignment
Increase loop alignment on
On 06/03/16 17:22, Evandro Menezes wrote:
On 06/03/16 05:51, Wilco Dijkstra wrote:
It looks like almost all AArch64 cores agree on an alignment of 16 for
functions, and 8 for loops and branches, so we should change
-mcpu=generic as well if there is no disagreement - feedback welcome.
I'll see what set
On Fri, Jun 3, 2016 at 3:51 AM, Wilco Dijkstra wrote:
> Increase loop alignment on Cortex cores to 8 and set function alignment to
> 16. This makes things consistent across big.LITTLE cores, improves
> performance of benchmarks with tight loops and reduces performance variations
> due to small
On 06/03/16 05:51, Wilco Dijkstra wrote:
It looks like almost all AArch64 cores agree on an alignment of 16 for functions,
and 8 for loops and branches, so we should change -mcpu=generic as well if there
is no disagreement - feedback welcome.
I'll see what sets of values Exynos M1 would be most comfor
Increase loop alignment on Cortex cores to 8 and set function alignment to 16.
This makes things consistent across big.LITTLE cores, improves performance of
benchmarks with tight loops and reduces performance variations due to small
changes in code layout. It looks like almost all AArch64 cores agre
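
To illustrate the arithmetic behind the alignment values discussed above, here is a minimal sketch of how a start address gets rounded up to a 16-byte function boundary and an 8-byte loop boundary. It is not taken from the patch: the helper names (align_up, padding) and the addresses are hypothetical and chosen only for illustration. The only values taken from the thread are 16 for functions and 8 for loops.

```python
# Illustration only: hypothetical helpers and addresses, not code from the patch.

FUNCTION_ALIGN = 16  # function alignment proposed in the thread
LOOP_ALIGN = 8       # loop alignment proposed in the thread


def align_up(addr: int, alignment: int) -> int:
    """Round addr up to the next multiple of alignment (a power of two)."""
    if alignment <= 0 or alignment & (alignment - 1):
        raise ValueError("alignment must be a positive power of two")
    return (addr + alignment - 1) & ~(alignment - 1)


def padding(addr: int, alignment: int) -> int:
    """Number of padding bytes inserted before addr to reach the alignment."""
    return align_up(addr, alignment) - addr


# Hypothetical layout: the previous function ends at 0x1004.
func_start = align_up(0x1004, FUNCTION_ALIGN)      # 0x1010
# Assume the loop head would otherwise sit 0x14 bytes into the function.
unaligned_loop = func_start + 0x14                 # 0x1024
loop_head = align_up(unaligned_loop, LOOP_ALIGN)   # 0x1028

print(hex(func_start), hex(loop_head), padding(unaligned_loop, LOOP_ALIGN))
```

In this made-up layout the loop head moves from 0x1024 to 0x1028, which costs 4 bytes of padding, i.e. one A64 instruction slot. Without a fixed loop alignment, the loop head would shift relative to an 8-byte boundary whenever earlier code grows or shrinks by a single instruction, which is the kind of layout sensitivity the patch description refers to.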