[Bug other/110946] 3x perf regression with -Os on M1 Pro

2023-08-08 Thread dave.rodgman at arm dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110946 --- Comment #6 from Dave Rodgman --- Under clang, we see that mbedtls_xor being inlined, or not, causes an equivalent perf difference. Note that mbedtls_xor is inline in the gcc O2 version and not in the gcc Os version. Not inline mbedtls_xor,

[Bug other/110946] 3x perf regression with -Os on M1 Pro

2023-08-08 Thread dave.rodgman at arm dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110946 --- Comment #5 from Dave Rodgman --- (In reply to Richard Biener from comment #3) > Note you shouldn't use -Os if you care about performance. GCC is quite > reasonable with code size increases at -O2 (as compared to other compilers). > Instead

[Bug other/110946] 3x perf regression with -Os on M1 Pro

2023-08-08 Thread dave.rodgman at arm dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110946 Dave Rodgman changed: What|Removed |Added Keywords|missed-optimization | Component|ipa

[Bug other/110946] 3x perf regression with -Os on M1 Pro

2023-08-08 Thread dave.rodgman at arm dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110946 --- Comment #1 from Dave Rodgman --- Disassembly under -Os: 139c : 139c: a9b67bfdstp x29, x30, [sp, #-160]! 13a0: 910003fdmov x29, sp 13a4: a9046bf9stp x25, x26, [sp,

[Bug other/110946] New: 3x perf regression with -Os on M1 Pro

2023-08-08 Thread dave.rodgman at arm dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110946 Bug ID: 110946 Summary: 3x perf regression with -Os on M1 Pro Product: gcc Version: 12.1.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: other