[Bug target/98387] GCC >= 6 cannot inline _mm_cmp_ps on SSE targets

2020-12-19 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387 --- Comment #5 from Daniel Han-Chen --- (In reply to H.J. Lu from comment #4) > (In reply to Daniel Han-Chen from comment #3) > > (In reply to H.J. Lu from comment #2) > > > _mm_cmp_ps is an AVX intrinsic. > > > > Yep noticed _mm_cmp_ps is only

[Bug target/98387] GCC >= 6 cannot inline _mm_cmp_ps on SSE targets

2020-12-19 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387 --- Comment #3 from Daniel Han-Chen --- (In reply to H.J. Lu from comment #2) > _mm_cmp_ps is an AVX intrinsic. Yep noticed _mm_cmp_ps is only in AVX. The weird part is it actually causes no errors when used on SSE only targets [ie Nehalem], and

[Bug target/98387] GCC >= 6 cannot inline _mm_cmp_ps on SSE targets

2020-12-18 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387 --- Comment #1 from Daniel Han-Chen --- Oh I just noticed _mm_cmp_ps isn't actually supported for SSE targets even in Intel's Intrinsics Guide: [_mm_cmp_ps first was supported in AVX] https://software.intel.com/sites/landingpage/IntrinsicsGuide/

[Bug c++/98387] New: GCC >= 6 cannot inline _mm_cmp_ps on SSE targets

2020-12-18 Thread danielhanchen at gmail dot com via Gcc-bugs
ent: c++ Assignee: unassigned at gcc dot gnu.org Reporter: danielhanchen at gmail dot com Target Milestone: --- https://gcc.godbolt.org/z/493ead GCC since version 6.1 cannot inline _mm_cmp_ps on targets supporting only SSE (Nehalem, Tremont etc). From >= SandyBridge, ever

[Bug c++/98317] Vector Extensions aligned(1) not generating unaligned loads/stores

2020-12-18 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98317 Daniel Han-Chen changed: What|Removed |Added Resolution|--- |WORKSFORME Status|UNCONFIR

[Bug c++/98348] GCC 10.2 AVX512 Mask regression from GCC 9

2020-12-17 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98348 --- Comment #1 from Daniel Han-Chen --- I also just noticed that in GCC 10, an extra movdqa is done, which is also not necessary.

[Bug c++/98348] New: GCC 10.2 AVX512 Mask regression from GCC 9

2020-12-17 Thread danielhanchen at gmail dot com via Gcc-bugs
++ Assignee: unassigned at gcc dot gnu.org Reporter: danielhanchen at gmail dot com Target Milestone: --- In GCC 9, vector comparisons on 128 and 256bit vectors on a AVX512 machine used vpcmpeqd without any masks. In GCC 10, for 128bit and 256bit vectors, AVX512 mask

[Bug c++/98317] Vector Extensions aligned(1) not generating unaligned loads/stores

2020-12-16 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98317 --- Comment #3 from Daniel Han-Chen --- Oh ok then. It's cause I was trying to do unaligned loads by following: https://stackoverflow.com/questions/9318115/loading-data-for-gccs-vector-extensions In it, it mentioned using typedef char __attribu

[Bug c++/98317] Vector Extensions aligned(1) not generating unaligned loads/stores

2020-12-16 Thread danielhanchen at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98317 Daniel Han-Chen changed: What|Removed |Added CC||danielhanchen at gmail dot com

[Bug c++/98317] New: Vector Extensions aligned(1) not generating unaligned loads/stores

2020-12-16 Thread danielhanchen at gmail dot com via Gcc-bugs
Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: danielhanchen at gmail dot com Target Milestone: --- The ordering of aligned(1) causes GCC to generate movaps / movups. typedef float float128_tv1__attribute__ ((aligned(1

[Bug c++/96535] [10/11 Regression] GCC 10 ignoring function __attribute__ optimize for all x86 since r11-1019

2020-08-11 Thread danielhanchen at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96535 --- Comment #3 from Daniel Han-Chen --- Oh lolll I was just about to add a comment about further experimentation Seems like Jakub and Hongtao have found the root cause of the issues? Anyways what I was gonna write [probs not necessary anymore s

[Bug c++/96535] New: GCC 10 ignoring function __attribute__ optimize for all x86

2020-08-08 Thread danielhanchen at gmail dot com
Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: danielhanchen at gmail dot com Target Milestone: --- Hey GCC team! In GCC 10.x, it seems like any argument to __attribute__((optimize(...)) is ignored at the function level. GCC 9.x and