https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387
--- Comment #5 from Daniel Han-Chen ---
(In reply to H.J. Lu from comment #4)
> (In reply to Daniel Han-Chen from comment #3)
> > (In reply to H.J. Lu from comment #2)
> > > _mm_cmp_ps is an AVX intrinsic.
> >
> > Yep noticed _mm_cmp_ps is only
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387
--- Comment #3 from Daniel Han-Chen ---
(In reply to H.J. Lu from comment #2)
> _mm_cmp_ps is an AVX intrinsic.
Yep noticed _mm_cmp_ps is only in AVX. The weird part is it actually causes no
errors when used on SSE only targets [ie Nehalem], and
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98387
--- Comment #1 from Daniel Han-Chen ---
Oh I just noticed _mm_cmp_ps isn't actually supported for SSE targets even in
Intel's Intrinsics Guide: [_mm_cmp_ps first was supported in AVX]
https://software.intel.com/sites/landingpage/IntrinsicsGuide/
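For SSE-only targets, one workaround is to call the dedicated SSE comparison intrinsics instead of the AVX predicate form. The sketch below is illustrative only and not taken from the bug report: the helper names cmp_lt_ps, lt_mask and example are hypothetical, and it assumes an x86 target where <immintrin.h> is available.

```cpp
#include <cassert>
#include <immintrin.h>  // declares both the SSE and the AVX intrinsics

// Hypothetical helper (not from the bug report): lane-wise a < b.
static inline __m128 cmp_lt_ps(__m128 a, __m128 b) {
#if defined(__AVX__)
    // AVX targets: the generic predicate form is available.
    return _mm_cmp_ps(a, b, _CMP_LT_OS);
#else
    // SSE-only targets: use the dedicated SSE comparison instead.
    return _mm_cmplt_ps(a, b);
#endif
}

// Returns a 4-bit mask with bit i set when a[i] < b[i].
static inline int lt_mask(__m128 a, __m128 b) {
    return _mm_movemask_ps(cmp_lt_ps(a, b));
}

// Small usage example: lanes 0 and 2 compare true, lanes 1 and 3 do not.
static void example() {
    __m128 a = _mm_setr_ps(1.0f, 2.0f, 3.0f, 4.0f);
    __m128 b = _mm_setr_ps(2.0f, 2.0f, 4.0f, 4.0f);
    assert(lt_mask(a, b) == 0x5);
}
```

Both branches use the ordered, signaling less-than predicate, so the result should not depend on whether the code is built with or without -mavx.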
Component: c++
Assignee: unassigned at gcc dot gnu.org
Reporter: danielhanchen at gmail dot com
Target Milestone: ---
https://gcc.godbolt.org/z/493ead
GCC since version 6.1 cannot inline _mm_cmp_ps on targets supporting only SSE
(Nehalem, Tremont etc). From >= SandyBridge, ever
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98317
Daniel Han-Chen changed:
What|Removed |Added
Resolution|--- |WORKSFORME
Status|UNCONFIR
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98348
--- Comment #1 from Daniel Han-Chen ---
I also just noticed that in GCC 10, an extra movdqa is done, which is also not
necessary.
++
Assignee: unassigned at gcc dot gnu.org
Reporter: danielhanchen at gmail dot com
Target Milestone: ---
In GCC 9, vector comparisons on 128-bit and 256-bit vectors on an AVX512 machine used
vpcmpeqd without any masks.
In GCC 10, for 128-bit and 256-bit vectors, AVX512 mask
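For context, a vector comparison of the kind discussed can be written with GCC's vector extensions roughly as follows. This is a minimal sketch, not the reporter's test case: the type name v4si and the functions lanes_equal and example_compare are hypothetical. Which instruction the compiler picks (vpcmpeqd or a masked compare) depends on the GCC version and flags, and is not asserted here.

```cpp
#include <cassert>

// 128-bit vector of four 32-bit ints (GCC/Clang vector extension).
typedef int v4si __attribute__((vector_size(16)));

// Lane-wise equality: each result lane is -1 (all bits set) where the
// inputs are equal and 0 where they differ.
static inline v4si lanes_equal(v4si a, v4si b) {
    return a == b;
}

// Small usage example checking the per-lane results.
static void example_compare() {
    v4si a = {1, 2, 3, 4};
    v4si b = {1, 0, 3, 0};
    v4si m = lanes_equal(a, b);
    assert(m[0] == -1 && m[1] == 0 && m[2] == -1 && m[3] == 0);
}
```

Compiling this with different -march settings and inspecting the assembly is one way to see the code generation difference between compiler versions.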
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98317
--- Comment #3 from Daniel Han-Chen ---
Oh ok then.
It's cause I was trying to do unaligned loads by following:
https://stackoverflow.com/questions/9318115/loading-data-for-gccs-vector-extensions
In it, it mentioned using typedef char __attribu
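An alternative way to do an unaligned load that does not rely on alignment attributes at all is to copy the bytes with memcpy, which compilers typically lower to a single unaligned load. The sketch below is an assumption-laden illustration, not the approach from the linked answer: v4sf and load_unaligned are hypothetical names.

```cpp
#include <cassert>
#include <cstring>

// 128-bit vector of four floats (GCC/Clang vector extension).
typedef float v4sf __attribute__((vector_size(16)));

// Hypothetical helper: load four floats from an address with no
// alignment guarantee. memcpy makes no alignment assumption.
static inline v4sf load_unaligned(const void* p) {
    v4sf v;
    std::memcpy(&v, p, sizeof v);
    return v;
}

// Small usage example: read from a deliberately misaligned address.
static void example_load() {
    alignas(16) unsigned char buf[32] = {};
    const float src[4] = {1.0f, 2.0f, 3.0f, 4.0f};
    std::memcpy(buf + 1, src, sizeof src);  // offset by one byte
    v4sf v = load_unaligned(buf + 1);
    assert(v[0] == 1.0f && v[1] == 2.0f && v[2] == 3.0f && v[3] == 4.0f);
}
```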
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=98317
Daniel Han-Chen changed:
What|Removed |Added
CC||danielhanchen at gmail dot com
Priority: P3
Component: c++
Assignee: unassigned at gcc dot gnu.org
Reporter: danielhanchen at gmail dot com
Target Milestone: ---
The ordering of aligned(1) causes GCC to generate movaps / movups.
typedef float float128_tv1 __attribute__ ((aligned(1
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96535
--- Comment #3 from Daniel Han-Chen ---
Oh lolll I was just about to add a comment about further experimentation
Seems like Jakub and Hongtao have found the root cause of the issues?
Anyways what I was gonna write [probs not necessary anymore s
Priority: P3
Component: c++
Assignee: unassigned at gcc dot gnu.org
Reporter: danielhanchen at gmail dot com
Target Milestone: ---
Hey GCC team!
In GCC 10.x, it seems like any argument to __attribute__((optimize(...))) is
ignored at the function level. GCC 9.x and
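For reference, a function-level optimize attribute looks roughly like the sketch below. The function sum_to and the choice of "O3" are hypothetical and not taken from the report. Whether the attribute takes effect has to be checked in the generated assembly, since the computed result is the same either way. The GCC manual also notes that this attribute is meant for debugging rather than production code.

```cpp
#include <cassert>

// Hypothetical example: ask GCC to compile just this function at -O3,
// regardless of the command-line optimization level. Compilers that do
// not know the attribute may warn and ignore it.
__attribute__((optimize("O3")))
int sum_to(int n) {
    int s = 0;
    for (int i = 1; i <= n; ++i) {
        s += i;
    }
    return s;
}

// Small usage example: the attribute must not change the result.
static void example_optimize() {
    assert(sum_to(10) == 55);
}
```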
12 matches