[Bug target/87599] New: Broadcasting scalar to vector uses stack unnecessarily on x86

2018-10-12 Thread vgatherps at gmail dot com
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: vgatherps at gmail dot com Target Milestone: --- When compiled on GCC 8.2 with -O2, typedef long long __m128i __attribute__ ((__vector_size__ (16), __may_alias__)); __m128i vectorize

[Bug target/87599] Broadcasting scalar to vector uses stack unnecessarily on x86

2018-10-12 Thread vgatherps at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87599 --- Comment #2 from vgatherps at gmail dot com --- Thanks! That fixes the optimization. However, using something like -march=haswell or -march=corei7 does not result in this optimization being made, which as far as I know -march= would imply

[Bug target/87601] New: Missed opportunity for flag reuse and macro-op fusion on x86

2018-10-12 Thread vgatherps at gmail dot com
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: vgatherps at gmail dot com Target Milestone: --- When I compile the following code with gcc 8.2 and options -O2 (or Os) and -mtune=intel (or broadwell): int sum(int *vals, int l