https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
Andrew Pinski changed:
What|Removed |Added
Status|NEW |RESOLVED
Target Milestone|---
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #20 from Hongtao.liu ---
Fixed in GCC12, now gcc generate optimal codes.
main:
.LFB532:
.cfi_startproc
subq$8, %rsp
.cfi_def_cfa_offset 16
movaps .LC0(%rip), %xmm0
callprintv
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #19 from CVS Commits ---
The master branch has been updated by hongtao Liu :
https://gcc.gnu.org/g:0fa4787bf34b173ce6f198e99b6f6dd8a3f98014
commit r12-3177-g0fa4787bf34b173ce6f198e99b6f6dd8a3f98014
Author: liuhongt
Date: Fri Dec
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #18 from Hongtao.liu ---
(In reply to Marc Glisse from comment #17)
> (In reply to Hongtao.liu from comment #15)
> > The issue can also be solved by folding __builtin_ia32_shufps to gimple
> > VEC_PERM_EXPR,
>
> Didn't you post a pat
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #17 from Marc Glisse ---
(In reply to Hongtao.liu from comment #15)
> The issue can also be solved by folding __builtin_ia32_shufps to gimple
> VEC_PERM_EXPR,
Didn't you post a patch to do that last year? What happened to it?
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #16 from Andrew Pinski ---
(In reply to Hongtao.liu from comment #15)
> > I think pass_combine should be extended to force illegitimate constant
> > to constant pool and recog load insn again, It looks like a general
> > optimization
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #15 from Hongtao.liu ---
> I think pass_combine should be extended to force illegitimate constant
> to constant pool and recog load insn again, It looks like a general
> optimization that better not do it in the backend.
The issue c
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
H.J. Lu changed:
What|Removed |Added
Assignee|hjl.tools at gmail dot com |unassigned at gcc dot
gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #10 from Marc Glisse ---
Author: glisse
Date: Mon May 20 14:53:29 2019
New Revision: 271422
URL: https://gcc.gnu.org/viewcvs?rev=271422&root=gcc&view=rev
Log:
[i386] Fold __builtin_ia32_shufpd to VEC_PERM_EXPR
2019-05-20 Marc Gliss
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
Allan Jensen changed:
What|Removed |Added
CC||linux at carewolf dot com
--- Comment #9
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #8 from Marc Glisse ---
Created attachment 45306
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=45306&action=edit
ix86_gimple_fold_builtin patch
Like this then?
I realized (because of the testsuite) that we do not currently va
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
Jakub Jelinek changed:
What|Removed |Added
CC||jakub at gcc dot gnu.org
--- Comment #7
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #6 from Marc Glisse ---
Created attachment 45303
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=45303&action=edit
example patch (untested)
Making the meaning of shuffles visible in GIMPLE could help a bit (although it
wouldn't
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
Marc Glisse changed:
What|Removed |Added
CC||glisse at gcc dot gnu.org
--- Comment #5 fr
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
Marc Glisse changed:
What|Removed |Added
CC||marc.glisse at normalesup
--- Comment #3 from pinskia at gcc dot gnu dot org 2010-02-23 01:42 ---
Confirmed.
--
pinskia at gcc dot gnu dot org changed:
What|Removed |Added
Status|UNCON
--- Comment #2 from pinskia at gcc dot gnu dot org 2010-02-23 01:42 ---
I think that is because nothing simplifies:
(vec_select:V4SF (vec_concat:V8SF (vec_select:V4SF (vec_concat:V8SF
(reg:V4SF 62)
(reg:V4SF 62))
(parallel [
--- Comment #1 from liranuna at gmail dot com 2010-02-23 01:37 ---
It appears I am missing a line in the code I posted:
#include
extern void printv(__m128 m);
int main()
{
__m128 m = _mm_set_ps(1.0f, 2.0f, 3.0f, 4.0f);
m = _mm_shuffle_ps(m, m, 0xC9); // Those two shuf
18 matches
Mail list logo