Re: [PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-04-01 Thread Richard Biener via Gcc-patches
On Fri, Apr 1, 2022 at 9:14 AM Hongtao Liu wrote: > > On Fri, Apr 1, 2022 at 2:54 PM Richard Biener via Gcc-patches > wrote: > > > > On Fri, Apr 1, 2022 at 8:47 AM liuhongt via Gcc-patches > > wrote: > > > > > > Update in V2: > > > 1. Use get_insns instead of FOR_EACH_BB_CFUN and FOR_BB_INSNS. >

Re: [PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-04-01 Thread Hongtao Liu via Gcc-patches
On Fri, Apr 1, 2022 at 2:54 PM Richard Biener via Gcc-patches wrote: > > On Fri, Apr 1, 2022 at 8:47 AM liuhongt via Gcc-patches > wrote: > > > > Update in V2: > > 1. Use get_insns instead of FOR_EACH_BB_CFUN and FOR_BB_INSNS. > > 2. Return for any_uncondjump_p and ANY_RETURN_P. > > 3. Add dump i

Re: [PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-03-31 Thread Richard Biener via Gcc-patches
On Fri, Apr 1, 2022 at 8:47 AM liuhongt via Gcc-patches wrote: > > Update in V2: > 1. Use get_insns instead of FOR_EACH_BB_CFUN and FOR_BB_INSNS. > 2. Return for any_uncondjump_p and ANY_RETURN_P. > 3. Add dump info for spliting instruction. > 4. Restrict ix86_split_stlf_stall_load under TARGET_SS

Re: [PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-03-31 Thread Richard Biener via Gcc-patches
On Fri, Apr 1, 2022 at 8:29 AM Hongtao Liu wrote: > > On Thu, Mar 31, 2022 at 6:45 PM Richard Biener via Gcc-patches > wrote: > > > > On Thu, Mar 31, 2022 at 7:51 AM liuhongt wrote: > > > > > > Since cfg is freed before machine_reorg, just do a rough calculation > > > of the window according to

[PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-03-31 Thread liuhongt via Gcc-patches
Update in V2: 1. Use get_insns instead of FOR_EACH_BB_CFUN and FOR_BB_INSNS. 2. Return for any_uncondjump_p and ANY_RETURN_P. 3. Add dump info for spliting instruction. 4. Restrict ix86_split_stlf_stall_load under TARGET_SSE2. Since cfg is freed before machine_reorg, just do a rough calculation of

Re: [PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-03-31 Thread Hongtao Liu via Gcc-patches
On Thu, Mar 31, 2022 at 6:45 PM Richard Biener via Gcc-patches wrote: > > On Thu, Mar 31, 2022 at 7:51 AM liuhongt wrote: > > > > Since cfg is freed before machine_reorg, just do a rough calculation > > of the window according to the layout. > > Also according to an experiment on CLX, set window

Re: [PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-03-31 Thread Richard Biener via Gcc-patches
On Thu, Mar 31, 2022 at 7:51 AM liuhongt wrote: > > Since cfg is freed before machine_reorg, just do a rough calculation > of the window according to the layout. > Also according to an experiment on CLX, set window size to 64. > > Currently only handle V2DFmode load since it doesn't need any scrat

[PATCH] Split vector load from parm_del to elemental loads to avoid STLF stalls.

2022-03-30 Thread liuhongt via Gcc-patches
Since cfg is freed before machine_reorg, just do a rough calculation of the window according to the layout. Also according to an experiment on CLX, set window size to 64. Currently only handle V2DFmode load since it doesn't need any scratch registers, and it's sufficient to recover cray performanc