date:20210209

Re: IA64 control speculation of loads

2021-02-09 Thread Benoît De Dinechin

Sorry, my previous answer was cut off.

The motivation for prepass global code motion is indeed that after register 
allocation, inter-block scheduling is even more restricted due to 
anti-dependencies, including those due to live-out on side exit branches. 
Global code motion is a key performance enabler especially for the non-temporal 
loads (i.e. L1 cache bypass loads), which have an exposed latency close to 20 
cycles on the current kvx cores.

The dataflow issue encountered with SEL_SCHED in prepass with control 
speculation enabled was inconsistent liveness reported by the compiler. I am 
running a test suite to reproduce it (I saw it 3 months ago).

Here is again a motivating example where I expect the scheduler to speculate 
loads from the second to the first block in the loop, which dominates it, so in 
principle SCHED_RGN should do it:

  typedef struct list_cell_ {
    struct list_cell_ *next;
    float payload;
  } list_cell_, *list_cell;

  float
  list_sum(list_cell_ *list)
  {
    float result = 0.0;
    while (list->next) {
      list = list->next;
      result += 1.0f/list->payload;
      if (!list->next) break;
      list = list->next;
      result += 1.0f/list->payload;
    }
    return result;
  }
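
To make the expected code motion concrete, here is a source-level sketch of what hoisting the second-block loads above the exit test amounts to. It is only an illustration, not what the scheduler emits: plain C cannot load through a null pointer, so the sketch assumes a hypothetical self-referential `sentinel` cell in place of the NULL terminator to stand in for the non-faulting behaviour of a speculative load. The names `sentinel`, `list_sum_ref` and `list_sum_hoisted` are made up for this example, and the loop is shown without the manual unrolling.

```c
#include <stddef.h>

typedef struct list_cell_ {
  struct list_cell_ *next;
  float payload;
} list_cell_, *list_cell;

/* Hypothetical terminator: its next field points to itself, so loading
   through it is always safe. It models a load that cannot fault.  */
static list_cell_ sentinel = { &sentinel, 1.0f };

/* Reference loop: the loads of payload and next are issued only after
   the exit test has succeeded.  */
float
list_sum_ref (list_cell_ *list)
{
  float result = 0.0f;
  while (list->next != &sentinel)
    {
      list = list->next;
      result += 1.0f / list->payload;
    }
  return result;
}

/* Hand-hoisted loop: both loads are issued before the exit test, so
   their latency overlaps with the test and the branch.  */
float
list_sum_hoisted (list_cell_ *list)
{
  float result = 0.0f;
  list_cell_ *next = list->next;
  for (;;)
    {
      float payload = next->payload;      /* speculated load */
      list_cell_ *next_next = next->next; /* speculated load */
      if (next == &sentinel)
        break;                            /* loaded values are discarded */
      result += 1.0f / payload;
      next = next_next;
    }
  return result;
}
```

Both functions return the same sum for a sentinel-terminated list; the hoisted variant simply performs one extra, unused pair of loads on the final iteration, which is the cost of control speculation without a check.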

Here is the TARGET_SCHED_SET_SCHED_FLAGS implementation, with comments that 
reflect my understanding of what to do. The commented-out line prevents 
SEL_SCHED with control speculation unless postpass (as in ia64):

  static void
  kvx_sched_set_sched_flags (struct spec_info_def *spec_info)
  {
    unsigned int *flags = &(current_sched_info->flags);
    // Speculative scheduling is enabled by non-zero spec_info->mask.
    spec_info->mask = 0;
    if (*flags & (SEL_SCHED | SCHED_RGN))
      {
        //if (!sel_sched_p () || reload_completed)
          {
            // Must do this in case of speculation.
            *flags |= USE_DEPS_LIST | DO_SPECULATION;
            // Do control speculation only.
            spec_info->mask = BEGIN_CONTROL;
            // Speculative scheduling without CHECK.
            spec_info->flags = SEL_SCHED_SPEC_DONT_CHECK_CONTROL;
            // Dump into the sched_dump.
            spec_info->dump = sched_dump;
          }
      }
  }

TARGET_SCHED_SPECULATE_INSN is implemented as follows (it should memoize and 
return 0 if the insn is already speculated with the same ts; I assume that is 
not relevant here):

  static int
  kvx_sched_speculate_insn (rtx_insn *insn, ds_t ts, rtx *new_pat)
  {
    rtx pattern = PATTERN (insn);
    if (GET_CODE (pattern) == SET)
      {
        rtx src = SET_SRC (pattern);
        if (GET_CODE (src) == MEM)
          {
            *new_pat = pattern;
            return 1;
          }
      }
    return -1;
  }
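
For reference, the memoization that kvx_sched_speculate_insn omits could look roughly like the toy model below. It is a sketch outside of GCC: `toy_insn`, `toy_ds_t`, `TOY_BEGIN_CONTROL` and `toy_speculate_insn` are hypothetical stand-ins rather than GCC types or hooks, and the return convention (-1 cannot speculate, 0 already speculated with the same ts, 1 newly speculated) is taken from the remark above, not from a verified reading of the scheduler.

```c
#include <stdbool.h>

/* Toy stand-ins for illustration only; these are not GCC definitions.  */
typedef unsigned int toy_ds_t;
#define TOY_BEGIN_CONTROL 0x1u

struct toy_insn
{
  bool is_load;           /* models a SET whose source is a MEM */
  toy_ds_t speculated_ts; /* speculation already applied, 0 if none */
};

/* Assumed convention: -1 if the insn cannot be speculated, 0 if it is
   already speculated with the requested ts, 1 if speculation was newly
   applied and recorded.  */
int
toy_speculate_insn (struct toy_insn *insn, toy_ds_t ts)
{
  if (!insn->is_load)
    return -1;
  if ((insn->speculated_ts & ts) == ts)
    return 0;                  /* memoized: nothing to change */
  insn->speculated_ts |= ts;   /* remember the request */
  return 1;
}
```

With this model, a second request for the same speculation type on the same load returns 0 instead of handing back a new pattern each time.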

And TARGET_SCHED_NEEDS_BLOCK_P always returns false.

When I compile the motivating example above for the KVX, 
kvx_sched_speculate_insn() is indeed called with reload_completed==0 (prepass) 
for the two loads of the second block, but no code motion to the first block 
happens. Generated code is the same for SCHED_RGN (default) or SEL_SCHED 
(-fselective-scheduling), up to a renaming of the registers, although SEL_SCHED 
calls kvx_sched_speculate_insn() several times for each load.

For the ia64 on the motivating example, it seems there is no prepass control 
speculation either:

  ./gcc/ia64/gcc/cc1 -fpreprocessed list_sum2.i -quiet -dumpbase list_sum2.c 
-dp -auxbase list_sum2 -O3 -version -ffast-math -o list_sum2.s -da -dp 
-msched-control-spec -msched-in-control-spec
  grep _speculative list_sum2.c.*
  list_sum2.c.298r.mach:] UNSPEC_LDS)) 24 {movsf_speculative}
  ...

I noticed that the ia64 target uses the undocumented target hooks 
TARGET_SCHED_GET_INSN_SPEC_DS and TARGET_SCHED_GET_INSN_CHECKED_DS whose code 
is actually executed on this example.

Any recommendation

Re: GSoC

2021-02-09 Thread Martin Jambor

Hello,

On Sun, Feb 07 2021, Ravi Kumar via Gcc wrote:
> Hello Sir,
> I am Ravi Kumar. I am currently a 2nd year undergraduate(B.Tech) student. I 
> want to participate in GSoC 2021 and want to work under the mentorship of 
> GCC.
> WHY ME?
> Because:
> 1. I have a proper knowledge and experience in C and C++ Language.
> 2. I have learnt to use git and GitHub.
> 3. I also have a theoretical knowledge of compilers and compiler organization.
> 4. I am ready to give 4-5 hours daily to the project.
> 5. I am familiar with GCC source code.
>
> Since I am a new contributor and I have a little knowledge regarding this so 
> I am facing lot of difficulties. But I am a good learner and I want to 
> explore more in contribution and project completion under a mentor. Since I 
> have the required skills for the project ideas that you have provided, I 
> would love to work for it. 
> Here is the project that I am opting:
> Extend the static analysis pass
>

we are delighted that you decided to apply for GSoC and that you have
chosen GCC as the organization.  If you think you need any help
selecting a particular static analyzer project or improving your
application proposal, feel free to email the mailing list again.

Thanks,

Martin
