On Tue, 4 Oct 2011, Jakub Jelinek wrote: > On Tue, Oct 04, 2011 at 12:17:30PM +0200, Richard Guenther wrote: > > int *x; > > > > > void foo (int *p) > > > { > > > int * __restrict p1 = p; > > > int * __restrict p2 = p + 32; > > > int *q; > > > int i; > > x = p2; > > > q = p + 32; > > q = q - 31; > > > for (i = 0; i < 32; ++i) > > > p[i] = q[i]; > > > } > > Yes, this is valid and so is a modified version of the earlier > testcase where all accesses in the first loop are biased > (bar below, assuming y > 32 or y <= -32). > > int *x; > > void > foo (int *p) > { > int *__restrict p1 = p; > int *__restrict p2 = p + 32; > int *q; > int i; > x = p2; > q = p + 32; > q = q - 31; > for (i = 0; i < 32; ++i) > p[i] = q[i]; > } > > void > bar (int *p, int y) > { > int *__restrict p1 = p; > int *__restrict p2 = p + 32; > int *q; > int i; > for (i = 0; i < 32; ++i) > p1[i + y] = p2[i + y]; > q = (p + 32) - 31; > for (i = 0; i < 32; ++i) > p[i] = q[i]; > } > > > > > would be valid and we'd rely on CSE not to replace q = p + 32 > > with q = p2 (ignoring the fact that for a miscompile we need > > similar tricks for p1). It doesn't do that at the moment > > because we fold int * __restrict p2 = p + 32 to > > ((int * __restrict)p) + 32 and thus see > > > > p.0_4 = (int * restrict) p_2(D); > > p2_5 = p.0_4 + 128; > > > > vs. > > > > q_6 = p_2(D) + 128; > > > > but you are going to change that ;) > > But even with the "Restrict fixes" patch I've just checked in > and with the TYPE_RESTRICT check removal patch I don't see anything > wrong in the IL, the only thing that is PT (restr) is the stmt > computing p2, which is just stored into x and nothing else, and > in the second function only the first loop.
But that's by pure luck and not by design, no? Richard.