https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104526
Jakub Jelinek <jakub at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |aldyh at gcc dot gnu.org, | |amacleod at redhat dot com Priority|P3 |P1 Last reconfirmed| |2022-02-14 Ever confirmed|0 |1 Status|UNCONFIRMED |NEW --- Comment #2 from Jakub Jelinek <jakub at gcc dot gnu.org> --- Seems like something EVRP should optimize. The pre- r12-6924 IL was: c.0_1 = c; _2 = *c.0_1; # RANGE [-1, 1] _3 = 1 / _2; # RANGE [1, 2] NONZERO 3 d_11 = 2 >> _3; and evrp properly figured out those ranges, that 1 / int is [-1, 1] and that 2 >> [-1, 1] is [1, 2]. But since r12-6924 the IL is: c.0_1 = c; _2 = *c.0_1; _11 = (unsigned int) _2; _12 = _11 + 1; _13 = _12 <= 2; _3 = _12 <= 2 ? _2 : 0; # RANGE [0, 2] NONZERO 3 d_14 = 2 >> _3; and the range for d_14 is too broad (includes 0) and no ranges are recorded for the other SSA_NAMEs. Now, __1 and _12 are of course VARYING, and because _13 is _Bool, it is also VARYING. The important missing part is that we don't realize that _12 <= 2 ? _2 : 0 implies [-1, 1] range. The _2 + 1U <= 2U is a standard pattern how ranges are encoded. Now if I rewrite the testcase by hand to: void foo(void); static int a, b = 1, *c = &b; int main() { for (; a; a--) { int e; int ct = *c; if (ct + 1U <= 2U) e = ct; else e = 0; int d = 2 >> e; if (!d) foo(); } } which is equivalent to doing the 1 / int PR95424 optimization by hand, but instead of having it in a COND_EXPR do it in separate bbs, i.e.: c.0_1 = c; ct_12 = *c.0_1; ct.1_2 = (unsigned int) ct_12; _3 = ct.1_2 + 1; if (_3 <= 2) goto <bb 5>; [INV] else goto <bb 4>; [INV] <bb 4> : <bb 5> : # RANGE [-1, 1] # e_7 = PHI <ct_12(3), 0(4)> # RANGE [1, 2] NONZERO 3 d_15 = 2 >> e_7; then evrp handles it just fine. So, Andrew/Aldy, how hard would it be to improve ranger COND_EXPR handling, so that it essentially does what we do for the PHI cases? I.e. from the COND_EXPR condition, compute "assertion" if condition is true or if condition is false, and use that on the COND_EXPR's second and third argument. So for the _3 = _12 <= 2 ? _2 : 0; comparison, for second argument the condition must be true which implies that _2 must be there [-1, 1], while for the third argument the condition must be false, but the argument is constant 0, so range is [0, 0], then just union those 2 ranges. As this is a P1 regression, if we can fix it, would be nice to get it into GCC 12.