https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102622
Aldy Hernandez <aldyh at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |amacleod at redhat dot com, | |jakub at gcc dot gnu.org, | |rguenth at gcc dot gnu.org --- Comment #12 from Aldy Hernandez <aldyh at gcc dot gnu.org> --- I had to download the Intel SDE to reproduce it, but was finally able to narrow it down to 4 jump threading paths: ./xg++ -B./ a.c -O3 -L/usr/lib/gcc/x86_64-redhat-linux/11/ -march=skylake-avx512 -fdump-tree-all-details -fdisable-tree-ethread -fdisable-tree-thread1 -fdisable-tree-thread2 -fdisable-tree-thread3 -fdisable-tree-thread4 -fdbg-cnt=registered_jump_thread:2-4:19-20 $ grep 'thread' a.c.* |grep Registering a.c.113t.vrp-thread1: [2] Registering jump thread: (9, 11) incoming edge; (11, 10) normal; a.c.113t.vrp-thread1: [4] Registering jump thread: (5, 7) incoming edge; (7, 8) normal; a.c.197t.vrp-thread2: [19] Registering jump thread: (15, 16) incoming edge; (16, 18) normal; a.c.197t.vrp-thread2: [20] Registering jump thread: (21, 16) incoming edge; (16, 18) normal; Things start getting challenging after the vectorizer and cunroll run. Such that by vrp-thread2, ranger figures out that the 2->3 edge is unreachable, _21 must be 0, and shit rolls downhill from there: unsigned long ivtmp.31; short int * vectp_arr_32.21; vector(8) short int * vectp_arr_32.20; unsigned short tmp.19; short int tmp.18; int D.4484; bool var_22_lsm_flag.15; int var_22_lsm.14; int D.4481; bool var_20_lsm_flag.13; int var_20_lsm.12; unsigned int f; int e; short int d; short int _2; unsigned int _3; bool _4; long long int _6; int _7; bool _21; unsigned short ivtmp_24; int _30(D); bool _33; unsigned short ivtmp_41; int _43(D); short int * _55; <signed-boolean:1> _64; int _66; unsigned long _74; <signed-boolean:1> _75; vector(8) <signed-boolean:1> vect_cst__76; void * _81; <bb 2> [local count: 8685306]: _2 = (short int) b_15(D); _3 = (unsigned int) _2; _4 = _3 != 0; _33 = _3 < b_15(D); _21 = _4 | _33; _64 = (<signed-boolean:1>) _21; _75 = -_64; vect_cst__76 = {_75, _75, _75, _75, _75, _75, _75, _75}; if (_75 == 0) goto <bb 4>; [100.00%] else goto <bb 3>; [20.00%] <bb 3> [local count: 1737061]: MEM <vector(8) short int> [(short int *)&arr_32] = { 0, 0, 0, 0, 0, 0, 0, 0 }; <bb 4> [local count: 8685306]: if (_21 != 0) goto <bb 5>; [50.00%] else goto <bb 6>; [50.00%] <bb 5> [local count: 2171327]: arr_32[8] = 0; <bb 6> [local count: 4342653]: if (_21 != 0) goto <bb 7>; [50.00%] else goto <bb 8>; [50.00%] [snip] [snip] The reason ranger concludes that 2->3 is unreachable is from analyzing block 2: <bb 2> [local count: 8685306]: _2 = (short int) b_15(D); _3 = (unsigned int) _2; _4 = _3 != 0; _33 = _3 < b_15(D); _21 = _4 | _33; _64 = (<signed-boolean:1>) _21; _75 = -_64; vect_cst__76 = {_75, _75, _75, _75, _75, _75, _75, _75}; if (_75 == 0) goto <bb 4>; [100.00%] else goto <bb 3>; [20.00%] On the 2->3 edge, _75 == -1 because this is a 1-bit signed integer. Solving back we have: _75 = - _64; ==> [-1,-1] = -_64; -1 for a 1-bit signed integer is TYPE_MIN_VALUE, and NEG(TYPE_MIN_VALUE) is unrepresentable. So, _75 cannot be -1, thus the edge is unexecutable. Questions: a) Is -(-1) representable in 1-bit signed? b) Could we somehow avoid creating the 1-bit signed in the vectorizer, since they are a source of endless exception? Thanks.