https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114995
--- Comment #9 from Aldy Hernandez <aldyh at gcc dot gnu.org> --- (In reply to Jakub Jelinek from comment #7) > The above examples just show misunderstanding what __builtin_assume_aligned > is and what it is not. You need to use the result of the built-in function > in the accesses to be able to use the alignment information, if you just try > to compare __builtin_assume_aligned (x, 32) == x, it will just fold as > always true. The design of the builtin is to attach the alignment > information to the result of the builtin function only. > > CCing Aldy/Andrew for whether prange can or could be taught to handle the > assume cases with uintptr_t and bitwise and + comparison. All the pieces are there to make it work, both with the assume aligned and with the uintptr_t case. And we could probably get it all without prange. For example: #include <cstdint> void foo (const float *); void bar1 (const float *array) { [[assume(array != nullptr)]]; const float *aligned = (const float *) __builtin_assume_aligned (array, 32); foo (aligned); } The __builtin_assume_aligned hasn't been expanded by evrp, so we should be able to add a range-op entry for it. This is what evrp sees: void bar1 (const float * array) { const float * aligned; <bb 2> : aligned_2 = __builtin_assume_aligned (array_1(D), 32); foo (aligned_2); return; } All we need is a range-op implementation for builtin_assume_aligned. The attached crude implementation does it. =========== BB 2 ============ <bb 2> : aligned_2 = __builtin_assume_aligned (array_1(D), 32); foo (aligned_2); return; aligned_2 : [prange] const float * [0, +INF] MASK 0xffffffff00000000 VALUE 0x0 That is, the bottom 32 bits are cleared. Andrew will have to comment on the uintptr_t idiom, because it gets expanded into an .ASSUME() function which I'm unfamiliar with. For this small function: void bar2 (const float *array) { [[assume((uintptr_t (array) & (32 - 1)) == 0)]]; foo (array); } evrp expands to: =========== BB 2 ============ Partial equiv (array.0_3 pe64 array_2(D)) <bb 2> : array.0_3 = (long unsigned int) array_2(D); _4 = array.0_3 & 31; _5 = _4 == 0; return _5; _4 : [irange] long unsigned int [0, 31] MASK 0x1f VALUE 0x0 I don't see any reason why we couldn't get that array.0_3 and array_2 are aligned to 32-bits. Maybe we don't set the value/mask pair for the bitwise_and::op1_range? The value/mask stuff is not very fleshed out, especially for the op1_range operators.