https://gcc.gnu.org/bugzilla/show_bug.cgi?id=41742
--- Comment #2 from Segher Boessenkool <segher at gcc dot gnu.org> --- With current trunk the loop code is better (uses stbu), but the unnecessary extend still is there: memset: cmpwi 0,5,0 beqlr 0 rlwinm 4,4,0,0xff mtctr 5 addi 9,3,-1 .p2align 4,,15 .L3: stbu 4,1(9) bdnz .L3 blr