FWIW the CGP sub_i_i_i opcode on the PowerBook 0x001048d4 <cgp_core+35652>: lwz r0,8(r30) 0x001048d8 <cgp_core+35656>: lwz r2,12(r30) 0x001048dc <cgp_core+35660>: lwzx r0,r27,r0 0x001048e0 <cgp_core+35664>: lwzx r2,r27,r2 0x001048e4 <cgp_core+35668>: lwz r9,4(r30) 0x001048e8 <cgp_core+35672>: subf r0,r2,r0 0x001048ec <cgp_core+35676>: stwx r0,r27,r9 0x001048f0 <cgp_core+35680>: lwzu r2,16(r30) 0x001048f4 <cgp_core+35684>: mtctr r2 0x001048f8 <cgp_core+35688>: bctr
Only slightly longer caused by the branch sequence but also quite compact. leo