https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108764
--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
        slli    a4,a2,3
        sh3add  a5,a2,a0
vs
        slli    a2,a2,3
        add     a5,a0,a2

I think the first one is really the better of the two, because you have two independent instructions that can be issued at the same time. In the second sequence the add has to wait for the result of the slli.
Really this is all core-specific, and the generic tuning should be "generic", which means this is the correct tuning ...