https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93009
Andrew Pinski <pinskia at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Status|UNCONFIRMED |RESOLVED Resolution|--- |INVALID --- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> --- What is funny is: template <typename T1, typename T2, typename T3> __attribute__ ((__always_inline__)) inline auto FMA(T1 a, T2 b, T3 c) { return a*b+c; } It is called fma but it does not have to do the fused multiple add. For scalar types, you should use std::fma . BUT NOTE on targets where FMA does not exist, the function std::fma can be slow as the multiple is done in infinite precision.