https://gcc.gnu.org/bugzilla/show_bug.cgi?id=32392
--- Comment #3 from Eric Gallager <egallager at gcc dot gnu.org> --- (In reply to Tobias Burnus from comment #0) > Paolo Bonzini wrote: > >> That said, there is a whole bunch of applications that would kill for > >> -mrecip, > > even for 11bit ones. Games are one of them, for sure ;) > > What about -mrecip=0/1/2 for the number of NR steps? Or would two steps be > > slower than divss? > > > > I was thinking of adding this as a follow-up patch ;) Just look how the > > operations are grouped together. > > As Richard pointed out: Having two NR does not make sense. For some cases > doing with out Newton-Raphson is enough. (Example: Games -- or SPEC CPU > 2006: http://www.hpcwire.com/hpc/1556972.html) Link is dead; archive dot org link: http://web.archive.org/web/20120528224320/http://archive.hpcwire.com/hpc/1556972.html > > Other compilers have this option, e.g. Pathscale's -OPT:rsqrt=2 [yes, this > is used for SPEC runs ;-)]