https://gcc.gnu.org/bugzilla/show_bug.cgi?id=32392

--- Comment #3 from Eric Gallager <egallager at gcc dot gnu.org> ---
(In reply to Tobias Burnus from comment #0)
> Paolo Bonzini wrote:
> >> That said, there is a whole bunch of applications that would kill for 
> >> -mrecip, 
> > even for 11bit ones. Games are one of them, for sure ;)
> > What about -mrecip=0/1/2 for the number of NR steps? Or would two steps be 
> > slower than divss?
> >
> > I was thinking of adding this as a follow-up patch ;) Just look how the 
> > operations are grouped together.
> 
> As Richard pointed out: Having two NR steps does not make sense. For some cases
> doing without Newton-Raphson is enough. (Example: Games -- or SPEC CPU
> 2006: http://www.hpcwire.com/hpc/1556972.html)

That link is dead; here is an archive.org copy:
http://web.archive.org/web/20120528224320/http://archive.hpcwire.com/hpc/1556972.html

> 
> Other compilers have this option, e.g. Pathscale's -OPT:rsqrt=2 [yes, this
> is used for SPEC runs ;-)]
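
For readers unfamiliar with the "NR steps" discussed in the quoted text, here
is a minimal sketch in C of refining a low-precision reciprocal estimate with
Newton-Raphson iterations. It is not GCC's implementation. The names
approx_recip, recip_nr and rel_err are hypothetical, and approx_recip only
imitates a hardware estimate instruction by keeping about 12 mantissa bits of
the exact result (an assumption chosen for illustration).

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a hardware reciprocal estimate:
 * computes 1/x exactly, then clears the low 11 of the 23 mantissa
 * bits so that only about 12 bits of precision remain. */
static float approx_recip(float x)
{
    float r = 1.0f / x;
    uint32_t bits;
    memcpy(&bits, &r, sizeof bits);
    bits &= 0xFFFFF800u;
    memcpy(&r, &bits, sizeof r);
    return r;
}

/* Refine the estimate with `steps` Newton-Raphson iterations for
 * f(r) = 1/r - x, i.e. r' = r * (2 - x * r).
 * steps == 0 returns the raw estimate. */
static float recip_nr(float x, int steps)
{
    float r = approx_recip(x);
    for (int i = 0; i < steps; i++)
        r = r * (2.0f - x * r);
    return r;
}

/* Relative error of the refined reciprocal, measured as |x * r - 1|. */
static double rel_err(float x, int steps)
{
    double d = (double)recip_nr(x, steps) * (double)x - 1.0;
    return d < 0.0 ? -d : d;
}
```

Each iteration roughly squares the relative error, so one step already brings
a 12-bit estimate close to single precision. That is consistent with the
remark above that a second step buys little, and with the point that some
applications can use the raw estimate (zero steps) as is.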
