On Sun, Oct 11, 2015 at 7:08 AM, Marek Olšák <mar...@gmail.com> wrote:
> On Sun, Oct 11, 2015 at 4:12 AM, Connor Abbott <cwabbo...@gmail.com> wrote:
>> FWIW, this isn't quite correct with ARB_shader_precision or GL4.1 --
>> it specifies that infinities should be correctly generated through
>> division by 0, which unsafe-fp-math doesn't guarantee. At least,
>> that's assuming this is similar to the "fast" per-instruction flag
>> (http://llvm.org/docs/LangRef.html#fast-math-flags) which says "This
>> flag implies all the others."
>
> We don't use the per-instruction flags yet.
>
> Sadly, we need this flag to be able to get (1/sqrt) -> RSQ and (1/x)
> -> RCP. LLVM doesn't have standard intrinsics for those instructions,
> so we have to unwind them and rely on LLVM to combine them. Without
> it, the TGSI->LLVM conversion would produce worse code.
>
> Marek

Right, but you're also allowing transforms which GLSL explicitly
disallows. It sounds like you need more precise control than what LLVM
currently offers.

Connor
_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to