I also recommend porting lp_create_builder, which might also improve things.

Marek

On Wed, Jan 10, 2018 at 10:31 PM, Samuel Pitoiset
<samuel.pitoi...@gmail.com> wrote:
> This is an optimisation that is recommended by Matt Arsenault,
> and used by RadeonSI.
>
> Signed-off-by: Samuel Pitoiset <samuel.pitoi...@gmail.com>
> ---
>  src/amd/common/ac_nir_to_llvm.c | 5 +++++
>  1 file changed, 5 insertions(+)
>
> diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
> index 1520342d05..b474c29244 100644
> --- a/src/amd/common/ac_nir_to_llvm.c
> +++ b/src/amd/common/ac_nir_to_llvm.c
> @@ -338,6 +338,11 @@ create_llvm_function(LLVMContextRef ctx, LLVMModuleRef 
> module,
>                                                      
> "amdgpu-max-work-group-size",
>                                                      max_workgroup_size);
>         }
> +
> +       LLVMAddTargetDependentFunctionAttr(main_function,
> +                                          "no-signed-zeros-fp-math",
> +                                          "true");
> +
>         if (unsafe_math) {
>                 /* These were copied from some LLVM test. */
>                 LLVMAddTargetDependentFunctionAttr(main_function,
> --
> 2.15.1
>
> _______________________________________________
> mesa-dev mailing list
> mesa-dev@lists.freedesktop.org
> https://lists.freedesktop.org/mailman/listinfo/mesa-dev
_______________________________________________
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/mesa-dev

Reply via email to