This sequence (where both x and w are used afterwards) wasn't handled.
mul.sat x, y, z
...
mov.sat w, x
We assumed that if x was used after the mov.sat, that we couldn't
propagate the saturate modifier, but in fact x was already saturated.
So ignore the live range check if the producing
To implement the unlit_centroid_workaround, previously we emitted
(+f0) pln(8) g20<1>F g16.4<0,1,0>F g4<8,8,1>F { align1 1Q };
(-f0) pln(8) g20<1>F g16.4<0,1,0>F g2<8,8,1>F { align1 1Q };
where the flag register contains the channel enable bits from g0.
Since the predicates are complementa
Although the HSW PRM shows it, the BSpec lists this workaround as being
for Ivybridge only.
total instructions in shared programs: 1994951 -> 1993675 (-0.06%)
instructions in affected programs: 27325 -> 26049 (-4.67%)
---
src/mesa/drivers/dri/i965/brw_device_info.c | 6 --
1 file changed,
And plumb them through. Also make the assert in the generator look like
the vec4 one.
---
src/mesa/drivers/dri/i965/brw_fs.h | 2 ++
src/mesa/drivers/dri/i965/brw_fs_generator.cpp | 14 --
2 files changed, 10 insertions(+), 6 deletions(-)
diff --git a/src/mesa/drivers/dri
Maybe lets us skip some PLN instructions if whole subspans are disabled?
---
src/mesa/drivers/dri/i965/brw_fs.cpp | 20 ++--
1 file changed, 14 insertions(+), 6 deletions(-)
diff --git a/src/mesa/drivers/dri/i965/brw_fs.cpp
b/src/mesa/drivers/dri/i965/brw_fs.cpp
index 1d58615..3b
This series optimizes the code emitted for the unlit centroid workaround.
Previously we emitted
mov(1) f0, g1.14 // move channel enables to flag
pln(8) dst, ... // centroid interpolation
(-f0) pln(8) dst, ... // non-centroid interpolation for unlit pixels
Patch 2 predicates the
Great, thanks Chris.
Ilia, I'll push the patches today.
Iago
On Sat, 2014-06-28 at 08:46 +1200, Chris Forbes wrote:
> Ilia,
>
> That bikeshed is already done :)
>
> I just send an r-b for v3 of patch 18, which is the last patch that lacked
> one.
>
> I think this is all good to land now.
>
On Sat, Jun 28, 2014 at 9:34 PM, Kenneth Graunke wrote:
> Apparently INTEL_DEBUG=fs has crashed on Broadwell for anything using
> ARB_fragment_program since commit 9cee3ff5. We need to NULL-check the
> right field.
>
> Signed-off-by: Kenneth Graunke
> Cc: "10.2"
> ---
> src/mesa/drivers/dri/i9
I tested RV670 in April and even fixed some bugs it had. I can't
complain about stability. It worked fine.
Marek
On Mon, Jun 30, 2014 at 1:18 AM, Dave Airlie wrote:
>> I've rewritten the core of the register merging algorithm to be able to cope
>> in the presence of (almost) arbitrarily nested l
Hello,
I am working on a project that uses EGL and defers drawing to render
modules. To maintain compatibility with Wayland and avoid being tied to X,
I have chosen OpenGL ES 3. As it is difficult for me to find information
about this API, I would rather use its desktop counterpart; however, as
sa
On 30 June 2014 01:06, Tomáš Trnka wrote:
> The register merging pass after GLSL-to-TGSI translation previously used
> simple register usage tracking routines that were unable to work inside
> loops (instead, all variables were assumed to be needed throughout the
> whole loop). This is obviously s
> I've rewritten the core of the register merging algorithm to be able to cope
> in the presence of (almost) arbitrarily nested loop and conditional
> structures. With this patch (tested on master), the Pixmark tests finally
> work just fine and some of the shadertoy demos have started working for
https://bugs.freedesktop.org/show_bug.cgi?id=80615
Alexandre Demers changed:
What|Removed |Added
Status|RESOLVED|REOPENED
Resolution|NOTABUG
The register merging pass after GLSL-to-TGSI translation previously used
simple register usage tracking routines that were unable to work inside
loops (instead, all variables were assumed to be needed throughout the
whole loop). This is obviously suboptimal for complex programs and the
lack of regi
Following is a patch fixing Mesa's inability to run several complex GLSL
shader benchmarks. I had never been able to successfully run the Pixmark
Piano and Volplosion tests from the GpuTest benchmark suite on my RV670 card.
They would always fail to render anything and report that "translation from
15 matches
Mail list logo