Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 8 ++--
1 file changed, 2 insertions(+), 6 deletions(-)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index 75cebf427ad..2f535259f78 100644
--- a/src/amd/common/ac_llvm_build.c
+++ b/src/amd
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 23 +--
1 file changed, 9 insertions(+), 14 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index af0c3cc679d..f4d408cd587 100644
--- a/src/amd/common
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 35 ++---
1 file changed, 6 insertions(+), 29 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index a25e5388bce..af0c3cc679d 100644
--- a/src/amd/common
For consistency regarding other store operations.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 9 ++---
1 file changed, 6 insertions(+), 3 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index a7b3fdf64aa..ff29345ffe5
Use the raw version (ie. IDXEN=0) because vindex is unused.
v2: - use raw version
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 66 +
1 file changed, 42 insertions(+), 24 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd
New buffer intrinsics have a separate soffset parameter.
v2: - use the raw version as vindex is not used
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 66 ++
1 file changed, 26 insertions(+), 40 deletions(-)
diff --git a/src/amd/common
v2: do not force enable IDXEN when unecessary
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 111
src/amd/common/ac_llvm_build.h | 26
src/amd/common/ac_nir_to_llvm.c | 26 ++--
3 files changed, 142 insertions(+), 21
The struct version sets IDXEN=1, while the raw version sets IDXEN=0.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c| 48 +++
src/amd/common/ac_llvm_build.h| 37
src/amd/vulkan/radv_nir_to_llvm.c | 13 +
3
vindex is always 0.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 6 +++---
src/amd/common/ac_llvm_build.h | 1 -
src/amd/common/ac_nir_to_llvm.c | 2 --
3 files changed, 3 insertions(+), 6 deletions(-)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common
Since the driver now uses typed buffer loads, we don't have to
account for the format.
This fixes few CTS regressions on SI.
Fixes: a66b186bebf ("radv: use typed buffer loads for vertex input fetches")
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 3 +--
Hi,
This small series makes use of new LLVM 8 buffer intrinsics.
No CTS regressions on GFX8 with LLVM 7, 8 and master.
v3: use different names (ie. struct vs raw) for IDXEN
V2: fix use of IDXEN for GFX9
Please review,
Thanks!
Samuel Pitoiset (11):
ac: make use of ac_get_store_intr_attribs
v3: - always use the raw version as vindex is unused
v2: - do not force enable IDXEN when unecessary
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 8
1 file changed, 8 insertions(+)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 8 ++--
1 file changed, 2 insertions(+), 6 deletions(-)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index 0cab4d4a9b5..d21cd7e0e28 100644
--- a/src/amd/common/ac_llvm_build.c
+++ b/src/amd
vindex is always 0.
v3: use the raw version
v2: do not force enable IDXEN when unecessary
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 20
src/amd/common/ac_llvm_build.h | 9 +
src/amd/common/ac_nir_to_llvm.c | 25
New buffer intrinsics have a separate soffset parameter.
v3: - use ac_build_raw_tbuffer_store()
v2: - use the raw version as vindex is not used
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 66 ++
1 file changed, 26 insertions(+), 40
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 35 ++---
1 file changed, 6 insertions(+), 29 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index 7f63b506b93..949a7a74834 100644
--- a/src/amd/common
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 127 +
src/amd/common/ac_llvm_build.h | 29
2 files changed, 156 insertions(+)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index 1843bbbee5f
Trivial.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 9 +++--
1 file changed, 3 insertions(+), 6 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index c77547c319b..247bd20a7ea 100644
--- a/src/amd/common/ac_nir_to_llvm.c
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 23 +--
1 file changed, 9 insertions(+), 14 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index 949a7a74834..5aad5807bb5 100644
--- a/src/amd/common
Use the raw version (ie. IDXEN=0) because vindex is unused.
v2: - use raw version
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 66 +
1 file changed, 42 insertions(+), 24 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd
Similar to ac_build_buffer_load_format().
v2: - fix out of bounds access
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 100
src/amd/common/ac_llvm_build.h | 11
src/amd/common/ac_nir_to_llvm.c | 29 +++--
3 files changed, 119
For consistency regarding other store operations.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 9 ++---
1 file changed, 6 insertions(+), 3 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index 18297ed99b1..947dc359d49
in VM faults.
Cc: mesa-sta...@lists.freedesktop.org
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=107563
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 3 +--
1 file changed, 1 insertion(+), 2 deletions(-)
diff --git a/src/amd/vulkan/radv_cmd_buffer.c b/src/amd/v
This fixes a rendering issue with Hellblade and DXVK.
Fixes: a66b186bebf ("radv: use typed buffer loads for vertex input fetches")
Reported-by: Philip Rebohle
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_nir_to_llvm.c | 23 ++-
1 file changed, 14 insert
Results of my benchmarks are:
3 runs at 1080p:
GFX8: -1%
GFX9: -1.12%
3 runs at 4k:
GFX8: -2%
GFX9: -1.85%
I'm actually not sure if we want to remove it...
On 3/15/19 11:25 AM, Timothy Arceri wrote:
This was added in 8a7d4092d260 but no longer seems to have any
impact on performance.
---
Original patch by Rhys Perry.
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 10 --
src/amd/common/ac_llvm_build.h | 2 ++
2 files changed, 10 insertions(+), 2 deletions(-)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 18 ++
src/amd/common/ac_llvm_build.h | 8
2 files changed, 26 insertions(+)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index c5f1c85f269..d52f1b3f42d 100644
--- a/src
types to glsl_base_to_llvm_type
ac/nir: implement 8-bit conversions
radv: enable VK_KHR_8bit_storage
Samuel Pitoiset (3):
ac: add various int8 definitions
ac: add ac_build_tbuffer_load_byte() helper
ac: add ac_build_tbuffer_store_byte() helper
docs/features.txt | 2 +-
src
From: Rhys Perry
v2: remove 16-bit additions and rebase
Signed-off-by: Rhys Perry
---
src/amd/common/ac_nir_to_llvm.c | 3 +++
1 file changed, 3 insertions(+)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index f3e8f89ba9b..c7212ff3293 100644
--- a/src/amd/com
From: Rhys Perry
Signed-off-by: Rhys Perry
---
src/amd/common/ac_nir_to_llvm.c | 9 +++--
1 file changed, 7 insertions(+), 2 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index 34c4e2a69fa..f3e8f89ba9b 100644
--- a/src/amd/common/ac_nir_to_llvm
From: Rhys Perry
Signed-off-by: Rhys Perry
---
docs/features.txt | 2 +-
src/amd/vulkan/radv_device.c | 9 +
src/amd/vulkan/radv_extensions.py | 1 +
src/amd/vulkan/radv_shader.c | 1 +
4 files changed, 12 insertions(+), 1 deletion(-)
diff --git a/docs/feature
From: Rhys Perry
Signed-off-by: Rhys Perry
---
src/amd/common/ac_nir_to_llvm.c | 4
1 file changed, 4 insertions(+)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index c7212ff3293..f0b0c24ec08 100644
--- a/src/amd/common/ac_nir_to_llvm.c
+++ b/src/amd/comm
From: Rhys Perry
Signed-off-by: Rhys Perry
---
src/amd/common/ac_nir_to_llvm.c | 65 -
1 file changed, 55 insertions(+), 10 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c b/src/amd/common/ac_nir_to_llvm.c
index 7a03e9c15b2..34c4e2a69fa 100644
--- a/sr
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 19 +++
src/amd/common/ac_llvm_build.h | 9 +
2 files changed, 28 insertions(+)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index d52f1b3f42d..f6078634336 100644
--- a
-> 120509 (0.43 %)
v2: - fix vertex descriptors
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 32 +++
src/amd/vulkan/radv_device.c | 2 ++
src/amd/vulkan/radv_nir_to_llvm.c | 21 +++-
src/amd/vulkan/radv_private.h
This commit breaks some CTS with RADV (eg.
dEQP-VK.ssbo.phys.layout.single_basic_type.std430.bvec2) and it
introduces one compiler warning (minor stuff).
Is the Rb tag missing too?
Thanks!
On 3/19/19 5:57 AM, GitLab Mirror wrote:
Module: Mesa
Branch: master
Commit: c95afe56a8033a87dca71cc931
If an image has CMASK/FMASK or FMASK/DCC it shouldn't be needed
to sync between the two clears because they don't initialize
the same memory range.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 31 +--
1 file changed, 13 inserti
Unnecessary to have a separate function for CMASK, and calling
the radv_clear_XXX() helpers will allow us to remove some
cache flushes.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 48
src/amd/vulkan/radv_private.h| 18
The clear operation (ie. compute) doesn't use the CB caches.
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_cmd_buffer.c | 3 +--
1 file changed, 1 insertion(+), 2 deletions(-)
diff --git a/src/amd/vulkan/radv_cmd_buffer.c b/src/amd/vulkan/radv_cmd_buffer.c
index 5bb3b5
On 3/19/19 1:56 PM, Bas Nieuwenhuizen wrote:
That it does not use it is exactly why we need to make sure the CB
data is not in the CB cache by flushing it?
Why only for DCC?
On Tue, Mar 19, 2019 at 12:15 PM Samuel Pitoiset
wrote:
The clear operation (ie. compute) doesn't use the CB c
enough?
On 3/19/19 2:03 PM, Samuel Pitoiset wrote:
On 3/19/19 1:56 PM, Bas Nieuwenhuizen wrote:
That it does not use it is exactly why we need to make sure the CB
data is not in the CB cache by flushing it?
Why only for DCC?
On Tue, Mar 19, 2019 at 12:15 PM Samuel Pitoiset
wrote:
The clear
On 3/19/19 8:24 PM, Bas Nieuwenhuizen wrote:
On Wed, Mar 13, 2019 at 5:38 PM Samuel Pitoiset
wrote:
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_nir_to_llvm.c | 35 ++---
1 file changed, 6 insertions(+), 29 deletions(-)
diff --git a/src/amd/common
shaders:
SGPRS: 229584 -> 229592 (0.00 %)
VGPRS: 163268 -> 163168 (-0.06 %)
Spilled SGPRs: 8682 -> 8650 (-0.37 %)
Code Size: 12819572 -> 12707884 (-0.87 %) bytes
Max Waves: 24398 -> 24392 (-0.02 %)
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 9 -
This gives +10% FPS with Doom on my Vega56.
Signed-off-by: Samuel Pitoiset
---
src/compiler/nir/nir_opt_if.c | 87 +++
1 file changed, 87 insertions(+)
diff --git a/src/compiler/nir/nir_opt_if.c b/src/compiler/nir/nir_opt_if.c
index bc128f79f3c..47a8a65aad3
s why LLVM isn't able to improve that itself, but I think
it might good to have this optimization directly in NIR.
Please review,
Thanks!
Samuel Pitoiset (1):
nir: add a pass that removes continue blocks
src/compiler/nir/nir_opt_if.c | 87 +++
1 file c
On 3/20/19 11:47 AM, Timothy Arceri wrote:
On 20/3/19 9:41 pm, Samuel Pitoiset wrote:
28717 shaders in 14931 tests
Totals:
SGPRS: 1267317 -> 1267549 (0.02 %)
VGPRS: 896876 -> 895920 (-0.11 %)
Spilled SGPRs: 24701 -> 26367 (6.74 %)
Code Size: 48379452 -> 48507880 (0.27 %) byt
On 3/20/19 1:07 AM, Bas Nieuwenhuizen wrote:
On Tue, Mar 19, 2019 at 9:28 AM Samuel Pitoiset
wrote:
From: Rhys Perry
Signed-off-by: Rhys Perry
---
src/amd/common/ac_nir_to_llvm.c | 9 +++--
1 file changed, 7 insertions(+), 2 deletions(-)
diff --git a/src/amd/common/ac_nir_to_llvm.c
This fixes the world.
Fixes: 5f5ac19f138 ("radv: Implement VK_EXT_pipeline_creation_feedback.")"
Signed-off-by: Samuel Pitoiset
---
src/amd/vulkan/radv_pipeline.c | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/src/amd/vulkan/radv_pipeline.c b
and
the Vega merged stages.
Reviewed-by: Samuel Pitoiset
:04 04 ea08bcac9b3630e10bf333c79227bcd0ed9a894b
0924580849d9bc268e47be1248311ff3b5488c86 M src
I'm on the 'old' way compiling today:
./autogen.sh --prefix=/usr/local --with-dri-drivers=""
--with
Signed-off-by: Samuel Pitoiset
---
src/amd/common/ac_llvm_build.c | 5 -
1 file changed, 4 insertions(+), 1 deletion(-)
diff --git a/src/amd/common/ac_llvm_build.c b/src/amd/common/ac_llvm_build.c
index b5bb399eef1..2c6fecb91db 100644
--- a/src/amd/common/ac_llvm_build.c
+++ b/src/amd
Hi Benjamin,
In my opinion, your commit message is a bit confusing
I think you meant "unit" instead of "sign", right?
Besides, I'm not sure we want to add a new unit for temperature but I
could live with it.
Marek, what do you think?
See my comments, below.
On 09/03/2015 09:43 PM, Benjam
On 09/04/2015 08:57 PM, Benjamin Bellec wrote:
Currently, the temperature is displayed with a "%" symbol in
gallium/hud, which is quite odd.
Marek suggested to only change the value "100" to another value so
that this symbol is no more displayed. That works very well. This is
the patch #1. I
Kepler and I launched some piglit tests
related to those queries. No regressions.
Feel free to review,
Thanks.
Samuel.
Samuel Pitoiset (5):
nvc0: add a header file for nvc0_query
nvc0: move nvc0_so_target_save_offset() to its correct location
nvc0: move SW queries to nvc0_query_sw.c/h files
This will allow to split SW and HW queries in an upcoming patch.
While we are at it, make use of nvc0_query struct instead of pipe_query.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/Makefile.sources | 1 +
src/gallium/drivers/nouveau/nvc0/nvc0_context.h| 12
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/nvc0/nvc0_query.c | 21 -
src/gallium/drivers/nouveau/nvc0/nvc0_query.h | 3 ---
src/gallium/drivers/nouveau/nvc0/nvc0_state.c | 19 +++
3 files changed, 19 insertions(+), 24 deletions(-)
diff
Global performance counters (PCOUNTER) will be added to
nvc0_query_hw_pm.c/h files.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/Makefile.sources | 2 +
src/gallium/drivers/nouveau/nvc0/nvc0_query_hw.c | 707 +--
src/gallium/drivers/nouveau/nvc0
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/Makefile.sources |2 +
src/gallium/drivers/nouveau/nvc0/nvc0_query.c | 1122 +--
src/gallium/drivers/nouveau/nvc0/nvc0_query.h | 121 ---
src/gallium/drivers/nouveau/nvc0/nvc0_query_hw.c | 1128
Loosely based on freedreno driver.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/Makefile.sources | 2 +
src/gallium/drivers/nouveau/nvc0/nvc0_query.c| 31
src/gallium/drivers/nouveau/nvc0/nvc0_query.h| 56 +++
src/gallium/drivers/nouveau/nvc0
Reviewed-by: Samuel Pitoiset
On 10/05/2015 09:21 PM, Ilia Mirkin wrote:
I started seeing a lot of situations on nv30 where fence emission
wouldn't fit into the previous buffer (causing assertions). This ensures
that whenever checking for space, we always leave a bit of extra room
for the
Compute class value for GF110+ is 0x91c0 and not 0x92c0. This fixes
compute support and MP performance counters on GF110.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/nv_object.xml.h | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/src/gallium/drivers
This patch looks fine except that it should be a bit more normalized. I
mean, sometimes you break when PUSH_SPACE fails, sometimes not. Same for
PUSH_SPACE calls, sometimes you add it sometimes not.
Did you run a full piglit test this time ? :)
See my comment below.
On 10/10/2015 11:09 AM, Il
Does this fix those texelFetch piglit tests ? Or is it the second patch ?
Anyway, this patch is :
Reviewed-by: Samuel Pitoiset
On 10/10/2015 08:12 AM, Ilia Mirkin wrote:
Right now we emit on every kick, but this is only necessary if something
will ever be able to observe that the fence
On 10/10/2015 09:42 PM, Ilia Mirkin wrote:
On Sat, Oct 10, 2015 at 3:41 PM, Samuel Pitoiset
wrote:
This patch looks fine except that it should be a bit more normalized. I
mean, sometimes you break when PUSH_SPACE fails, sometimes not. Same for
PUSH_SPACE calls, sometimes you add it sometimes
:
- check if views[i] and views[i]->texture are not NULL
- fix use of nv50->textures_coherent
- check if vb[i].buffer is not NULL
- clear out the flag for UBO
Changes from v2:
- forgot to apply some changes for nv50 (texture/vertex bufs)
Signed-off-by: Samuel Pitoiset
---
src/gallium/d
om v2:
- fix typo
Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=75165
Signed-off-by: Samuel Pitoiset
---
src/gallium/tests/trivial/compute.c | 603
1 file changed, 330 insertions(+), 273 deletions(-)
diff --git a/src/gallium/tests/trivial/compute
Reviewed-by: Samuel Pitoiset
On 01/03/2016 01:38 AM, Ilia Mirkin wrote:
Signed-off-by: Ilia Mirkin
---
.../drivers/nouveau/codegen/nv50_ir_emit_gm107.cpp | 1 +
.../drivers/nouveau/codegen/nv50_ir_emit_nvc0.cpp | 5 -
.../drivers/nouveau/codegen/nv50_ir_from_tgsi.cpp | 23
Cc: Ilia Mirkin
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_program.h | 8
1 file changed, 4 insertions(+), 4 deletions(-)
diff --git a/src/mesa/state_tracker/st_program.h
b/src/mesa/state_tracker/st_program.h
index a8571f0..a745315 100644
--- a/src/mesa
On 01/03/2016 07:03 PM, Ilia Mirkin wrote:
Reviewed-by: Ilia Mirkin
Can you guess where I copy-pasted these from? :)
Two lines above? :-)
On Sun, Jan 3, 2016 at 12:47 PM, Samuel Pitoiset
wrote:
Cc: Ilia Mirkin
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_program.h
Did you write piglit tests for this new extension?
I don't see any tests which use MultiDrawArraysIndirectCountARB() or
MultiDrawElementsIndirectCountARB().
Are you going to submit the rest of the series for nvc0?
On 01/02/2016 09:38 PM, Ilia Mirkin wrote:
I chose to make separate macros for
On 01/03/2016 05:37 AM, Ilia Mirkin wrote:
Signed-off-by: Ilia Mirkin
---
src/gallium/auxiliary/tgsi/tgsi_build.c| 62 +
src/gallium/auxiliary/tgsi/tgsi_dump.c | 10 +--
src/gallium/auxiliary/tgsi/tgsi_parse.c| 4 +-
src/gallium/auxilia
I assume this is going to replace the old RES stuff?
On 01/03/2016 05:37 AM, Ilia Mirkin wrote:
Signed-off-by: Ilia Mirkin
---
src/gallium/auxiliary/tgsi/tgsi_dump.c | 5 +++
src/gallium/auxiliary/tgsi/tgsi_strings.c | 1 +
src/gallium/auxiliary/tgsi/tgsi_text.c | 5 +++
src/ga
On 01/04/2016 09:18 PM, Ilia Mirkin wrote:
On Mon, Jan 4, 2016 at 3:04 PM, Samuel Pitoiset
wrote:
Did you write piglit tests for this new extension?
I don't see any tests which use MultiDrawArraysIndirectCountARB() or
MultiDrawElementsIndirectCountARB().
http://patchwork.freedeskto
On 01/04/2016 09:25 PM, Ilia Mirkin wrote:
On Mon, Jan 4, 2016 at 3:13 PM, Samuel Pitoiset
wrote:
On 01/03/2016 05:37 AM, Ilia Mirkin wrote:
Signed-off-by: Ilia Mirkin
diff --git a/src/gallium/auxiliary/tgsi/tgsi_strings.c
b/src/gallium/auxiliary/tgsi/tgsi_strings.c
index fd926b3..ae30399
Trivial.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/nouveau/nvc0/nvc0_compute.c | 10 --
1 file changed, 4 insertions(+), 6 deletions(-)
diff --git a/src/gallium/drivers/nouveau/nvc0/nvc0_compute.c
b/src/gallium/drivers/nouveau/nvc0/nvc0_compute.c
index 6ce96ae..795c027
Re-binding compute constant buffers after launching a grid have no effects
because they are not currently validated and because dirty_cp is not updated
accordingly. This might also prevent weird future behaviours when UBOs will
be binded for compute.
Signed-off-by: Samuel Pitoiset
---
src
This patch fixes the following deqp tests related to compute shaders:
- max_compute_atomic_counter_buffers
- max_compute_atomic_counters
- max_compute_image_uniforms
- max_compute_uniform_components
Thanks for fixing this.
Reviewed-by: Samuel Pitoiset
On 01/20/2016 09:06 AM, Ilia Mirkin wrote
.functional.compute.basic.shared_atomic_op_single_invocation
Tested-by: Samuel Pitoiset
On 01/22/2016 01:47 AM, Jordan Justen wrote:
Fixes: dEQP-GLES31.functional.compute.basic.shared_atomic_op_multiple_groups
From: https://android.googlesource.com/platform/external/deqp
Reported-by: Ilia Mirkin
Signed-off-by
On 01/19/2016 03:30 AM, Ilia Mirkin wrote:
Signed-off-by: Ilia Mirkin
Reviewed-by: Marek Olšák
---
src/gallium/drivers/trace/tr_context.c| 38 +++
src/gallium/drivers/trace/tr_dump_state.c | 18 +++
src/gallium/drivers/trace/tr_dump_state.h | 2
Like other resources, the indirect draw buffer must be unwrapped.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/trace/tr_context.c | 15 ++-
1 file changed, 14 insertions(+), 1 deletion(-)
diff --git a/src/gallium/drivers/trace/tr_context.c
b/src/gallium/drivers/trace
Like other resources, the indirect draw buffer must be unwrapped.
Changes from v2:
- do it for indirect_params
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/trace/tr_context.c | 17 -
1 file changed, 16 insertions(+), 1 deletion(-)
diff --git a/src/gallium/drivers
Looks good know.
Reviewed-by: Samuel Pitoiset
On 01/24/2016 07:59 PM, Ilia Mirkin wrote:
Signed-off-by: Ilia Mirkin
Reviewed-by: Marek Olšák (v1)
v1 -> v2: add arg_begin/arg_end around buffer array
---
src/gallium/drivers/trace/tr_context.c| 40 +++
This introduces pipe_grid_info which contains all information to
describe a launch_grid call. This will be used to implement indirect
compute in the same fashion as indirect draw.
Signed-off-by: Samuel Pitoiset
---
src/gallium/drivers/ilo/ilo_gpgpu.c| 8 ++
src/gallium
Compute needs a new and different validation path.
Signed-off-by: Samuel Pitoiset
---
src/gallium/include/state_tracker/st_api.h | 8 +
src/mesa/state_tracker/st_atom.c | 48 --
src/mesa/state_tracker/st_atom.h | 4 ++-
src/mesa
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_extensions.c | 1 +
1 file changed, 1 insertion(+)
diff --git a/src/mesa/state_tracker/st_extensions.c
b/src/mesa/state_tracker/st_extensions.c
index 2374dcc..ec30d99 100644
--- a/src/mesa/state_tracker/st_extensions.c
+++ b/src/mesa
This state tracker implements DispatchCompute() and DispatchComputeIndirect().
Signed-off-by: Samuel Pitoiset
---
src/mesa/Makefile.sources | 2 +
src/mesa/state_tracker/st_cb_compute.c | 85 ++
src/mesa/state_tracker/st_cb_compute.h | 38
From: Ilia Mirkin
Looks like the various max's were never plumbed through.
Signed-off-by: Ilia Mirkin
Signed-off-by: Samuel Pitoiset
---
src/glsl/builtin_variables.cpp | 15 ++-
src/glsl/glsl_parser_extras.cpp | 7 +++
src/glsl/glsl_parser_extras.h| 5 +
src
This will allow to query the underlying drivers for the maximum
total storage size of all variables declared as with
PIPE_COMPUTE_CAP_MAX_LOCAL_SIZE.
Signed-off-by: Samuel Pitoiset
---
src/glsl/main.cpp| 1 +
src/mesa/main/config.h | 5 -
src/mesa/main
en if they don't *really* support compute shaders. Comments are very
welcome here.
Please review, thanks!
Ilia Mirkin (1):
mesa: make compute maximums reflect driver-provided values
Samuel Pitoiset (16):
mesa: do not use a constant for MAX_COMPUTE_SHARED_SIZE
gallium/cso: add supp
According to the spec, this also increases the following minimum values:
- MAX_COMBINED_TEXTURE_IMAGE_UNITS 96 (6*16), was 80
- MAX_UNIFORM_BUFFER_BINDINGS 72 (6*12), was 60
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_extensions.c | 24 ++--
1
Like indirect draw, we need to store a resource and an offset that
needs to be 4 byte aligned. When indirect is used, the size of the
grid (in blocks) is stored with three 32-bit integers.
Signed-off-by: Samuel Pitoiset
---
src/gallium/include/pipe/p_state.h | 12
1 file changed
This adds a new SHARED attribute to TGSI_FILE_BUFFER. Only
one shared buffer can be declared and its size is defined by
MAX_COMPUTE_SHARED_SIZE. Note that an atomic buffer can also
be shared.
Signed-off-by: Samuel Pitoiset
---
src/gallium/auxiliary/tgsi/tgsi_build.c| 1 +
src/gallium
Signed-off-by: Samuel Pitoiset
---
src/gallium/auxiliary/cso_cache/cso_context.c | 53 +++
src/gallium/auxiliary/cso_cache/cso_context.h | 6 +++
2 files changed, 59 insertions(+)
diff --git a/src/gallium/auxiliary/cso_cache/cso_context.c
b/src/gallium/auxiliary
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_atom.c| 2 +-
src/mesa/state_tracker/st_atom.h| 1 +
src/mesa/state_tracker/st_atom_shader.c | 36 +
src/mesa/state_tracker/st_cb_program.c | 28 +++
src/mesa/state_tracker/st_context.c | 1
LOCAL_INVOCATION_ID, WORK_GROUP_ID and NUM_WORK_GROUPS are respectively
mapped to THREAD_ID, BLOCK_ID and GRID_SIZE.
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_glsl_to_tgsi.cpp | 6 ++
1 file changed, 6 insertions(+)
diff --git a/src/mesa/state_tracker/st_glsl_to_tgsi.cpp
This adds GLSL intrinsics for load/store and atomic operations.
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_extensions.c | 3 +
src/mesa/state_tracker/st_glsl_to_tgsi.cpp | 95 ++
2 files changed, 98 insertions(+)
diff --git a/src/mesa
At least, one shader buffer must be available for compute shaders.
Signed-off-by: Samuel Pitoiset
---
src/gallium/include/pipe/p_state.h | 2 +-
src/mesa/state_tracker/st_extensions.c | 6 --
2 files changed, 5 insertions(+), 3 deletions(-)
diff --git a/src/gallium/include/pipe
Loosely based on tessellation shaders.
Signed-off-by: Samuel Pitoiset
---
src/gallium/auxiliary/hud/hud_context.c| 3 +++
src/gallium/auxiliary/postprocess/pp_run.c | 3 +++
src/gallium/auxiliary/util/u_blit.c| 3 +++
src/mesa/state_tracker/st_cb_bitmap.c | 3 +++
src/mesa
This binds atomics, constants, samplers, ssbos, textures and ubos.
Signed-off-by: Samuel Pitoiset
---
src/mesa/state_tracker/st_atom.c| 6
src/mesa/state_tracker/st_atom.h| 5
src/mesa/state_tracker/st_atom_atomicbuf.c | 18 +++
src/mesa
The size of shared variables needs to be stored in gl_compute_program
in order to set up pipe_compute_state::req_local_mem. According to the
spec, there are no predefined inputs nor any fixed-function outputs.
Signed-off-by: Samuel Pitoiset
---
src/mesa/main/mtypes.h | 5
Signed-off-by: Samuel Pitoiset
---
I have forgot this one...
src/gallium/drivers/trace/tr_context.c| 80 +++
src/gallium/drivers/trace/tr_dump_state.c | 51
src/gallium/drivers/trace/tr_dump_state.h | 4 ++
src/gallium/drivers/trace
801 - 900 of 5029 matches
Mail list logo