From: "Juan A. Suarez Romero" <jasua...@igalia.com> Take into account offset values less than a full register (32 bytes) when getting the var from register.
This is required when dealing with an operation that writes half of the register (like one d2x in IVB/BYT, which uses exec_size == 4). v2: - Take in account this offset < 32 in liveness analysis too (Curro) v3: - Change formula in var_from_reg() (Curro) - Remove useless changes (Curro) Signed-off-by: Samuel Iglesias Gonsálvez <sigles...@igalia.com> --- src/intel/compiler/brw_vec4_live_variables.h | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/src/intel/compiler/brw_vec4_live_variables.h b/src/intel/compiler/brw_vec4_live_variables.h index 8807c453743..04bdf8cc26f 100644 --- a/src/intel/compiler/brw_vec4_live_variables.h +++ b/src/intel/compiler/brw_vec4_live_variables.h @@ -88,7 +88,7 @@ var_from_reg(const simple_allocator &alloc, const src_reg ®, assert(reg.file == VGRF && reg.nr < alloc.count && c < 4); const unsigned csize = DIV_ROUND_UP(type_sz(reg.type), 4); unsigned result = - 8 * (alloc.offsets[reg.nr] + reg.offset / REG_SIZE) + + 8 * alloc.offsets[reg.nr] + reg.offset / 4 + (BRW_GET_SWZ(reg.swizzle, c) + k / csize * 4) * csize + k % csize; /* Do not exceed the limit for this register */ assert(result < 8 * (alloc.offsets[reg.nr] + alloc.sizes[reg.nr])); @@ -102,7 +102,7 @@ var_from_reg(const simple_allocator &alloc, const dst_reg ®, assert(reg.file == VGRF && reg.nr < alloc.count && c < 4); const unsigned csize = DIV_ROUND_UP(type_sz(reg.type), 4); unsigned result = - 8 * (alloc.offsets[reg.nr] + reg.offset / REG_SIZE) + + 8 * alloc.offsets[reg.nr] + reg.offset / 4 + (c + k / csize * 4) * csize + k % csize; /* Do not exceed the limit for this register */ assert(result < 8 * (alloc.offsets[reg.nr] + alloc.sizes[reg.nr])); -- 2.11.0 _______________________________________________ mesa-dev mailing list mesa-dev@lists.freedesktop.org https://lists.freedesktop.org/mailman/listinfo/mesa-dev