RE: [PATCH] drm/amdgpu: psp_ring_destroy cause psp->km_ring.ring_mem NULL

2018-12-17 Thread Yu, Xiangliang
Reviewed-by: Xiangliang Yu -Original Message- From: Lou, Wentao Sent: Tuesday, December 18, 2018 3:52 PM To: amd-gfx@lists.freedesktop.org Cc: Lou, Wentao Subject: [PATCH] drm/amdgpu: psp_ring_destroy cause psp->km_ring.ring_mem NULL psp_ring_destroy inside psp_load_fw cause psp->km

[PATCH] drm/amdgpu: psp_ring_destroy cause psp->km_ring.ring_mem NULL

2018-12-17 Thread wentalou
psp_ring_destroy inside psp_load_fw cause psp->km_ring.ring_mem NULL. Call Trace occurred when psp_cmd_submit. should be psp_ring_stop instead. Change-Id: Ib332004b3b9edc9e002adc532b2d45cdad929b05 Signed-off-by: Wentao Lou --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c | 2 +- 1 file changed, 1 ins

[PATCH] drm/amdgpu: set WRITE_BURST_LENGTH to 64B to workaround SDMA1 hang

2018-12-17 Thread Jim Qu
effect asics: VEGA10 and VEGA12 Change-Id: Idfac315920a4fae771b080ff836bd4c5c980348e Signed-off-by: Jim Qu Acked-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c b/drivers/gp

RE: [PATCH 1/2] drm/amdgpu/psp: Fix to get wrong xgmi session id

2018-12-17 Thread Xu, Feifei
Hi Xiangliang, Could you add more comment on the session_id in the commit message? Like using the session_id to distinguish each VF v.s. PF etc. With that added,Reviewed-by: Feifei Xu Thanks Feifei -Original Message- From: amd-gfx On Behalf Of Xiangliang Yu Sent: Thursday, December 13

RE: [PATCH 2/2] drm/amdgpu/psp: Fix can't detect psp INVOKE command failed

2018-12-17 Thread Xu, Feifei
Reviewed-by: Feifei Xu -Original Message- From: amd-gfx On Behalf Of Xiangliang Yu Sent: Thursday, December 13, 2018 3:42 PM To: amd-gfx@lists.freedesktop.org Cc: Yu, Xiangliang Subject: [PATCH 2/2] drm/amdgpu/psp: Fix can't detect psp INVOKE command failed There isn't ucode when execu

RE: [PATCH 2/2] drm/amdgpu/psp: Fix can't detect psp INVOKE command failed

2018-12-17 Thread Yu, Xiangliang
Anyone to help reviewing? Thanks! -Original Message- From: amd-gfx On Behalf Of Xiangliang Yu Sent: Thursday, December 13, 2018 3:42 PM To: amd-gfx@lists.freedesktop.org Cc: Yu, Xiangliang Subject: [PATCH 2/2] drm/amdgpu/psp: Fix can't detect psp INVOKE command failed There isn't ucode

[PATCH 1/2] drm/amdgpu: Add per device sdma_doorbell_range field

2018-12-17 Thread Zeng, Oak
Different ASIC has different sdma doorbell range. Add a per device sdma_doorbell_range field and initialize it. Change-Id: Idd980db1a72cfb373e24ac23ba3e48bb329ed4ad Signed-off-by: Oak Zeng --- drivers/gpu/drm/amd/amdgpu/amdgpu_doorbell.h | 2 ++ drivers/gpu/drm/amd/amdgpu/vega10_reg_init.c | 1 +

[PATCH 2/2] drm/amdgpu: Fix sdma doorbell range setting

2018-12-17 Thread Zeng, Oak
Different ASIC has different SDMA queues so different SDMA doorbell range. Introduce an extra parameter to sdma_doorbell_range function and set sdma doorbell range correctly. Change-Id: I9b8d75b04f5a47ef1c6fd7cc1caaefd98dd2ff2b Signed-off-by: Oak Zeng --- drivers/gpu/drm/amd/amdgpu/amdgpu.h|

RE: [PATCH] drm/amdgpu: Add more process info in VM for debug

2018-12-17 Thread Huang, Trigger
Yeah, make sense. Let me give another patch: [PATCH] drm/amdgpu: print process info when job timeout -Original Message- From: Koenig, Christian Sent: Monday, December 17, 2018 9:39 PM To: Liu, Monk ; Huang, Trigger ; amd-gfx@lists.freedesktop.org Cc: Grodzovsky, Andrey ; Qu, Jim Subje

[PATCH] drm/amdgpu: print process info when job timeout

2018-12-17 Thread Trigger Huang
When a job is timeout, try to print the related process information for debugging Signed-off-by: Trigger Huang --- drivers/gpu/drm/amd/amdgpu/amdgpu_job.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_job.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_job.c in

Re: [PATCH] drm/amdgpu: Use sdma_engine array

2018-12-17 Thread Alex Deucher
On Mon, Dec 17, 2018 at 1:05 PM Zeng, Oak wrote: > > Use sdma_engine[8] array instead of sdma_engine0~7 so it is easier > to program. > > Change-Id: I04873ddfd82e613c28908ae81665c6b02836d093 > Signed-off-by: Oak Zeng Reviewed-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.c

Re: [WIP PATCH 02/15] drm/dp_mst: Refactor drm_dp_update_payload_part1()

2018-12-17 Thread Wentland, Harry
On 2018-12-14 3:47 a.m., Daniel Vetter wrote: > On Thu, Dec 13, 2018 at 08:25:31PM -0500, Lyude Paul wrote: >> There should be no functional changes here > > Would be good to explain what you did refactor here, instead of me trying > to reconstruct it from the patch. Especially pre-coffee that hel

Re: [WIP PATCH 01/15] drm/dp_mst: Remove bogus conditional in drm_dp_update_payload_part1()

2018-12-17 Thread Wentland, Harry
On 2018-12-14 3:42 a.m., Daniel Vetter wrote: > On Thu, Dec 13, 2018 at 08:25:30PM -0500, Lyude Paul wrote: >> There's no reason we need this, it's just confusing looking. >> >> Signed-off-by: Lyude Paul >> Cc: Juston Li >> --- >> drivers/gpu/drm/drm_dp_mst_topology.c | 4 +--- >> 1 file chang

[PATCH v4 1/2] drm/sched: Refactor ring mirror list handling.

2018-12-17 Thread Andrey Grodzovsky
Decauple sched threads stop and start and ring mirror list handling from the policy of what to do about the guilty jobs. When stoppping the sched thread and detaching sched fences from non signaled HW fenes wait for all signaled HW fences to complete before rerunning the jobs. v2: Fix resubmission

[PATCH v4 2/2] drm/sched: Rework HW fence processing.

2018-12-17 Thread Andrey Grodzovsky
Expedite job deletion from ring mirror list to the HW fence signal callback instead from finish_work, together with waiting for all such fences to signal in drm_sched_stop we garantee that already signaled job will not be processed twice. Remove the sched finish fence callback and just submit finis

[PATCH 4/7] drm: Move the legacy kms disable_all helper to crtc helpers

2018-12-17 Thread Daniel Vetter
It's not a core function, and the matching atomic functions are also not in the core. Plus the suspend/resume helper is also already there. Needs a tiny bit of open-coding, but less midlayer beats that I think. v2: Rebase onto ast (which gained a new user). Cc: Sam Bobroff Reviewed-by: Alex Deu

Re: [PATCH v2] drm/amd/display: Add below the range support for FreeSync

2018-12-17 Thread Wentland, Harry
On 2018-12-05 12:08 p.m., Nicholas Kazlauskas wrote: > [Why] > When the flip-rate is below the minimum supported variable refresh rate > range for the monitor the front porch wait will timeout and be > frequently misaligned resulting in stuttering and/or flickering. > > The FreeSync module can sti

Re: After Vega 56/64 GPU hang I unable reboot system

2018-12-17 Thread Grodzovsky, Andrey
On 12/17/2018 01:51 PM, Wentland, Harry wrote: > On 2018-12-15 4:42 a.m., Mikhail Gavrilov wrote: >> On Sat, 15 Dec 2018 at 00:36, Wentland, Harry wrote: >>> Looks like there's an error before this happens that might get us into this >>> mess: >>> >>> [ 229.741741] [drm:amdgpu_job_timedout [am

Re: After Vega 56/64 GPU hang I unable reboot system

2018-12-17 Thread Wentland, Harry
On 2018-12-15 4:42 a.m., Mikhail Gavrilov wrote: > On Sat, 15 Dec 2018 at 00:36, Wentland, Harry wrote: >> >> Looks like there's an error before this happens that might get us into this >> mess: >> >> [ 229.741741] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, >> signaled seq=286

Re: [PATCH v3 1/2] drm/sched: Refactor ring mirror list handling.

2018-12-17 Thread Koenig, Christian
Am 17.12.18 um 17:57 schrieb Grodzovsky, Andrey: > > On 12/17/2018 10:27 AM, Christian König wrote: >> Am 10.12.18 um 22:43 schrieb Andrey Grodzovsky: >>> Decauple sched threads stop and start and ring mirror >>> list handling from the policy of what to do about the >>> guilty jobs. >>> When stoppp

[PATCH] drm/amdgpu: Use sdma_engine array

2018-12-17 Thread Zeng, Oak
Use sdma_engine[8] array instead of sdma_engine0~7 so it is easier to program. Change-Id: I04873ddfd82e613c28908ae81665c6b02836d093 Signed-off-by: Oak Zeng --- drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.c | 8 drivers/gpu/drm/amd/amdgpu/amdgpu_doorbell.h | 9 + drivers/gpu/drm

Re: [PATCH libdrm 1/5] [libdrm] new syncobj extension

2018-12-17 Thread Emil Velikov
Hi Chunming Zhou, On Fri, 2 Nov 2018 at 08:27, Chunming Zhou wrote: > > Signed-off-by: Chunming Zhou > --- > include/drm/drm.h | 38 ++ Please read through include/drm/README about how include/drm/ should be updated. Thanks Emil _

Re: [PATCH v3 1/2] drm/sched: Refactor ring mirror list handling.

2018-12-17 Thread Grodzovsky, Andrey
On 12/17/2018 10:27 AM, Christian König wrote: > Am 10.12.18 um 22:43 schrieb Andrey Grodzovsky: >> Decauple sched threads stop and start and ring mirror >> list handling from the policy of what to do about the >> guilty jobs. >> When stoppping the sched thread and detaching sched fences >> from

[PATCH] drm/radeon: avoid a possible array overflow

2018-12-17 Thread Yang Xiao
From: Young Xiao When looking up the connector type make sure the index is valid. Avoids a later crash if we read past the end of the array. See commit e1718d97aa88 ("drm/amdgpu: avoid a possible array overflow") for detail. Signed-off-by: Young Xiao --- drivers/gpu/drm/radeon/radeon_atombio

Re: [PATCH v3 1/2] drm/sched: Refactor ring mirror list handling.

2018-12-17 Thread Christian König
Am 10.12.18 um 22:43 schrieb Andrey Grodzovsky: Decauple sched threads stop and start and ring mirror list handling from the policy of what to do about the guilty jobs. When stoppping the sched thread and detaching sched fences from non signaled HW fenes wait for all signaled HW fences to complet

Re: [PATCH] drm/amdgpu: correct the return value for error case

2018-12-17 Thread Deucher, Alexander
Reviewed-by: Alex Deucher From: amd-gfx on behalf of Evan Quan Sent: Monday, December 17, 2018 4:59:02 AM To: amd-gfx@lists.freedesktop.org Cc: Quan, Evan Subject: [PATCH] drm/amdgpu: correct the return value for error case It should not return 0 for error cas

Re: [PATCH] drm/amd/display: Skip fast cursor updates for fb changes

2018-12-17 Thread Grodzovsky, Andrey
On 12/17/2018 04:53 AM, Michel Dänzer wrote: > On 2018-12-15 6:25 a.m., Grodzovsky, Andrey wrote: >> On 12/14/2018 02:17 PM, Kazlauskas, Nicholas wrote: >>> On 12/14/18 2:06 PM, Grodzovsky, Andrey wrote: In general I agree with Michel that  DRM solution is required to properly address t

[PATCH v2] drm/amdgpu:Improves robustness of SOC15_WAIT_ON_RREG

2018-12-17 Thread Zhu, James
If register value is updating, reset timeout counter. It improves robustness of SOC15_WAIT_ON_RREG. Signed-off-by: James Zhu --- drivers/gpu/drm/amd/amdgpu/soc15_common.h | 9 +++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/soc15_common.h b/dri

[PATCH] drm/amdgpu:Improves robustness of SOC15_WAIT_ON_RREG

2018-12-17 Thread Zhu, James
If monitor register value is updating, reset timeout counter. It improves robustness of SOC15_WAIT_ON_RREG. Signed-off-by: James Zhu --- drivers/gpu/drm/amd/amdgpu/soc15_common.h | 7 +-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/soc15_common.h

Re: [PATCH] drm/amdgpu: Add more process info in VM for debug

2018-12-17 Thread Koenig, Christian
Hi guys, We could also print shaders, IBs or textures to figure out what's going wrong, but that would be overkill as well. The log is to notice that something is wrong and not to a detailed crash report. The PID is perfectly sufficient to identify the process which triggered an issue and whe

[PATCH] drm/amdgpu: correct the return value for error case

2018-12-17 Thread Evan Quan
It should not return 0 for error case as '0' is actually a special value for index. Change-Id: Iced8b4427d4403f86826a7c8e3c9d1da3394246c Signed-off-by: Evan Quan --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h | 12 ++-- drivers/gpu/drm/amd/amdgpu/amdgpu_xgmi.c | 15 +-- drivers

Re: [PATCH] drm/amd/display: Skip fast cursor updates for fb changes

2018-12-17 Thread Michel Dänzer
On 2018-12-15 6:25 a.m., Grodzovsky, Andrey wrote: > On 12/14/2018 02:17 PM, Kazlauskas, Nicholas wrote: >> On 12/14/18 2:06 PM, Grodzovsky, Andrey wrote: >>> In general I agree with Michel that  DRM solution is required to >>> properly address this but since now it's not really obvious what is the

RE: [PATCH] drm/amdgpu: Add more process info in VM for debug

2018-12-17 Thread Liu, Monk
Hi Christian, I think for some SRIOV customers they need this rich information, Maybe we can use an kernel option to let user select if rich or simple information should be printed upon job TDR ? In SRIOV branch we can set it enable by default while set by default disable for drm-next /Monk

RE: [PATCH] drm/amdgpu: Add more process info in VM for debug

2018-12-17 Thread Huang, Trigger
Hi Christian, Yes, if the test machine is still there for debugging, we can login it and check a lot of things, such as ' ps -p 1 -o args ' as you suggested. But sometimes, the system is not alive anymore, and we only got some log files (such as kern.log ) from QA or customers. And at this tim