Re: [PATCH] drm/amd/display: avoid NPD when ASIC does not support DMUB

2025-03-13 Thread Leo Li
On 2025-03-13 07:29, Thadeu Lima de Souza Cascardo wrote: On Wed, Feb 05, 2025 at 10:06:38AM -0300, Thadeu Lima de Souza Cascardo wrote: ctx->dmub_srv will de NULL if the ASIC does not support DMUB, which is tested in dm_dmub_sw_init. However, it will be dereferenced in dmub_hw_lock_mgr_cmd

Re: [PATCH] drm/amd/display: avoid NPD when ASIC does not support DMUB

2025-03-13 Thread Thadeu Lima de Souza Cascardo
On Wed, Feb 05, 2025 at 10:06:38AM -0300, Thadeu Lima de Souza Cascardo wrote: > ctx->dmub_srv will de NULL if the ASIC does not support DMUB, which is > tested in dm_dmub_sw_init. > > However, it will be dereferenced in dmub_hw_lock_mgr_cmd if > should_use_dmub_lock returns true. > > This has be

Re: [PATCH] drm/amdgpu/gfx12: correct cleanup of 'me' field with gfx_v12_0_me_fini()

2025-03-13 Thread Alex Deucher
Applied. Thanks! Alex On Wed, Mar 12, 2025 at 6:09 AM Wentao Liang wrote: > > In gfx_v12_0_cp_gfx_load_me_microcode_rs64(), gfx_v12_0_pfp_fini() is > incorrectly used to free 'me' field of 'gfx', since gfx_v12_0_pfp_fini() > can only release 'pfp' field of 'gfx'. The release function of 'me' fi

Re: [PATCH 1/7] drm/amd/amdgpu: Simplify SDMA reset mechanism by removing dynamic callbacks

2025-03-13 Thread Alex Deucher
I think as long as the locking is correct, the src shouldn't matter. You just need to stop the kernel queues (and save state) and evict the user queues (since HWS is responsible for saving the MQDs of the non-guilty user queues). If KFD detected the hang (e.g., queue eviction fails when called for

Re: [PATCH 1/7] drm/amd/amdgpu: Simplify SDMA reset mechanism by removing dynamic callbacks

2025-03-13 Thread Christian König
Am 12.03.25 um 10:23 schrieb Zhang, Jesse(Jie): > > [AMD Official Use Only - AMD Internal Distribution Only] > > >   > >   > > *From:*Koenig, Christian > *Sent:* Wednesday, March 12, 2025 4:39 PM > *To:* Zhang, Jesse(Jie) ; amd-gfx@lists.freedesktop.org > *Cc:* Deucher, Alexander ; Kim, Jonathan

[PATCH 08/11] drm/amdgpu/gfx12: add support for disable_kq

2025-03-13 Thread Alex Deucher
Plumb in support for disabling kernel queues. v2: use ring counts per Felix' suggestion v3: fix stream fault handler, enable EOP interrupts v4: fix MEC interrupt offset (Sunil) Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/gfx_v12_0.c | 183 + 1 file changed

Re: [PATCH 1/2] drm: Create an app info option for wedge events

2025-03-13 Thread Raag Jadav
On Wed, Mar 12, 2025 at 06:59:33PM -0300, André Almeida wrote: > Em 12/03/2025 07:06, Raag Jadav escreveu: > > On Tue, Mar 11, 2025 at 07:09:45PM +0200, Raag Jadav wrote: > > > On Mon, Mar 10, 2025 at 06:27:53PM -0300, André Almeida wrote: > > > > Em 01/03/2025 02:53, Raag Jadav escreveu: > > > > >

[PATCH 05/11] drm/amdgpu/mes: update hqd masks when disable_kq is set

2025-03-13 Thread Alex Deucher
Make all resources available to user queues. Suggested-by: Sunil Khatri Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c b/drivers

[PATCH 02/11] drm/amdgpu: add ring flag for no user submissions

2025-03-13 Thread Alex Deucher
This would be set by IPs which only accept submissions from the kernel, not userspace, such as when kernel queues are disabled. Don't expose the rings to userspace and reject any submissions in the CS IOCTL. Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amd

[PATCH 06/11] drm/amdgpu/mes: make more vmids available when disable_kq=1

2025-03-13 Thread Alex Deucher
If we don't have kernel queues, the vmids can be used by the MES for user queues. Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c | 2 +- drivers/gpu/drm/amd/amdgpu/gmc_v10_0.c | 2 +- drivers/gpu/drm/amd/amdgpu/gmc_v12_0.c | 2 +- 3 files cha

[PATCH 10/11] drm/amdgpu/sdma6: add support for disable_kq

2025-03-13 Thread Alex Deucher
When the parameter is set, disable user submissions to kernel queues. Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c b/drivers/gpu/drm/amd/amdgpu/sdm

[PATCH 07/11] drm/amdgpu/gfx11: add support for disable_kq

2025-03-13 Thread Alex Deucher
Plumb in support for disabling kernel queues in GFX11. We have to bring up a GFX queue briefly in order to initialize the clear state. After that we can disable it. v2: use ring counts per Felix' suggestion v3: fix stream fault handler, enable EOP interrupts v4: fix MEC interrupt offset (Sunil)

[PATCH 11/11] drm/amdgpu/sdma7: add support for disable_kq

2025-03-13 Thread Alex Deucher
When the parameter is set, disable user submissions to kernel queues. Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/sdma_v7_0.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v7_0.c b/drivers/gpu/drm/amd/amdgpu/sdm

[PATCH 03/11] drm/amdgpu/gfx: add generic handling for disable_kq

2025-03-13 Thread Alex Deucher
Add proper checks for disable_kq functionality in gfx helper functions. Add special logic for families that require the clear state setup. v2: use ring count as per Felix suggestion v3: fix num_gfx_rings handling in amdgpu_gfx_graphics_queue_acquire() Signed-off-by: Alex Deucher --- drivers/gp

[PATCH 09/11] drm/amdgpu/sdma: add flag for tracking disable_kq

2025-03-13 Thread Alex Deucher
For SDMA, we still need kernel queues for paging so they need to be initialized, but we no not want to accept submissions from userspace when disable_kq is set. Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.h | 1 + 1 file changed, 1 insertion(

[PATCH V5 00/11] Add disable kernel queue support

2025-03-13 Thread Alex Deucher
To better evaluate user queues, add a module parameter to disable kernel queues. With this set kernel queues are disabled and only user queues are available. This frees up hardware resources for use in user queues which would otherwise be used by kernel queues and provides a way to validate user

[PATCH 01/11] drm/amdgpu: add parameter to disable kernel queues

2025-03-13 Thread Alex Deucher
On chips that support user queues, setting this option will disable kernel queues to be used to validate user queues without kernel queues. Reviewed-by: Prike Liang Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 1 + drivers/gpu/drm/amd/amdg

[PATCH 04/11] drm/amdgpu/mes: centralize gfx_hqd mask management

2025-03-13 Thread Alex Deucher
Move it to amdgpu_mes to align with the compute and sdma hqd masks. No functional change. v2: rebase on new changes Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c | 22 ++ drivers/gpu/drm/amd/amdgpu/mes_v11_0.c | 16 +++

Re: [PATCH] drm/amd/display: avoid NPD when ASIC does not support DMUB

2025-03-13 Thread Rodrigo Siqueira
On 03/13, Thadeu Lima de Souza Cascardo wrote: > On Wed, Feb 05, 2025 at 10:06:38AM -0300, Thadeu Lima de Souza Cascardo wrote: > > ctx->dmub_srv will de NULL if the ASIC does not support DMUB, which is > > tested in dm_dmub_sw_init. > > > > However, it will be dereferenced in dmub_hw_lock_mgr_cmd

Re: [PATCH] drm/amd/display: avoid NPD when ASIC does not support DMUB

2025-03-13 Thread Alex Deucher
Applied. Thanks. Alex On Thu, Mar 13, 2025 at 11:17 AM Leo Li wrote: > > > > On 2025-03-13 07:29, Thadeu Lima de Souza Cascardo wrote: > > On Wed, Feb 05, 2025 at 10:06:38AM -0300, Thadeu Lima de Souza Cascardo > > wrote: > >> ctx->dmub_srv will de NULL if the ASIC does not support DMUB, which

Re: [RFC PATCH 2/7] drm/amd/display: start using drm_edid helpers to parse EDID caps

2025-03-13 Thread Alex Hung
On 3/8/25 07:26, Melissa Wen wrote: Groundwork that allocates a temporary drm_edid from raw edid to take advantage of DRM common-code helpers instead of driver-specific code. Signed-off-by: Melissa Wen --- .../gpu/drm/amd/display/amdgpu_dm/amdgpu_dm_helpers.c | 11 +-- 1 file chan

Re: commit 7ffb791423c7 breaks steam game

2025-03-13 Thread Balbir Singh
On 3/14/25 05:12, Bert Karwatzki wrote: > Am Donnerstag, dem 13.03.2025 um 22:47 +1100 schrieb Balbir Singh: >> >> >> Anyway, I think the nokaslr result is interesting, it seems like with nokaslr >> even the older kernels have problems with the game >> >> Could you confirm if with nokaslr >> > Now

Re: [PATCH 07/11] drm/amdgpu/gfx11: add support for disable_kq

2025-03-13 Thread Rodrigo Siqueira
On 03/13, Alex Deucher wrote: > Plumb in support for disabling kernel queues in > GFX11. We have to bring up a GFX queue briefly in > order to initialize the clear state. After that > we can disable it. > > v2: use ring counts per Felix' suggestion > v3: fix stream fault handler, enable EOP inte

Re: [PATCH 11/11] drm/amdgpu/sdma7: add support for disable_kq

2025-03-13 Thread Rodrigo Siqueira
On 03/13, Alex Deucher wrote: > When the parameter is set, disable user submissions > to kernel queues. > > Reviewed-by: Sunil Khatri > Signed-off-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/sdma_v7_0.c | 4 > 1 file changed, 4 insertions(+) > > diff --git a/drivers/gpu/drm/amd/am

Re: [RFC PATCH 0/7] drm/amd/display: more DRM edid common-code to the display driver

2025-03-13 Thread Alex Hung
The series look fine to me, except one small error in patch 2. I can send this series to promotion tests once the error is addressed. Let me also check others for comments. Hi Harry and Leo, Do you have other concerns before I sent this series to promotion tests? On 3/8/25 07:26, Melissa Wen

Re: [PATCH] drm/amdgpu: Add debug masks for HDCP LC FW testing

2025-03-13 Thread Pillai, Aurabindo
[AMD Official Use Only - AMD Internal Distribution Only] Reviewed-by: Aurabindo Pillai -- Regards, Jay From: Dominik Kaszewski Sent: Thursday, March 13, 2025 4:52 AM To: amd-gfx@lists.freedesktop.org Cc: Pillai, Aurabindo Subject: [PATCH] drm/amdgpu: Add deb

[PATCH] drm/amdgpu: drop drm_firmware_drivers_only()

2025-03-13 Thread Alex Deucher
There are a number of systems and cloud providers out there that have nomodeset hardcoded in their kernel parameters to block nouveau for the nvidia driver. This prevents the amdgpu driver from loading. Unfortunately the end user cannot easily change this. The preferred way to block modules from

Re: [PATCH V5 00/11] Add disable kernel queue support

2025-03-13 Thread Rodrigo Siqueira
n 03/13, Alex Deucher wrote: > To better evaluate user queues, add a module parameter > to disable kernel queues. With this set kernel queues > are disabled and only user queues are available. This > frees up hardware resources for use in user queues which > would otherwise be used by kernel queu

[PATCH 04/11] drm/amdgpu/mes: centralize gfx_hqd mask management

2025-03-13 Thread Alex Deucher
Move it to amdgpu_mes to align with the compute and sdma hqd masks. No functional change. v2: rebase on new changes Reviewed-by: Sunil Khatri Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_mes.c | 22 ++ drivers/gpu/drm/amd/amdgpu/mes_v11_0.c | 16 +++

Re: [PATCH 07/11] drm/amdgpu/gfx11: add support for disable_kq

2025-03-13 Thread Alex Deucher
On Thu, Mar 13, 2025 at 6:08 PM Rodrigo Siqueira wrote: > > On 03/13, Alex Deucher wrote: > > Plumb in support for disabling kernel queues in > > GFX11. We have to bring up a GFX queue briefly in > > order to initialize the clear state. After that > > we can disable it. > > > > v2: use ring coun

Re: [PATCH 02/11] drm/amdgpu: add ring flag for no user submissions

2025-03-13 Thread Alex Deucher
On Thu, Mar 13, 2025 at 5:53 PM Rodrigo Siqueira wrote: > > On 03/13, Alex Deucher wrote: > > This would be set by IPs which only accept submissions > > from the kernel, not userspace, such as when kernel > > queues are disabled. Don't expose the rings to userspace > > and reject any submissions i

[PATCH 3/3] drm/amdgpu: don't free conflicting apertures for non-display devices

2025-03-13 Thread Alex Deucher
PCI_CLASS_ACCELERATOR_PROCESSING devices won't ever be the sysfb, so there is no need to free conflicting apertures. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 15 +++ 1 file changed, 11 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd

[PATCH 1/3] drm/amdgpu: drop drm_firmware_drivers_only()

2025-03-13 Thread Alex Deucher
There are a number of systems and cloud providers out there that have nomodeset hardcoded in their kernel parameters to block nouveau for the nvidia driver. This prevents the amdgpu driver from loading. Unfortunately the end user cannot easily change this. The preferred way to block modules from

[PATCH 2/3] drm/amdgpu: adjust drm_firmware_drivers_only() handling

2025-03-13 Thread Alex Deucher
Move to probe so we can check the PCI device type and only apply the drm_firmware_drivers_only() check for PCI DISPLAY classes. Also add a module parameter to override the nomodeset kernel parameter as a workaround for platforms that have this hardcoded on their kernel command lines. Signed-off-b

Re: [PATCH V5 00/11] Add disable kernel queue support

2025-03-13 Thread Alex Deucher
On Thu, Mar 13, 2025 at 6:21 PM Rodrigo Siqueira wrote: > > n 03/13, Alex Deucher wrote: > > To better evaluate user queues, add a module parameter > > to disable kernel queues. With this set kernel queues > > are disabled and only user queues are available. This > > frees up hardware resources

[PATCH] drm/amdgpu: Enable ACA by default for psp v13_0_6/v13_0_14

2025-03-13 Thread Xiang Liu
Enable ACA by default for psp v13_0_6/v13_0_14. Signed-off-by: Xiang Liu --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 -- 1 file changed, 4 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index 7cf8a3036828..cf

RE: [PATCH] drm/amdgpu: Fix computation for remain size of CPER ring

2025-03-13 Thread Zhang, Hawking
[AMD Official Use Only - AMD Internal Distribution Only] Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: Liu, Xiang(Dean) Sent: Thursday, March 13, 2025 11:28 To: amd-gfx@lists.freedesktop.org Cc: Zhang, Hawking ; Zhou1, Tao ; Liu, Xiang(Dean) Subject: [PATCH] drm

RE: [PATCH] drm/amdgpu: format old RAS eeprom data into V3 version

2025-03-13 Thread Zhang, Hawking
[AMD Official Use Only - AMD Internal Distribution Only] Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: amd-gfx On Behalf Of Tao Zhou Sent: Wednesday, March 12, 2025 18:06 To: amd-gfx@lists.freedesktop.org Cc: Zhou1, Tao Subject: [PATCH] drm/amdgpu: format old RAS

Re: [PATCH 00/63] Fix CONFIG_DRM_USE_DYNAMIC_DEBUG=y

2025-03-13 Thread jim . cromie
hello everyone, sorry for the late reply. I have a cleaner version cooking now. less inter-commit churn, by bringing more cleanups forward. I'll send a -v2 soon. (lets forget all the meandering crap versions I sent) Louis, thanks for testing ! I wrote the test script and submod.ko so the

[PATCH] drm/amdgpu: Add debug masks for HDCP LC FW testing

2025-03-13 Thread Dominik Kaszewski
HDCP Locality Check is being moved to FW, add debug flags to control its behavior in existing hardware for validation purposes. Signed-off-by: Dominik Kaszewski --- drivers/gpu/drm/amd/include/amd_shared.h | 12 1 file changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/incl

RE: [PATCH] drm/amdgpu: Enable ACA by default for psp v13_0_6/v13_0_14

2025-03-13 Thread Zhang, Hawking
[AMD Official Use Only - AMD Internal Distribution Only] Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: Liu, Xiang(Dean) Sent: Thursday, March 13, 2025 15:17 To: amd-gfx@lists.freedesktop.org Cc: Zhang, Hawking ; Zhou1, Tao ; Liu, Xiang(Dean) Subject: [PATCH] drm

Re: [PATCH 1/2] drm: Create an app info option for wedge events

2025-03-13 Thread André Almeida
Em 12/03/2025 07:06, Raag Jadav escreveu: On Tue, Mar 11, 2025 at 07:09:45PM +0200, Raag Jadav wrote: On Mon, Mar 10, 2025 at 06:27:53PM -0300, André Almeida wrote: Em 01/03/2025 02:53, Raag Jadav escreveu: On Fri, Feb 28, 2025 at 06:54:12PM -0300, André Almeida wrote: Hi Raag, On 2/28/25 11

Re: [PATCH 5/8] drm/amdgpu: rework how the cleaner shader is emitted v3

2025-03-13 Thread SRINIVASAN SHANMUGAM
On 2/18/2025 9:43 PM, Christian König wrote: Instead of emitting the cleaner shader for every job which has the enforce_isolation flag set only emit it for the first submission from every client. v2: add missing NULL check v3: fix another NULL pointer deref Signed-off-by: Christian König ---

Re: [PATCH 1/8] drm/amdgpu: grab an additional reference on the gang fence v2

2025-03-13 Thread SRINIVASAN SHANMUGAM
On 3/7/2025 7:18 PM, Christian König wrote: We keep the gang submission fence around in adev, make sure that it stays alive. v2: fix memory leak on retry Signed-off-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 10 +- 1 file changed, 9 insertions(+), 1 delet

Re: commit 7ffb791423c7 breaks steam game

2025-03-13 Thread Balbir Singh
On 3/14/25 09:22, Bert Karwatzki wrote: > Am Freitag, dem 14.03.2025 um 08:54 +1100 schrieb Balbir Singh: >> On 3/14/25 05:12, Bert Karwatzki wrote: >>> Am Donnerstag, dem 13.03.2025 um 22:47 +1100 schrieb Balbir Singh: Anyway, I think the nokaslr result is interesting, it seems like

Re: [PATCH 5/8] drm/amdgpu: rework how the cleaner shader is emitted v3

2025-03-13 Thread SRINIVASAN SHANMUGAM
On 3/7/2025 7:18 PM, Christian König wrote: Instead of emitting the cleaner shader for every job which has the enforce_isolation flag set only emit it for the first submission from every client. v2: add missing NULL check v3: fix another NULL pointer deref Signed-off-by: Christian König ---

RE: [PATCH] drm/amdgpu: drop drm_firmware_drivers_only()

2025-03-13 Thread Russell, Kent
[AMD Official Use Only - AMD Internal Distribution Only] Reviewed-by: Kent Russell > -Original Message- > From: amd-gfx On Behalf Of Alex > Deucher > Sent: Thursday, March 13, 2025 5:05 PM > To: amd-gfx@lists.freedesktop.org > Cc: Deucher, Alexander > Subject: [PATCH] drm/amdgpu: dro