[PATCH 2/2] drm/amdgpu: add critical address check for bad page retirement

2025-07-23 Thread YiPeng Chai
Add critical address check for bad page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 8 1 file changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index 0ad3a9eedfd2

[PATCH 1/2] drm/amdgpu: support ras critical address check

2025-07-23 Thread YiPeng Chai
Support ras critical address check. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 89 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 14 2 files changed, 103 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu

[PATCH V2] drm/amdgpu: add command to check address validity

2025-07-17 Thread YiPeng Chai
Add command to check address validity and remove unused command codes. v2: The command interface adds new parameters to support multiple check address strategies. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 63 + drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: add command to check address validity

2025-07-15 Thread YiPeng Chai
Add command to check address validity and remove unused command codes. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 58 +++-- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 ++ 2 files changed, 29 insertions(+), 32 deletions(-) diff --git a/drivers

[PATCH] drm/amdgpu: query the allocated vram address block info

2025-07-09 Thread YiPeng Chai
: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 38 drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.h | 17 + 2 files changed, 55 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c index

[PATCH] drm/amdgpu: Remove useless timeout error message

2025-06-29 Thread YiPeng Chai
The timeout is only used to interrupt polling, so there is no need to print an error message. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 11 +-- 1 file changed, 1 insertion(+), 10 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH] drm/amdgpu: Reduce redundant gpu resets on nbio v7.4

2024-10-21 Thread YiPeng Chai
On nbio v7.4, ras controller interrupt and athub interrupt are generated after injecting UE to PCIE, but gpu reset only needs to be triggered once. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/nbio_v7_4.c | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/drivers

[PATCH] amd/amdgpu: Reduce unnecessary repetitive GPU resets

2024-09-20 Thread YiPeng Chai
In the multi-GPU case, once one GPU has started resetting all GPUs in the hive, the other GPUs do not need to trigger a GPU reset again. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 21 - 1 file changed, 20 insertions(+), 1 deletion(-) diff --git a/drivers
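The "reset only once per hive" idea above can be sketched in user-space C as a shared flag that only the first caller manages to set. This is a minimal illustration, not the patch's code: `struct hive_state`, `hive_try_begin_reset` and `hive_end_reset` are hypothetical names, and C11 atomics stand in for whatever synchronization the driver actually uses.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for per-hive state; not the driver's real struct. */
struct hive_state {
    atomic_bool reset_in_progress;
};

/* Returns true only for the first caller; later callers skip the reset. */
static bool hive_try_begin_reset(struct hive_state *hive)
{
    bool expected = false;

    assert(hive != NULL);
    /* Atomically flip false -> true; fails if a reset is already running. */
    return atomic_compare_exchange_strong(&hive->reset_in_progress,
                                          &expected, true);
}

/* Called once the hive-wide reset has finished. */
static void hive_end_reset(struct hive_state *hive)
{
    assert(hive != NULL);
    atomic_store(&hive->reset_in_progress, false);
}
```

A GPU that gets `false` back from `hive_try_begin_reset` would simply not schedule its own reset, since the one already in flight covers the whole hive.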

[PATCH] drm/amdgpu: fix ras UE error injection failure issue

2024-07-21 Thread YiPeng Chai
The ras command shared memory is allocated from VRAM and the response status of the command buffer will not be zero due to gpu being in fatal error state after ras UE error injection. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c | 4 +--- 1 file changed, 1 insertion

[PATCH 1/3] drm/amdgpu: optimize umc v12 address conversion function

2024-07-17 Thread YiPeng Chai
Split into 3 parts: 1. Convert soc physical address via ras ta. 2. Expand bad pages from soc physical address. 3. Dump bad address info. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 116 - 1 file changed, 77 insertions(+), 39 deletions(-) diff

[PATCH 3/3] drm/amdgpu: Remove unused code

2024-07-17 Thread YiPeng Chai
Remove unused code. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 29 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 10 --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 86 - drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 5 -- 4 files changed

[PATCH 2/3] drm/amdgpu: optimize logging deferred error info

2024-07-17 Thread YiPeng Chai
1. Use pa_pfn as the radix-tree key index to log deferred error info. 2. Use local array to store expanded bad pages. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 2 +- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 14 ++ drivers/gpu/drm/amd/amdgpu/umc_v12_0.c

[PATCH V2] drm/amdgpu: add mutex to protect ras shared memory

2024-07-14 Thread YiPeng Chai
Add mutex to protect ras shared memory. v2: Add TA_RAS_COMMAND__TRIGGER_ERROR command call status check. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c| 123 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h| 1 + drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: avoid repeatedly executing gpu ras reset

2024-07-08 Thread YiPeng Chai
When a gpu in hive is performing ras reset, other gpus in hive do not need to schedule recovery work to reset the gpu. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 20 +++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm

[PATCH V2 1/2] drm/amdgpu: flush all cached ras bad pages to eeprom

2024-07-08 Thread YiPeng Chai
Before uninstalling gpu driver, flush all cached ras bad pages to eeprom. v2: Put the same code into a function and reuse the function. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 35 - 1 file changed, 29 insertions(+), 6 deletions

[PATCH V2 2/2] drm/amdgpu: timely save bad pages to eeprom after gpu ras reset is completed

2024-07-08 Thread YiPeng Chai
. v2: 1. Add the above description to code comments. 2. Reuse existing function. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 18 ++ 2 files changed, 23 insertions(+), 1 deletion(-) diff --

[PATCH 2/2] drm/amdgpu: timely save bad pages to eeprom after gpu ras reset is complete

2024-07-03 Thread YiPeng Chai
plete. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 14 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 6 ++ 2 files changed, 19 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

[PATCH 1/2] drm/amdgpu: flush all cached ras bad pages to eeprom

2024-07-03 Thread YiPeng Chai
Before uninstalling gpu driver, flush all cached ras bad pages to eeprom. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 17 + 1 file changed, 17 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: sysfs node disable query error count during gpu reset

2024-07-01 Thread YiPeng Chai
Sysfs node disable query error count during gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/aldebaran.c | 2 -- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 3 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c| 3 +++ 3 files changed, 5 insertions(+), 3 deletions(-) diff

[PATCH V2] drm/amdgpu: sysfs node disable query error count during gpu reset

2024-06-30 Thread YiPeng Chai
Sysfs node disable query error count during gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 15 +-- 1 file changed, 13 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

[PATCH] drm/amdgpu: sysfs node disable query error count during gpu reset

2024-06-28 Thread YiPeng Chai
Sysfs node disable query error count during gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH] drm/amdgpu: add mutex to protect ras shared memory

2024-06-25 Thread YiPeng Chai
Add mutex to protect ras shared memory. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c| 124 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h| 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 2 + 3 files changed, 87 insertions(+), 40 deletions

[PATCH V2 4/4] drm/amdgpu: add gpu reset check and exception handling

2024-06-19 Thread YiPeng Chai
Add gpu reset check and exception handling for page retirement. v2: Clear poison consumption messages cached in fifo after non mode-1 reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 52 + 1 file changed, 52 insertions(+) diff --git a

[PATCH V2 2/4] drm/amdgpu: refine poison creation interrupt handler

2024-06-19 Thread YiPeng Chai
: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 37 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 18 insertions(+), 20 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH V2 3/4] drm/amdgpu: refine poison consumption interrupt handler

2024-06-19 Thread YiPeng Chai
1. The poison fifo is only used for poison consumption requests. 2. Merge reset requests when the poison fifo caches multiple poison consumption messages. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 56 - drivers/gpu/drm/amd/amdgpu
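Merging reset requests, as described above, can be pictured as OR-ing the reset flags of every cached message so that a single reset covers all of them. The sketch below assumes a hypothetical `struct poison_msg` with a `reset_flags` bitmask; the real message layout is not shown in this snippet.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical message layout; field names are illustrative only. */
struct poison_msg {
    uint32_t block;       /* IP block that consumed the poison */
    uint32_t reset_flags; /* bitmask of reset types requested */
};

/*
 * OR together the reset flags of every cached message so the caller can
 * issue one reset that covers all of them. Returns 0 if no reset is needed.
 */
static uint32_t merge_reset_requests(const struct poison_msg *msgs,
                                     size_t count)
{
    uint32_t merged = 0;

    assert(msgs != NULL || count == 0);
    for (size_t i = 0; i < count; i++)
        merged |= msgs[i].reset_flags;
    return merged;
}
```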

[PATCH V2 1/4] drm/amdgpu: add variable to record the deferred error number read by driver

2024-06-19 Thread YiPeng Chai
Add variable to record the deferred error number read by driver. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 62 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 4 +- 3 files changed, 48

[PATCH 4/5] drm/amdgpu: add completion to wait for ras reset to complete

2024-06-17 Thread YiPeng Chai
Add completion to wait for ras reset to complete. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 11 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH 5/5] drm/amdgpu: add gpu reset check and exception handling

2024-06-17 Thread YiPeng Chai
Add gpu reset check and exception handling for page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 + 1 file changed, 43 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu

[PATCH 3/5] drm/amdgpu: refine poison consumption interrupt handler

2024-06-17 Thread YiPeng Chai
1. The poison fifo is only used for poison consumption requests. 2. Merge reset requests when the poison fifo caches multiple poison consumption messages. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 58 + drivers/gpu/drm/amd/amdgpu

[PATCH 2/5] drm/amdgpu: refine poison creation interrupt handler

2024-06-17 Thread YiPeng Chai
: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 41 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 21 insertions(+), 21 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 1/5] drm/amdgpu: add variable to record the deferred error number read by driver

2024-06-17 Thread YiPeng Chai
Add variable to record the deferred error number read by driver. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 62 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 4 +- 3 files changed, 48

[PATCH 5/5] drm/amdgpu: add gpu reset check before page retirement thread runs

2024-06-12 Thread YiPeng Chai
If gpu is recovering, clear all message reset flags in fifo and wait for gpu to complete recovery. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 12 1 file changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH 2/5] drm/amdgpu: add threshold to interrupt waiting for DE data to be ready

2024-06-12 Thread YiPeng Chai
If the number of messages to be processed in the fifo exceeds the threshold, it will not continue to wait for the DE data to be ready. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 13 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 4 +++- 2 files changed
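The threshold rule above can be written as a small predicate: stop waiting as soon as the data is ready or the backlog exceeds the threshold. This is only a sketch; the function name and all parameters are hypothetical, and the timeout check is an added assumption about how the polling loop is otherwise bounded.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Decide whether to keep polling for DE data.
 * All names here are illustrative, not taken from the driver.
 */
static bool should_keep_waiting(bool data_ready, uint32_t pending_msgs,
                                uint32_t threshold, uint32_t elapsed_ms,
                                uint32_t timeout_ms)
{
    if (data_ready)
        return false;               /* nothing left to wait for */
    if (pending_msgs > threshold)
        return false;               /* backlog too large: stop waiting */
    return elapsed_ms < timeout_ms; /* assumed upper bound on polling */
}
```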

[PATCH 4/5] drm/amdgpu: wait for gpu to complete reset

2024-06-12 Thread YiPeng Chai
Add completion to wait for gpu to complete reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 12 drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 13 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm

[PATCH 3/5] drm/amdgpu: clear all messages reset flags in fifo before gpu reset

2024-06-12 Thread YiPeng Chai
To avoid resetting the gpu repeatedly, clear all message reset flags in the fifo before the first gpu reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 59 - 1 file changed, 58 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd

[PATCH 1/5] drm/amdgpu: add condition check for waking up thread

2024-06-12 Thread YiPeng Chai
1. Cannot add messages to fifo in gpu reset mode. 2. The thread can be awakened only when the message has been successfully saved to the fifo. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 16 ++-- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 18
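The two conditions above can be sketched with a toy fixed-size fifo: the enqueue is refused during reset, and the wake-up happens only after a successful enqueue. Everything here is hypothetical (`struct msg_fifo`, `post_message`, the capacity of 4); a counter stands in for actually waking a worker thread.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define FIFO_SLOTS 4 /* illustrative capacity */

struct msg_fifo {
    uint32_t slots[FIFO_SLOTS];
    size_t count;
    bool in_gpu_reset;    /* condition 1: no new messages during reset */
    unsigned int wakeups; /* stands in for waking the worker thread */
};

/* Store one message; fails during reset or when the fifo is full. */
static bool fifo_put(struct msg_fifo *f, uint32_t msg)
{
    if (f->in_gpu_reset || f->count == FIFO_SLOTS)
        return false;
    f->slots[f->count++] = msg;
    return true;
}

/* Condition 2: wake the thread only if the message was really saved. */
static bool post_message(struct msg_fifo *f, uint32_t msg)
{
    assert(f != NULL);
    if (!fifo_put(f, msg))
        return false;
    f->wakeups++;
    return true;
}
```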

[PATCH] drm/amdgpu: change log level

2024-05-06 Thread YiPeng Chai
Change log level. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 4 ++-- 2 files changed, 4 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c

[PATCH] drm/amdgpu: add mutex to protect ras shared memory

2024-04-28 Thread YiPeng Chai
Add mutex to protect ras shared memory. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.c| 121 ++--- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h| 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 2 + 3 files changed, 84 insertions(+), 40 deletions

[PATCH 2/2] drm/amdgpu: Remove redundant function call

2024-04-27 Thread YiPeng Chai
Remove redundant function call. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 22 ++ 1 file changed, 6 insertions(+), 16 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 1/2] drm/amdgpu: Remove unused code

2024-04-27 Thread YiPeng Chai
Remove unused code. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 69 -- 1 file changed, 69 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c index 8df84feaf046..12bae67be91c 100644 --- a

[PATCH V2] drm/amdgpu: Fix ras mode2 reset failure in ras aca mode

2024-04-23 Thread YiPeng Chai
Fix ras mode2 reset failure in ras aca mode. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index edb3cd0cef96..11a70991152c 100644

[PATCH] drm/amdgpu: Fix ras mode2 reset failure in ras aca mode

2024-04-22 Thread YiPeng Chai
Fix ras mode2 reset failure in ras aca mode for sdma v4_4_2 and gfx v9_4_3. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/gfx_v9_4_3.c | 4 drivers/gpu/drm/amd/amdgpu/sdma_v4_4_2.c | 4 2 files changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v9_4_3.c

[PATCH 15/15] drm/amdgpu: Use new interface to reserve bad page

2024-04-17 Thread YiPeng Chai
Use new interface to reserve bad page. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 4 +--- 1 file changed, 1 insertion(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index d1a2ab944b7d

[PATCH 14/15] drm/amdgpu: Fix address translation defect

2024-04-17 Thread YiPeng Chai
retired_page is a page frame and should be expanded to the full address when querying status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd
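Expanding a page frame to a full address is a left shift by the page size exponent. The sketch below assumes 4 KiB pages (a shift of 12); `GPU_PAGE_SHIFT`, `pfn_to_addr` and `addr_to_pfn` are illustrative names, not identifiers taken from the patch.

```c
#include <stdint.h>

#define GPU_PAGE_SHIFT 12 /* assumption: 4 KiB GPU pages */

/* Expand a page frame number to the byte address of the page start. */
static uint64_t pfn_to_addr(uint64_t pfn)
{
    return pfn << GPU_PAGE_SHIFT;
}

/* Inverse: drop the in-page offset to recover the page frame number. */
static uint64_t addr_to_pfn(uint64_t addr)
{
    return addr >> GPU_PAGE_SHIFT;
}
```

Passing a raw page frame where a full address is expected would make the lookup compare values that differ by a factor of the page size, which is the kind of mismatch this patch title describes.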

[PATCH 13/15] drm/amdgpu: support ACA logging ecc errors

2024-04-17 Thread YiPeng Chai
support ACA logging ecc errors. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 5 + 1 file changed, 5 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c index bd917eb6ea24..8df84feaf046 100644 --- a/drivers

[PATCH 12/15] drm/amdgpu: add poison consumption handler

2024-04-17 Thread YiPeng Chai
Add poison consumption handler. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 ++--- 1 file changed, 39 insertions(+), 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 11/15] drm/amdgpu: prepare to handle pasid poison consumption

2024-04-17 Thread YiPeng Chai
Prepare to handle pasid poison consumption. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.c| 9 - drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.h| 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 20 --- drivers/gpu/drm/amd/amdgpu

[PATCH 09/15] drm/amdgpu: add condition check for amdgpu_umc_fill_error_record

2024-04-17 Thread YiPeng Chai
Add condition check for amdgpu_umc_fill_error_record. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 20 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 2 +- 3 files changed, 19 insertions(+), 4 deletions

[PATCH 07/15] drm/amdgpu: umc v12_0 logs ecc errors

2024-04-17 Thread YiPeng Chai
1. umc v12_0 logs ecc errors. 2. Reserve newly detected ecc error pages. 3. Add tag for bad pages, so that they can be retired later. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 67 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 7

[PATCH 10/15] drm/amdgpu: retire bad pages for umc v12_0

2024-04-17 Thread YiPeng Chai
Retire bad pages for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 57 +- 1 file changed, 55 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c index

[PATCH 08/15] drm/amdgpu: Add delay work to retire bad pages

2024-04-17 Thread YiPeng Chai
Add delay work to retire bad pages. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 36 - drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 2 +- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 3 +++ 4 files

[PATCH 06/15] drm/amdgpu: umc v12_0 converts error address

2024-04-17 Thread YiPeng Chai
Umc v12_0 converts error address. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 94 +- drivers/gpu/drm/amd/amdgpu/umc_v12_0.h | 12 2 files changed, 105 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v12_0.c b

[PATCH 05/15] drm/amdgpu: add interface to update umc v12_0 ecc status

2024-04-17 Thread YiPeng Chai
Add interface to update umc v12_0 ecc status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 9 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 6 + drivers/gpu/drm/amd/amdgpu/umc_v12_0.c

[PATCH 04/15] drm/amdgpu: add poison creation handler

2024-04-17 Thread YiPeng Chai
Add poison creation handler. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 74 +++-- 1 file changed, 69 insertions(+), 5 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index

[PATCH 01/15] drm/amdgpu: Add interface to reserve bad page

2024-04-17 Thread YiPeng Chai
Add interface to reserve bad page. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 19 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 4 2 files changed, 23 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd

[PATCH 03/15] drm/amdgpu: prepare for logging ecc errors

2024-04-17 Thread YiPeng Chai
Prepare for logging ecc errors. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 33 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 23 + 2 files changed, 56 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b

[PATCH 02/15] drm/amdgpu: add message fifo to handle RAS poison events

2024-04-17 Thread YiPeng Chai
Add message fifo to handle RAS poison events. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 32 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 18 ++ 2 files changed, 50 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH V2] drm/amdgpu: Fix incorrect return value

2024-04-12 Thread YiPeng Chai
add new nodes for the addresses that are not in the reserved_pages list and reservations_pending list. V2: Avoid repeated locking/unlocking. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 25 +--- 1 file changed, 16 insertions(+), 9 deletions

[PATCH] drm/amdgpu: Fix incorrect return value

2024-04-03 Thread YiPeng Chai
add new nodes for the addresses that are not in the reserved_pages list and reservations_pending list. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 28 +--- 1 file changed, 19 insertions(+), 9 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: Need to resume ras during gpu reset for gfx v9_4_3 sriov

2024-01-30 Thread YiPeng Chai
Need to resume ras during gpu reset for gfx v9_4_3 sriov Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c index afc0b4eb7f8e

[PATCH] drm/amdgpu: Fix driver uninstallation problem

2024-01-23 Thread YiPeng Chai
/0x80 [ 484.496866] ? exc_page_fault+0x87/0x170 [ 484.496868] ? asm_exc_page_fault+0x8/0x30 [ 484.496871] entry_SYSCALL_64_after_hwframe+0x44/0xae Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 6 +- 1 file changed, 5 insertions(+), 1 deletion(-) diff --git a

[PATCH V2 3/5] drm/amdgpu: Use asynchronous polling to handle umc_v12_0 poisoning

2024-01-17 Thread YiPeng Chai
Use asynchronous polling to handle umc_v12_0 poisoning. v2: 1. Change function name. 2. Change the debugging information content. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 139 ++-- drivers

[PATCH V2 4/5] drm/amdgpu: add interface to check mca umc status

2024-01-17 Thread YiPeng Chai
Add interface to check mca umc status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.c | 12 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 4 +++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c| 20

[PATCH V2 5/5] drm/amdgpu: Support retiring multiple MCA error address pages

2024-01-17 Thread YiPeng Chai
Support retiring multiple MCA error address pages in one in-band query for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 8 ++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 66

[PATCH V2 2/5] drm/amdgpu: Prepare for asynchronous processing of umc page retirement

2024-01-17 Thread YiPeng Chai
Preparing for asynchronous processing of umc page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 34 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 5 2 files changed, 39 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH V2 1/5] drm/amdgpu: Add log info for umc_v12_0 and smu_v13_0_6

2024-01-17 Thread YiPeng Chai
Add log info for umc_v12_0 and smu_v13_0_6. v2: Delete redundant logs. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 11 +++ drivers/gpu/drm/amd/amdkfd/kfd_events.c | 6 +- 2 files changed, 16 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm

[PATCH 1/5] drm/amdgpu: Add log info for umc_v12_0 and smu_v13_0_6

2024-01-16 Thread YiPeng Chai
Add log info for umc_v12_0 and smu_v13_0_6. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 11 +++ drivers/gpu/drm/amd/amdkfd/kfd_events.c | 6 +- .../gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c| 13 + 3 files

[PATCH 2/5] drm/amdgpu: Prepare for asynchronous processing of umc page retirement

2024-01-16 Thread YiPeng Chai
Preparing for asynchronous processing of umc page retirement. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 34 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 5 2 files changed, 39 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH 5/5] drm/amdgpu: Support retiring multiple MCA error address pages

2024-01-16 Thread YiPeng Chai
Support retiring multiple MCA error address pages in one in-band query for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 43 +--- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 8 ++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 66

[PATCH 4/5] drm/amdgpu: add interface to check mca umc status

2024-01-16 Thread YiPeng Chai
Add interface to check mca umc status. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.c | 12 ++- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 4 +++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c| 20

[PATCH 3/5] drm/amdgpu: Use asynchronous polling to handle umc_v12_0 poisoning

2024-01-16 Thread YiPeng Chai
Use asynchronous polling to handle umc_v12_0 poisoning. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 143 +++- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 3 + 3 files changed, 120 insertions(+), 31

[PATCH 1/4] drm/amdgpu: MCA supports recording umc address information

2023-12-18 Thread YiPeng Chai
MCA supports recording umc address information. V2: Move err_addr variable from struct ras_err_node to struct ras_err_info. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_mca.c | 13 +++-- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 22

[PATCH 3/4] drm/amdgpu: Add umc page retirement for umc v12_0

2023-12-18 Thread YiPeng Chai
Add umc page retirement for umc v12_0. V2: 1. Changed umc page retirement check condition to call umc_v12_0_is_uncorrectable_error. 2. Use memset to clear the contents of the umc error address structure. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c | 56

[PATCH 4/4] drm/amd/pm: smu v13_0_6 supports ecc info by default

2023-12-18 Thread YiPeng Chai
smu v13_0_6 supports ecc info by default. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c | 8 1 file changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c b/drivers/gpu/drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c

[PATCH 2/4] drm/amdgpu: Add poison mode check error condition for umc v12_0

2023-12-18 Thread YiPeng Chai
Add poison mode check error condition for umc v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v12_0.c| 20 ++- drivers/gpu/drm/amd/amdgpu/umc_v12_0.h| 4 ++-- .../drm/amd/pm/swsmu/smu13/smu_v13_0_6_ppt.c | 4 ++-- 3 files changed, 19

[PATCH] drm/amdgpu: support saving bad pages after gpu ras reset

2023-10-18 Thread YiPeng Chai
Support saving bad pages after gpu ras reset for umc_v12_0. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 40 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 35 ++ drivers/gpu/drm

[PATCH] drm/amdgpu: Enable ras for mp0 v13_0_6 sriov

2023-08-15 Thread YiPeng Chai
Enable ras for mp0 v13_0_6 sriov Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index 7689395e44fd..378478cf9c21 100644 --- a/drivers

[PATCH V2] drm/amdgpu: mode1 reset needs to recover mp1 for mp0 v13_0_10

2023-08-09 Thread YiPeng Chai
Mode1 reset needs to recover mp1 in fatal error case for mp0 v13_0_10. v2: Define a macro to wrap psp function calls. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h | 5 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 2 ++ drivers/gpu/drm/amd/amdgpu/psp_v13_0.c

[PATCH] drm/amdgpu: mode1 reset needs to recover mp1 for mp0 v13_0_10

2023-08-08 Thread YiPeng Chai
Mode1 reset needs to recover mp1 in fatal error case for mp0 v13_0_10. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp.h | 1 + drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 3 +++ drivers/gpu/drm/amd/amdgpu/psp_v13_0.c | 24 +++- 3 files changed, 27

[PATCH 2/2] drm/amdgpu: fix incorrect vmhub index

2023-07-19 Thread YiPeng Chai
Fix incorrect vmhub index. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c | 5 - 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c index d04fc0f19a29..c0b588e5d6aa 100644 --- a

[PATCH 1/2] drm/amdgpu: fix printing empty string array

2023-07-19 Thread YiPeng Chai
Fix printing empty string array. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c b/drivers/gpu/drm/amd/amdgpu/gmc_v11_0.c index c571f0d95994..d04fc0f19a29

[PATCH 2/2] drm/amdgpu: not update the same version ras ta

2023-07-19 Thread YiPeng Chai
Do not update the ras ta when the version is the same. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 20 +++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c index

[PATCH 1/2] drm/amdgpu: add ta initialization failure check condition

2023-07-19 Thread YiPeng Chai
Add ta initialization failure check condition. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_psp_ta.c index

[PATCH] drm/amdgpu: gpu recovers from fatal error in poison mode

2023-06-25 Thread YiPeng Chai
When a fatal error occurs in ras poison mode, mode1 reset is used to recover the gpu. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 11 +++ drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 1 + 2 files changed, 12 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: change reserved vram info print

2023-05-25 Thread YiPeng Chai
The link object of mgr->reserved_pages is the blocks variable in struct amdgpu_vram_reservation, not the link variable in struct drm_buddy_block. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_vram_mgr.c | 7 --- 1 file changed, 4 insertions(+), 3 deletions(-) diff --

[PATCH] drm/amdgpu: perform mode2 reset for sdma fed error on gfx v11_0_3

2023-05-16 Thread YiPeng Chai
perform mode2 reset for sdma fed error on gfx v11_0_3. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 8 +++- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 5 + drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c | 14 +- 3 files changed, 25 insertions(+), 2

[PATCH V2] amd/amdgpu: perform soft reset for sdma fed error

2023-05-15 Thread YiPeng Chai
When the sdma ib ring test fails to detect an sdma hang for an sdma fed error, force a soft reset. V2: Add poison mode support check for special code path. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 19 +++ 1 file changed, 19 insertions

[PATCH] amd/amdgpu: perform soft reset for sdma fed error

2023-05-12 Thread YiPeng Chai
When the sdma ib ring test fails to detect an sdma hang caused by an sdma fed error, force a soft reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 16 1 file changed, 16 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c b/drivers/gpu

[PATCH 3/3] drm/amdgpu: adjust gpu reset sequence for gfx v11_0_3

2023-04-27 Thread YiPeng Chai
When gfx ras poison consumption causes gpu reset on gfx v11_0_3, the sequence of gpu reset is "soft reset -> mode2 reset -> mode1 reset". If the previous reset fails, fall back to the next reset. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdg

[PATCH 1/3] drm/amdgpu: add variable to record gpu reset reason

2023-04-27 Thread YiPeng Chai
Add variable to record gpu reset reason. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h | 3 +++ drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c | 6 +- 2 files changed, 8 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.h b/drivers/gpu/drm

[PATCH 2/3] drm/amdgpu: don't output mes error message when gfx hang during gpu reset

2023-04-27 Thread YiPeng Chai
: recover vram bo from shadow done [ 390.931067] amdgpu :63:00.0: amdgpu: GPU reset(1) succeeded! Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 2 ++ drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 10 ++ drivers/gpu/drm/amd/amdgpu

[PATCH] drm/amdgpu: add gfx v11_0_3 fed irq handling for sriov

2023-04-11 Thread YiPeng Chai
Add gfx v11_0_3 fed irq handling for sriov. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c | 14 +++--- 1 file changed, 11 insertions(+), 3 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/gfx_v11_0_3.c b/drivers/gpu/drm/amd

[PATCH 2/2] drm/amdgpu: optimize redundant code in umc_v6_7

2023-04-02 Thread YiPeng Chai
Optimize redundant code in umc_v6_7. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/umc_v6_7.c | 162 +++--- 1 file changed, 71 insertions(+), 91 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/umc_v6_7.c b/drivers/gpu/drm/amd/amdgpu/umc_v6_7.c index

[PATCH 1/2] drm/amdgpu: optimize redundant code in umc_v8_10

2023-04-02 Thread YiPeng Chai
Optimize redundant code in umc_v8_10. Signed-off-by: YiPeng Chai --- drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 31 drivers/gpu/drm/amd/amdgpu/amdgpu_umc.h | 7 + drivers/gpu/drm/amd/amdgpu/umc_v8_10.c | 197 +--- 3 files changed, 115 insertions(+), 120 deletions

[PATCH 2/3] drm/amdgpu: reinit mes ip block during reset on SRIOV

2023-03-20 Thread YiPeng Chai
Reinit mes ip block during reset on SRIOV. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c index

[PATCH 3/3] drm/amdgpu: resume ras for gfx v11_0_3 during reset on SRIOV

2023-03-20 Thread YiPeng Chai
Gfx v11_0_3 supports ras on SRIOV, so ras needs to be resumed during reset. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_device.c b

[PATCH 1/3] drm/amdgpu: enable ras for mp0 v13_0_10 on SRIOV

2023-03-20 Thread YiPeng Chai
Enable ras for mp0 v13_0_10 on SRIOV. Signed-off-by: YiPeng Chai Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c index 63dfcc98152d

[PATCH 2/2] drm/amdgpu: Optimize sdma ras block initialization code for sdma v4_0

2023-01-18 Thread YiPeng Chai
Optimize sdma ras block initialization code for sdma v4_0. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c | 21 + 1 file changed, 5 insertions(+), 16 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/sdma_v4_0.c b/drivers

[PATCH 1/2] drm/amdgpu: Add sdma ras function on sdma v6_0_3

2023-01-18 Thread YiPeng Chai
Add sdma ras function on sdma v6_0_3. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou --- drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.c | 35 drivers/gpu/drm/amd/amdgpu/amdgpu_sdma.h | 1 + drivers/gpu/drm/amd/amdgpu/sdma_v6_0.c | 24 3 files changed

[PATCH 2/2] drm/amdgpu: Adjust ras support check condition for special asic

2023-01-12 Thread YiPeng Chai
ras block supports ras function. Signed-off-by: YiPeng Chai Reviewed-by: Tao Zhou Reviewed-by: Hawking Zhang --- drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c | 17 - 1 file changed, 16 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras.c b/drivers/gpu
