Re: [PATCH 2/2] drm/amd/amdgpu: force flush resubmit job

2021-02-25 Thread Christian König
Good catch, but the approach for the fix is incorrect. The device reset count can only be incremented after taking the reset lock and stopping the scheduler, otherwise a whole bunch of different race conditions can happen. Christian. Am 25.02.21 um 08:56 schrieb Chen, JingWen: [AMD Official

RE: [PATCH 2/2] drm/amd/amdgpu: force flush resubmit job

2021-02-25 Thread Liu, Monk
[AMD Official Use Only - Internal Distribution Only] Yeah, that sounds better than original fix Thanks Christian -- Monk Liu | Cloud-GPU Core team -- -Original Message- From: Koenig, Christian Sent: Thurs

Re: [PATCH v3 01/11] drm/atomic: Pass the full state to planes async atomic check and update

2021-02-25 Thread Maxime Ripard
Hi, On Wed, Feb 24, 2021 at 12:33:45PM +0100, Thomas Zimmermann wrote: > Hi Maxime, > > for the whole series: > > Acked-by: Thomas Zimmermann Applied the whole series, thanks to everyone involved in the review, it's been a pretty daunting one :) Maxime signature.asc Description: PGP signatu

[PATCH] drm/amdgpu: Replace DEFINE_SIMPLE_ATTRIBUTE with DEFINE_DEBUGFS_ATTRIBUTE

2021-02-25 Thread Yang Li
Fix the following coccicheck warning: ./drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c:1589:0-23: WARNING: fops_ib_preempt should be defined with DEFINE_DEBUGFS_ATTRIBUTE ./drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c:1592:0-23: WARNING: fops_sclk_set should be defined with DEFINE_DEBUGFS_ATTRIBUTE Re

[PATCH] drm/amd/amdgpu: move inc gpu_reset_counter after drm_sched_stop

2021-02-25 Thread Jingwen Chen
Move gpu_reset_counter after drm_sched_stop to avoid race condition caused by job submitted between reset_count +1 and drm_sched_stop. Signed-off-by: Jingwen Chen --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/

Re: [PATCH] drm/amd/amdgpu: move inc gpu_reset_counter after drm_sched_stop

2021-02-25 Thread Christian König
Am 25.02.21 um 10:16 schrieb Jingwen Chen: Move gpu_reset_counter after drm_sched_stop to avoid race condition caused by job submitted between reset_count +1 and drm_sched_stop. Signed-off-by: Jingwen Chen Reviewed-by: Christian König --- drivers/gpu/drm/amd/amdgpu/amdgpu_device.c | 3 ++

Re: [PATCH v4 00/14] RFC Support hot device unplug in amdgpu

2021-02-25 Thread Daniel Vetter
On Wed, Feb 24, 2021 at 11:30:50AM -0500, Andrey Grodzovsky wrote: > > On 2021-02-19 5:24 a.m., Daniel Vetter wrote: > > On Thu, Feb 18, 2021 at 9:03 PM Andrey Grodzovsky > > wrote: > > > Looked a bit into it, I want to export sync_object to FD and import from > > > that FD > > > such that I wi

Re: [PATCH 088/159] drm/ttm: ioremap buffer properly according to TTM placement flag

2021-02-25 Thread Christian König
The whole patch set needs a rebase since the TTM_PL_FLAG_* for controlling the caching doesn't exists any more upstream. How should we approach that? Thanks, Christian. Am 24.02.21 um 23:17 schrieb Alex Deucher: From: Oak Zeng If TTM placement flag is cached, buffer is intended to be mapped

Re: [PATCH] drm/amdgpu: add ih call to process until checkpoint

2021-02-25 Thread Christian König
Am 25.02.21 um 04:15 schrieb Felix Kuehling: On 2021-02-24 10:54 a.m., Kim, Jonathan wrote: [AMD Official Use Only - Internal Distribution Only] -Original Message- From: Koenig, Christian Sent: Wednesday, February 24, 2021 4:17 AM To: Kim, Jonathan ; amd- g...@lists.freedesktop.org

[PATCH] drm/amd/display: remove unnecessary conversion to bool

2021-02-25 Thread Jiapeng Chong
Fix the following coccicheck warnings: ./drivers/gpu/drm/amd/display/dc/dcn30/dcn30_dpp_cm.c:243:67-72: WARNING: conversion to bool not needed here. Reported-by: Abaci Robot Signed-off-by: Jiapeng Chong --- drivers/gpu/drm/amd/display/dc/dcn30/dcn30_dpp_cm.c | 2 +- 1 file changed, 1 insertion

[PATCH] drm/amd/display: fix 64-bit integer division

2021-02-25 Thread Arnd Bergmann
From: Arnd Bergmann The new display synchronization code caused a regression on all 32-bit architectures: ld.lld: error: undefined symbol: __aeabi_uldivmod >>> referenced by dce_clock_source.c >>> >>> gpu/drm/amd/display/dc/dce/dce_clock_source.o:(get_pixel_clk_frequency_100hz) >>

Re: [PATCH] drm/amdgpu: enable one vf mode on navi21 vf

2021-02-25 Thread Deucher, Alexander
[AMD Official Use Only - Internal Distribution Only] Acked-by: Alex Deucher From: Horace Chen Sent: Thursday, February 25, 2021 7:04 AM To: amd-gfx@lists.freedesktop.org Cc: Grodzovsky, Andrey ; Quan, Evan ; Chen, Horace ; Tuikov, Luben ; Koenig, Christian ; D

[PATCH] drm/amd/display: Fix an uninitialized index variable

2021-02-25 Thread Arnd Bergmann
From: Arnd Bergmann clang points out that the new logic uses an always-uninitialized array index: drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:9810:38: warning: variable 'i' is uninitialized when used here [-Wuninitialized] timing = &edid->detailed_timing

Re: [PATCH] drm/amdgpu: add ih call to process until checkpoint

2021-02-25 Thread Felix Kuehling
Am 2021-02-25 um 8:53 a.m. schrieb Christian König: > > > Am 25.02.21 um 04:15 schrieb Felix Kuehling: >> On 2021-02-24 10:54 a.m., Kim, Jonathan wrote: >>> [AMD Official Use Only - Internal Distribution Only] >>> -Original Message- From: Koenig, Christian Sent: Wednesday, F

[PATCH] amdgpu/pm: read_sensor() report failure apporpriately

2021-02-25 Thread Shirish S
report -ENOTSUPP instead of -EINVAL, so that if userspace fails to read sensor data can figure it out the failure correctly. Signed-off-by: Shirish S --- drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu10_hwmgr.c | 2 +- drivers/gpu/drm/amd/pm/powerplay/hwmgr/smu7_hwmgr.c | 2 +- drivers/gpu/drm/am

Re: [PATCH v2] drm/scheduler: Fix hang when sched_entity released

2021-02-25 Thread Andrey Grodzovsky
On 2021-02-25 2:53 a.m., Christian König wrote: Am 24.02.21 um 16:13 schrieb Andrey Grodzovsky: Ping Sorry, I've been on vacation this week. Andrey On 2021-02-20 7:12 a.m., Andrey Grodzovsky wrote: On 2/20/21 3:38 AM, Christian König wrote: Am 18.02.21 um 17:41 schrieb Andrey Grodzov

Re: [PATCH v4 00/14] RFC Support hot device unplug in amdgpu

2021-02-25 Thread Andrey Grodzovsky
On 2021-02-25 5:25 a.m., Daniel Vetter wrote: On Wed, Feb 24, 2021 at 11:30:50AM -0500, Andrey Grodzovsky wrote: On 2021-02-19 5:24 a.m., Daniel Vetter wrote: On Thu, Feb 18, 2021 at 9:03 PM Andrey Grodzovsky wrote: Looked a bit into it, I want to export sync_object to FD and import from t

[PATCH] drm/amdgpu: enable TMZ by default on Raven asics

2021-02-25 Thread Alex Deucher
This has been stable for a while. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/amdgpu_gmc.c | 10 ++ 2 files changed, 12 insertions(+), 2 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers/gpu/drm

Re: [PATCH] drm/amdgpu: add ih call to process until checkpoint

2021-02-25 Thread Christian König
Am 25.02.21 um 16:35 schrieb Felix Kuehling: Am 2021-02-25 um 8:53 a.m. schrieb Christian König: Am 25.02.21 um 04:15 schrieb Felix Kuehling: On 2021-02-24 10:54 a.m., Kim, Jonathan wrote: [AMD Official Use Only - Internal Distribution Only] -Original Message- From: Koenig, Christ

Re: [PATCH] drm/amdgpu: Remove amdgpu_device arg from free_sgt api

2021-02-25 Thread Christian König
Am 25.02.21 um 03:49 schrieb Ramesh Errabolu: Currently callers have to provide handle of amdgpu_device, which is not used by the implementation. It is unlikely this parameter will become useful in future, thus removing it Signed-off-by: Ramesh Errabolu Reviewed-by: Christian König --- d

Re: [PATCH v6 3/3] drm/amd/display: Skip modeset for front porch change

2021-02-25 Thread Kazlauskas, Nicholas
On 2021-02-12 8:08 p.m., Aurabindo Pillai wrote: [Why] A seamless transition between modes can be performed if the new incoming mode has the same timing parameters as the optimized mode on a display with a variable vtotal min/max. Smooth video playback usecases can be enabled with this seamless

Re: [PATCH 147/159] drm/amdgpu: restore aldebaran save ttmp and trap config on init (v2)

2021-02-25 Thread Deucher, Alexander
[AMD Official Use Only - Internal Distribution Only] I dropped the KFD debugger hunks and just added the gfx 9.4.2 changes since these were required for a bunch of later patches that build on that file that are not dependent on debugger. I can rework the commit message if you'd like. Alex ___

Re: [PATCH] drm/amdgpu: add ih call to process until checkpoint

2021-02-25 Thread Felix Kuehling
Am 2021-02-25 um 11:48 a.m. schrieb Christian König: > > > Am 25.02.21 um 16:35 schrieb Felix Kuehling: >> Am 2021-02-25 um 8:53 a.m. schrieb Christian König: >>> >>> Am 25.02.21 um 04:15 schrieb Felix Kuehling: On 2021-02-24 10:54 a.m., Kim, Jonathan wrote: > [AMD Official Use Only - Inte

Re: [PATCH v2] drm/scheduler: Fix hang when sched_entity released

2021-02-25 Thread Christian König
Am 25.02.21 um 17:03 schrieb Andrey Grodzovsky: On 2021-02-25 2:53 a.m., Christian König wrote: Am 24.02.21 um 16:13 schrieb Andrey Grodzovsky: Ping Sorry, I've been on vacation this week. Andrey On 2021-02-20 7:12 a.m., Andrey Grodzovsky wrote: On 2/20/21 3:38 AM, Christian König wr

Re: [PATCH 147/159] drm/amdgpu: restore aldebaran save ttmp and trap config on init (v2)

2021-02-25 Thread Felix Kuehling
Am 2021-02-25 um 1:32 p.m. schrieb Deucher, Alexander: > > [AMD Official Use Only - Internal Distribution Only] > > > I dropped the KFD debugger hunks and just added the gfx 9.4.2 changes > since these were required for a bunch of later patches that build on > that file that are not dependent on d

[PATCH 1/3] drm/amdgpu: update default timeout of Aldebaran SQ watchdog

2021-02-25 Thread Alex Deucher
From: Harish Kasiviswanathan Signed-off-by: Harish Kasiviswanathan Reivewed-by: Hawking Zhang Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 4 ++-- drivers/gpu/drm/amd/amdgpu/gfx_v9_4_2.c | 7 +++ 2 files changed, 9 insertions(+), 2 deletions(-) diff --git a/d

[PATCH 2/3] drm/amdgpu: Make noretry the default on Aldebaran

2021-02-25 Thread Alex Deucher
From: Felix Kuehling This is needed for best machine learning performance. XNACK can still be enabled per-process if needed. Signed-off-by: Felix Kuehling Reviewed-by: Alex Deucher Reviewed-by: Philip Yang Tested-by: Alex Sierra Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/am

[PATCH 3/3] drm/amdgpu: harvest edc status when connected to host via xGMI

2021-02-25 Thread Alex Deucher
From: Dennis Li When connected to a host via xGMI, system fatal errors may trigger warm reset, driver has no change to query edc status before reset. Therefore in this case, driver should harvest previous error loging registers during boot, instead of only resetting them. v2: 1. IP's ras_manager

[PATCH 135/159] drm/amdgpu: workaround the TMR MC address issue (v2)

2021-02-25 Thread Alex Deucher
From: Oak Zeng With the 2-level gart page table, vram is squeezed into gart aperture and FB aperture is disabled. Therefore all VRAM virtual addresses are in the GART aperture. However currently PSP requires TMR addresses in FB aperture. So we need some design change at PSP FW level to support

Re: [PATCH] drm/amdgpu: add ih call to process until checkpoint

2021-02-25 Thread Christian König
Am 25.02.21 um 19:33 schrieb Felix Kuehling: [SNIP] This in turn can lead to starvation of the work handler and so a life lock as well. I won't touch rptr or wptr at all for this. Not sure what's your idea here, using ih->lock. Is it to completely drain all IRQs until the IH ring is complet

[PATCH 1/5] drm/amdgpu: add asic callback for querying video codec info (v3)

2021-02-25 Thread Alex Deucher
This will be used by a new INFO ioctl query to fetch the decode and encode capabilities from the kernel driver rather than hardcoding them in mesa. This gives us more fine grained control of capabilities using information that is only availabl in the kernel (e.g., platform limitations or bandwidth

[PATCH 3/5] drm/amdgpu: add INFO ioctl support for querying video caps (v4)

2021-02-25 Thread Alex Deucher
We currently hardcode these in mesa, but querying them from the kernel makes more sense since there may be board specific limitations that the kernel driver is better suited to determining. Userpace patches that use this interface: https://gitlab.freedesktop.org/leoliu/drm/-/commits/info_video_cap

[PATCH 4/5] drm/amdgpu: bump driver version for new video codec INFO ioctl query

2021-02-25 Thread Alex Deucher
So mesa can check when to query the kernel vs use hardcoded codec bandwidth data. Reviewed-by: Leo Liu Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c | 3 ++- 1 file changed, 2 insertions(+), 1 deletion(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_drv.c b/drivers

[PATCH 2/5] drm/amdgpu: add video decode/encode cap tables and asic callbacks (v3)

2021-02-25 Thread Alex Deucher
For each asic family. Will be used to populate tables for the new INFO ioctl query. v2: add max_pixels_per_frame to handle the portrait case v3: fix copy paste typos Reviewed-by: Leo Liu (v1) Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/cik.c | 75 ++ drivers/gpu/drm/

[PATCH 5/5] drm/amdgpu/codec: drop the internal codec index

2021-02-25 Thread Alex Deucher
And just use the ioctl index. They are the same. Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu.h | 9 - drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c | 16 - drivers/gpu/drm/amd/amdgpu/cik.c| 12 --- drivers/gpu/drm/amd/amdgpu/nv.c | 36 +++

Re: [PATCH 1/5] drm/amdgpu: add asic callback for querying video codec info (v3)

2021-02-25 Thread Christian König
Am 25.02.21 um 21:16 schrieb Alex Deucher: This will be used by a new INFO ioctl query to fetch the decode and encode capabilities from the kernel driver rather than hardcoding them in mesa. This gives us more fine grained control of capabilities using information that is only availabl in the ke

Re: [PATCH] drm/amdgpu: add missing df counter disable write

2021-02-25 Thread Alex Deucher
On Tue, Feb 23, 2021 at 4:34 PM Jonathan Kim wrote: > > Request to stop DF performance counters is missing the actual write to the > controller register. > > Reported-by: Chris Freehill > Signed-off-by: Jonathan Kim Acked-by: Alex Deucher > --- > drivers/gpu/drm/amd/amdgpu/df_v3_6.c | 2 ++ >

Re: [PATCH v2] drm/scheduler: Fix hang when sched_entity released

2021-02-25 Thread Andrey Grodzovsky
On 2021-02-25 1:42 p.m., Christian König wrote: Am 25.02.21 um 17:03 schrieb Andrey Grodzovsky: On 2021-02-25 2:53 a.m., Christian König wrote: Am 24.02.21 um 16:13 schrieb Andrey Grodzovsky: Ping Sorry, I've been on vacation this week. Andrey On 2021-02-20 7:12 a.m., Andrey Grodzovs

Re: [PATCH] drm/amdgpu: Replace DEFINE_SIMPLE_ATTRIBUTE with DEFINE_DEBUGFS_ATTRIBUTE

2021-02-25 Thread Alex Deucher
On Thu, Feb 25, 2021 at 4:02 AM Yang Li wrote: > > Fix the following coccicheck warning: > ./drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c:1589:0-23: WARNING: > fops_ib_preempt should be defined with DEFINE_DEBUGFS_ATTRIBUTE > ./drivers/gpu/drm/amd/amdgpu/amdgpu_debugfs.c:1592:0-23: WARNING: > fops_

Re: [PATCH] drm/amd/display: remove unnecessary conversion to bool

2021-02-25 Thread Alex Deucher
On Thu, Feb 25, 2021 at 4:19 AM Jiapeng Chong wrote: > > Fix the following coccicheck warnings: > > ./drivers/gpu/drm/amd/display/dc/dcn30/dcn30_dpp_cm.c:243:67-72: > WARNING: conversion to bool not needed here. > > Reported-by: Abaci Robot > Signed-off-by: Jiapeng Chong Applied. Thanks! Alex

Re: [PATCH] drm/amd/display: fix 64-bit integer division

2021-02-25 Thread Arnd Bergmann
On Thu, Feb 25, 2021 at 3:33 PM Arnd Bergmann wrote: > > From: Arnd Bergmann > > The new display synchronization code caused a regression > on all 32-bit architectures: > > ld.lld: error: undefined symbol: __aeabi_uldivmod > >>> referenced by dce_clock_source.c > >>> > >>> gpu/drm/

[PATCH v3] drm/scheduler: Fix hang when sched_entity released

2021-02-25 Thread Andrey Grodzovsky
Problem: If scheduler is already stopped by the time sched_entity is released and entity's job_queue not empty I encountred a hang in drm_sched_entity_flush. This is because drm_sched_entity_is_idle never becomes false. Fix: In drm_sched_fini detach all sched_entities from the scheduler's run queu

Re: [PATCH] drm/amd/display: fix 64-bit integer division

2021-02-25 Thread Stempen, Vladimir
[AMD Official Use Only - Internal Distribution Only] Hi Arnd, I have all the patches ready and I have tested them with the feature/platform I'm working on and Bindu helped to test the 32bit build. I'm in process of submitting the latest change. Thanks, Vladimir. On 2021-02-25, 4:36 PM, "Arnd Be

Re: [PATCH] drm/amd/display: Fix an uninitialized index variable

2021-02-25 Thread Arnd Bergmann
On Thu, Feb 25, 2021 at 10:34 PM 'Nick Desaulniers' via Clang Built Linux wrote: > return parse_edid_cea(aconnector, edid_ext, EDID_LENGTH, vsdb_info) ? i : > -ENODEV; > > would suffice, but the patch is still fine as is. Right, I did not want to change more than necessary here, and the original

Re: [PATCH] drm/amd/display: Fix an uninitialized index variable

2021-02-25 Thread Alex Deucher
On Thu, Feb 25, 2021 at 10:01 AM Arnd Bergmann wrote: > > From: Arnd Bergmann > > clang points out that the new logic uses an always-uninitialized > array index: > > drivers/gpu/drm/amd/amdgpu/../display/amdgpu_dm/amdgpu_dm.c:9810:38: warning: > variable 'i' is uninitialized when used here [-Wun

[PATCH] drm/amdgpu: remove unused variables

2021-02-25 Thread Alex Deucher
Not used so remove them. Fixes: d2d0c920a127 ("drm/amdgpu: Remove amdgpu_device arg from free_sgt api") Cc: Ramesh Errabolu Signed-off-by: Alex Deucher --- drivers/gpu/drm/amd/amdgpu/amdgpu_dma_buf.c | 4 1 file changed, 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_dma_bu

[PATCH] drm/amdgpu: remove unnecessary reading for epprom header

2021-02-25 Thread Dennis Li
If the number of badpage records exceed the threshold, driver has updated both epprom header and control->tbl_hdr.header before gpu reset, therefore GPU recovery thread no need to read epprom header directly. Signed-off-by: Dennis Li diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_ras_eeprom.c b

[PATCH] drm/amdgpu: add RAP TA version print in amdgpu_firmware_info

2021-02-25 Thread Kevin Wang
add RAP TA version print in amdgpu_firmware_info. Signed-off-by: Kevin Wang --- drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c | 8 1 file changed, 8 insertions(+) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c b/drivers/gpu/drm/amd/amdgpu/amdgpu_kms.c index ce031a77cda5..a5ed9530f542

[PATCH] drm/amdgpu: remove unused variable in amdgpu_dma_buf_unmap()

2021-02-25 Thread Kevin Wang
clean up unsued variable in amdgpu_dma_buf_unmap(). Fixes: drm/amdgpu: Remove amdgpu_device arg from free_sgt api Signed-off-by: Kevin Wang --- drivers/gpu/drm/amd/amdgpu/amdgpu_dma_buf.c | 4 1 file changed, 4 deletions(-) diff --git a/drivers/gpu/drm/amd/amdgpu/amdgpu_dma_buf.c b/drive

RE: [PATCH] drm/amdgpu: add RAP TA version print in amdgpu_firmware_info

2021-02-25 Thread Zhang, Hawking
[AMD Public Use] Reviewed-by: Hawking Zhang -Original Message- From: Wang, Kevin(Yang) Sent: Friday, February 26, 2021 12:24 To: amd-gfx@lists.freedesktop.org Cc: Zhang, Hawking ; Li, Candice ; Wang, Kevin(Yang) Subject: [PATCH] drm/amdgpu: add RAP TA version print in amdgpu_firmware

RE: [PATCH] drm/amdgpu: remove unused variable in amdgpu_dma_buf_unmap()

2021-02-25 Thread Zhang, Hawking
[AMD Public Use] Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: amd-gfx On Behalf Of Kevin Wang Sent: Friday, February 26, 2021 12:25 To: amd-gfx@lists.freedesktop.org Cc: Wang, Kevin(Yang) Subject: [PATCH] drm/amdgpu: remove unused variable in amdgpu_dma_buf_unma

RE: [PATCH] drm/amdgpu: remove unnecessary reading for epprom header

2021-02-25 Thread Zhang, Hawking
[AMD Public Use] What about merge this function with amdgpu_ras_check_err_threshold? Regards, Hawking -Original Message- From: Dennis Li Sent: Friday, February 26, 2021 09:26 To: amd-gfx@lists.freedesktop.org; Chen, Guchun ; Zhang, Hawking ; Koenig, Christian Cc: Li, Dennis Subject:

RE: [PATCH] drm/amdgpu: remove unnecessary reading for epprom header

2021-02-25 Thread Li, Dennis
Hi, Hawking, Agree with your suggestion, and it could further simplify our codes. I will refactor them again. Best Regards Dennis Li -Original Message- From: Zhang, Hawking Sent: Friday, February 26, 2021 12:30 PM To: Li, Dennis ; amd-gfx@lists.freedesktop.org; Chen, Guchun ; Ko

RE: [PATCH] drm/amdgpu: enable one vf mode on navi21 vf

2021-02-25 Thread Quan, Evan
[AMD Public Use] Acked-by: Evan Quan -Original Message- From: Horace Chen Sent: Thursday, February 25, 2021 8:05 PM To: amd-gfx@lists.freedesktop.org Cc: Grodzovsky, Andrey ; Quan, Evan ; Chen, Horace ; Tuikov, Luben ; Koenig, Christian ; Deucher, Alexander ; Xiao, Jack ; Zhang, Ha

RE: [PATCH] amdgpu/pm: read_sensor() report failure apporpriately

2021-02-25 Thread Quan, Evan
[AMD Official Use Only - Internal Distribution Only] Reviewed-by: Evan Quan -Original Message- From: amd-gfx On Behalf Of Shirish S Sent: Thursday, February 25, 2021 11:45 PM To: Deucher, Alexander ; amd-gfx@lists.freedesktop.org Cc: S, Shirish Subject: [PATCH] amdgpu/pm: read_sensor(

[RFC] a new approach to detect which ring is the real black sheep upon TDR reported

2021-02-25 Thread Liu, Monk
[AMD Public Use] Hi all NAVI2X project hit a really hard to solve issue now, and it is turned out to be a general headache of our TDR mechanism , check below scenario: 1. There is a job1 running on compute1 ring at timestamp 2. There is a job2 running on gfx ring at timestamp 3. Job1

[PATCH] drm/amd/pm: bump Navi1x driver if version and related data structures V2

2021-02-25 Thread Evan Quan
New changes were involved for the SmuMetrics structure. Change-Id: Ib45443db03977ccd18618bcfdfd3574ac13d50d1 Signed-off-by: Evan Quan --- .../drm/amd/pm/inc/smu11_driver_if_navi10.h | 98 ++- drivers/gpu/drm/amd/pm/inc/smu_v11_0.h| 6 +- .../gpu/drm/amd/pm/swsmu/smu11/navi10_ppt.c

[PATCH v2] drm/amdgpu: remove unnecessary reading for epprom header

2021-02-25 Thread Dennis Li
If the number of badpage records exceed the threshold, driver has updated both epprom header and control->tbl_hdr.header before gpu reset, therefore GPU recovery thread no need to read epprom header directly. v2: merge amdgpu_ras_check_err_threshold into amdgpu_ras_eeprom_check_err_threshold Sig

RE: [PATCH v2] drm/amdgpu: remove unnecessary reading for epprom header

2021-02-25 Thread Zhang, Hawking
[AMD Public Use] Reviewed-by: Hawking Zhang Regards, Hawking -Original Message- From: Dennis Li Sent: Friday, February 26, 2021 14:42 To: amd-gfx@lists.freedesktop.org; Chen, Guchun ; Zhang, Hawking ; Koenig, Christian Cc: Li, Dennis Subject: [PATCH v2] drm/amdgpu: remove unnecessar

[PATCH] drm/amd/pm: correct gpu metrics related data structures V2

2021-02-25 Thread Evan Quan
To make sure they are naturally aligned. V2: minimum the possible influence to existing applications which were developed based on those data structures. With this change, only 32bit OS are affected while 64bit OS not. Change-Id: I0a139e1e1f09fe27deffdce1cec6ea4594947625 Signed-off-by: Ev

[PATCH] drm/amd/pm: optimize the link width/speed retrieving V2

2021-02-25 Thread Evan Quan
By using the information provided by PMFW when available. V2: put those structures shared around SMU V11 ASICs in smu_v11_0.h Change-Id: I1afec4cd07ac9608861ee07c449e320e3f36a932 Signed-off-by: Evan Quan Acked-by: Alex Deucher --- drivers/gpu/drm/amd/pm/inc/smu_v11_0.h| 10

Re: [PATCH] radeon: ERROR: space prohibited before that ','

2021-02-25 Thread Christian König
Well coding style clean ups are usually welcome, but not necessarily one by one. We can probably merge this if you clean up all checkpatch.pl warnings in the whole file. Christian. Am 26.02.21 um 07:05 schrieb wangjingyu: drm_property_create_range(rdev->ddev, 0 , "coherent", 0, 1); Signed-

Re: [PATCH 0/3] drm/ttm: constify static vm_operations_structs

2021-02-25 Thread Christian König
Am 23.02.21 um 18:31 schrieb Alex Deucher: On Wed, Feb 10, 2021 at 8:14 AM Daniel Vetter wrote: On Wed, Feb 10, 2021 at 08:45:56AM +0100, Christian König wrote: Reviewed-by: Christian König for the series. Smash it into -misc? @Christian Koenig did these ever land? I don't see them in drm-

Re: [RFC] a new approach to detect which ring is the real black sheep upon TDR reported

2021-02-25 Thread Christian König
Hi Monk, in general an interesting idea, but I see two major problems with that: 1. It would make the reset take much longer. 2. Things get often stuck because of timing issues, so a guilty job might pass perfectly when run a second time. Apart from that the whole ring mirror list turned out