>>>> tree without the new patch-set?
>>>>
>>>> Andrey
>>>>
>>> I think this page fault issue can be seen even on the original tree. It's
>>> just that dropping the concurrent GPU reset protection makes it easier to hit.
>>>
>>> We may need a new way to prote
Andrey ; Deng, Emily ; Liu, Monk ;
dri-devel@lists.freedesktop.org; amd-...@lists.freedesktop.org; Chen, Horace ; Chen, JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
Hi Jingwen,
well what I mean is that we need to adjust the implementatio
ch job timeout on each
>>>>>>>>>> queue. Otherwise you have a race condition between the hypervisor
>>>>>>>>>> and the scheduler.
>>>>>>>>>>
>>>>>>>>>> Properly setting in_gpu_reset i
We may need a new way to protect the reset in SRIOV.
>
>>>>>>> Andrey
>>>>>>>
>>>>>>>
>>>>>>>> Regards,
>>>>>>>> Christian.
>>>>>>>>
>>>>>>>
n König ; Grodzovsky,
Andrey ; Deng, Emily ; Liu, Monk ;
dri-devel@lists.freedesktop.org; amd-...@lists.freedesktop.org; Chen, Horace ; Chen, JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
Hi Jingwen,
well what I mean is that w
anuary 4, 2022 6:19 PM
To: Chen, JingWen ; Christian König ; Grodzovsky,
Andrey ; Deng, Emily ; Liu, Monk ;
dri-devel@lists.freedesktop.org; amd-...@lists.freedesktop.org; Chen, Horace ; Chen, JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
f
>>>>>>>>> signaling the need for a reset, similar to each job timeout on each
>>>>>>>>> queue. Otherwise you have a race condition between the hypervisor and
>>>>>>>>> the scheduler.
>>>>>>> No it's not, FLR from hypervisor is j
lready executed, but host will do FLR anyway
>>>>>> without waiting for guest too long
>>>>>>
>>>>>>>> In other words I strongly think that the current SRIOV reset
>>>>>>>> implementation is severely broken and wha
Chen, JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset
protection for SRIOV
Hi Jingwen,
well what I mean is that we need to adjust the implementation in
amdgpu to actually match the requirements.
Could be that the reset sequence is question
Chen, Horace ; Chen, JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
Hi Jingwen,
well what I mean is that we need to adjust the implementation in amdgpu to
actually match the requirements.
Could be that the reset sequence is que
hes look good to me. JingWen will pull these patches and do
>>>>>>>> some basic TDR test on sriov environment, and give feedback.
>>>>>>>>
>>>>>>>> Best wishes
>>>>>>>> Emily Deng
>>>>>>>>
>>
>> we are hiring software manager for CVS core team
>> ---------------------------
>>
>> -----Original Message-----
>> From: Koenig, Christian
>> Sent: Tuesday, January 4, 2022 6:19 PM
>> To: Chen, JingWen ; Christian Kö
Liu, Monk ; Chen, JingWen
> ; Christian König
> ; Grodzovsky, Andrey
> ; Deng, Emily ;
> dri-devel@lists.freedesktop.org; amd-...@lists.freedesktop.org; Chen,
> Horace
> Cc: dan...@ffwll.ch
> Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset
> protectio
Andrey ;
dri-devel@lists.freedesktop.org; amd-g...@lists.freedesktop.org;
Chen, Horace ; Chen, JingWen
; Deng, Emily
Cc: dan...@ffwll.ch
Subject: RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU
reset protection for SRIOV
[AMD Offic
:19 PM
> To: Chen, JingWen ; Christian König
> ; Grodzovsky, Andrey
> ; Deng, Emily ; Liu,
> Monk ; dri-devel@lists.freedesktop.org;
> amd-...@lists.freedesktop.org; Chen, Horace ;
> Chen, JingWen
> Cc: dan...@ffwll.ch
> Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop
ktop.org; amd-g...@lists.freedesktop.org;
Chen, Horace ; Chen, JingWen
; Deng, Emily
Cc: dan...@ffwll.ch
Subject: RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset
protection for SRIOV
[AMD Official Use Only]
@Chen, Horace @Chen, JingWe
4, 2022 6:19 PM
To: Chen, JingWen ; Christian König ; Grodzovsky,
Andrey ; Deng, Emily ; Liu, Monk ;
dri-devel@lists.freedesktop.org; amd-...@lists.freedesktop.org; Chen, Horace ; Chen, JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for
t: Tuesday, January 4, 2022 6:19 PM
To: Chen, JingWen ; Christian König
; Grodzovsky, Andrey
; Deng, Emily ; Liu, Monk
; dri-devel@lists.freedesktop.org;
amd-...@lists.freedesktop.org; Chen, Horace ; Chen,
JingWen
Cc: dan...@ffwll.ch
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset p
ngWen ; Deng, Emily
Cc: dan...@ffwll.ch
Subject: RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
[AMD Official Use Only]
@Chen, Horace @Chen, JingWen @Deng, Emily
Please review Andrey's patch.
Thanks
---
gWen @Deng, Emily
>>>>>
>>>>> Please review Andrey's patch.
>>>>>
>>>>> Thanks
>>>>> ---
>>>>> Monk Liu | Cloud GPU & Virtualization Solut
Christian ; Grodzovsky, Andrey
; dri-devel@lists.freedesktop.org; amd-
g...@lists.freedesktop.org; Chen, Horace ; Chen,
JingWen ; Deng, Emily
Cc: dan...@ffwll.ch
Subject: RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset
protection
for SRIOV
[AMD Official Use Only]
@Chen, Horace @Che
; dri-
de...@lists.freedesktop.org; amd-...@lists.freedesktop.org
Cc: dan...@ffwll.ch; Liu, Monk ; Chen, Horace
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
Am 22.12.21 um 23:14 schrieb Andrey Grodzovsky:
Since now flr work is serialized against GPU resets there is no need
: Thursday, December 23, 2021 6:14 PM
To: Koenig, Christian ; Grodzovsky, Andrey
; dri-devel@lists.freedesktop.org; amd-
g...@lists.freedesktop.org; Chen, Horace ; Chen,
JingWen ; Deng, Emily
Cc: dan...@ffwll.ch
Subject: RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
Thursday, December 23, 2021 6:14 PM
>> To: Koenig, Christian ; Grodzovsky, Andrey
>> ; dri-devel@lists.freedesktop.org; amd-
>> g...@lists.freedesktop.org; Chen, Horace ; Chen,
>> JingWen ; Deng, Emily
>> Cc: dan...@ffwll.ch
>> Subject: RE: [RFC v2 8/8] drm/amd/virt:
ndrey
>; dri-devel@lists.freedesktop.org; amd-
>g...@lists.freedesktop.org; Chen, Horace ; Chen,
>JingWen ; Deng, Emily
>Cc: dan...@ffwll.ch
>Subject: RE: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
>for SRIOV
>
>[AMD Official Use Only]
>
>@Chen, Ho
: Liu, Monk ; Grodzovsky, Andrey
; Chen, Horace ; Koenig,
Christian ; dan...@ffwll.ch
Subject: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection for
SRIOV
Since now flr work is serialized against GPU resets there is no need for this.
Signed-off-by: Andrey Grodzovsky
---
drivers
.org; amd-...@lists.freedesktop.org
Cc: dan...@ffwll.ch; Liu, Monk ; Chen, Horace
Subject: Re: [RFC v2 8/8] drm/amd/virt: Drop concurrent GPU reset protection
for SRIOV
Am 22.12.21 um 23:14 schrieb Andrey Grodzovsky:
> Since now flr work is serialized against GPU resets there is no need
> for this.
Am 22.12.21 um 23:14 schrieb Andrey Grodzovsky:
Since now flr work is serialized against GPU resets
there is no need for this.
Signed-off-by: Andrey Grodzovsky
Acked-by: Christian König
---
drivers/gpu/drm/amd/amdgpu/mxgpu_ai.c | 11 ---
drivers/gpu/drm/amd/amdgpu/mxgpu_nv.c |
Since now flr work is serialized against GPU resets
there is no need for this.
Signed-off-by: Andrey Grodzovsky
---
drivers/gpu/drm/amd/amdgpu/mxgpu_ai.c | 11 ---
drivers/gpu/drm/amd/amdgpu/mxgpu_nv.c | 11 ---
2 files changed, 22 deletions(-)
diff --git a/drivers/gpu/drm/amd/