[AMD Official Use Only]


> -----Original Message-----
> From: Paul Menzel <pmen...@molgen.mpg.de>
> Sent: Tuesday, March 22, 2022 11:42 AM
> To: Zhou1, Tao <tao.zh...@amd.com>
> Cc: amd-gfx@lists.freedesktop.org; Zhang, Hawking
> <hawking.zh...@amd.com>; Kuehling, Felix <felix.kuehl...@amd.com>; Yang,
> Stanley <stanley.y...@amd.com>; Chai, Thomas <yipeng.c...@amd.com>;
> Lazar, Lijo <lijo.la...@amd.com>
> Subject: Re: [PATCH] drm/amdkfd: print unmap queue status for RAS poison
> consumption (v3)
> 
> Dear Tao,
> 
> 
> Thank you for rerolling the pacth.
> 
> Am 22.03.22 um 04:29 schrieb Tao Zhou:
> > Print the status out when it passes, and also tell user gpu reset is
> > triggered when we fallback to legacy way.
> 
> The verb *fall back* is written with a space.

[Tao] will update it before push

> 
> > v2: make the message more explicitly.
> 
> explicit

[Tao] will update it before push.

> 
> > v3: change succeeds to succeeded.
> >      replace pr_warn with dev_info.
> 
> Please give the reason, why to decrease the debug level. (I would also do it 
> in a
> separate commit.)

[Tao] please check my reply to Lijo. It's informational message, but ras poison 
consumption is special event, both debug levels are OK for me.

> 
> > Signed-off-by: Tao Zhou <tao.zh...@amd.com>
> > ---
> >   drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c | 13 +++++++++----
> >   1 file changed, 9 insertions(+), 4 deletions(-)
> >
> > diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c
> > b/drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c
> > index 56902b5bb7b6..cc9ddc4b4cb8 100644
> > --- a/drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c
> > +++ b/drivers/gpu/drm/amd/amdkfd/kfd_int_process_v9.c
> > @@ -105,8 +105,6 @@ static void
> event_interrupt_poison_consumption(struct kfd_dev *dev,
> >     if (old_poison)
> >             return;
> >
> > -   pr_warn("RAS poison consumption handling: client id %d\n", client_id);
> > -
> >     switch (client_id) {
> >     case SOC15_IH_CLIENTID_SE0SH:
> >     case SOC15_IH_CLIENTID_SE1SH:
> > @@ -130,10 +128,17 @@ static void
> event_interrupt_poison_consumption(struct kfd_dev *dev,
> >     /* resetting queue passes, do page retirement without gpu reset
> >      * resetting queue fails, fallback to gpu reset solution
> >      */
> > -   if (!ret)
> > +   if (!ret) {
> > +           dev_info(dev->adev->dev,
> > +                   "RAS poison consumption, unmap queue flow
> succeeded: client id %d\n",
> > +                   client_id);
> >             amdgpu_amdkfd_ras_poison_consumption_handler(dev->adev,
> false);
> > -   else
> > +   } else {
> > +           dev_info(dev->adev->dev,
> > +                   "RAS poison consumption, fallback to gpu reset flow:
> client id
> > +%d\n",
> 
> The verb *fall back* is written with a space.

[Tao] will update it before push.

> 
> > +                   client_id);
> >             amdgpu_amdkfd_ras_poison_consumption_handler(dev->adev,
> true);
> > +   }
> >   }
> >
> >   static bool event_interrupt_isr_v9(struct kfd_dev *dev,

Reply via email to