On 2024-11-28 10:21, Lazar, Lijo wrote:
> 
> 
> On 11/28/2024 8:12 PM, Felix Kuehling wrote:
>>
>>
>> On 2024-11-27 22:45, Lazar, Lijo wrote:
>>>
>>>
>>> On 11/28/2024 5:43 AM, Felix Kuehling wrote:
>>>>
>>>> On 2024-11-18 00:34, Lijo Lazar wrote:
>>>>> Write pointer could be 32-bit or 64-bit. Use the correct size during
>>>>> initialization.
>>>>>
>>>>> Signed-off-by: Lijo Lazar <lijo.la...@amd.com>
>>>>> ---
>>>>>   drivers/gpu/drm/amd/amdkfd/kfd_kernel_queue.c | 2 +-
>>>>>   1 file changed, 1 insertion(+), 1 deletion(-)
>>>>>
>>>>> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_kernel_queue.c b/drivers/
>>>>> gpu/drm/amd/amdkfd/kfd_kernel_queue.c
>>>>> index 4843dcb9a5f7..d6037577c532 100644
>>>>> --- a/drivers/gpu/drm/amd/amdkfd/kfd_kernel_queue.c
>>>>> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_kernel_queue.c
>>>>> @@ -125,7 +125,7 @@ static bool kq_initialize(struct kernel_queue *kq,
>>>>> struct kfd_node *dev,
>>>>>         memset(kq->pq_kernel_addr, 0, queue_size);
>>>>>       memset(kq->rptr_kernel, 0, sizeof(*kq->rptr_kernel));
>>>>> -    memset(kq->wptr_kernel, 0, sizeof(*kq->wptr_kernel));
>>>>> +    memset(kq->wptr_kernel, 0, dev->kfd->device_info.doorbell_size);
>>>>
>>>> It would be safer and cleaner to just initialize kq->wptr64_kernel,
>>>> which is always 64 bit and aliases kq->wptr_kernel.
>>>>
>>>
>>> It's done this way because of aliasing. The size requested is
>>> doorbell_size which could be 4 or 8 bytes.
>>>
>>> By safer, do you mean to have an if..else check in case the aliasing is
>>> taken out?
>>
>> Cleaner because the size of the memset would match the size of the variable. 
>> And safer because it clears the entire wptr, regardless of the doorbell size.
>>
>> That said, it doesn't really matter because the whole kq structure is 
>> allocated with kzalloc just before calling kq_initialize. So maybe we should 
>> just remove all those redundant memset(kq->field, 0, size) calls.
>>
> 
> This is not related to kzalloc of kq structure. This is actually
> initializing write pointer value to 0.
> 
> The sequence is -
>       Allocate memory for write pointer (kfd_gtt_sa_allocate)
>       Assign kq wptr to the allocated pointer
>       Set wptr to 0 (Initial write pointer value).
> 
> The last step should actually be based on 4 byte or 8byte, this patch is
> for that. If gtt_sa_allocate is already getting a cleared memory, then
> this can be skipped as well.

OK, sorry, I was reading that wrong. You're right. kfd_gtt_sa_allocate does not 
clear memory. And clearing it with the same size as the allocation is the right 
thing to do. The patch is

Reviewed-by: Felix Kuehling <felix.kuehl...@amd.com>

> 
> Thanks,
> Lijo
> 
>> Regards,
>>   Felix
>>
>>>
>>> Thanks,
>>> Lijo
>>>
>>>> Regards,
>>>>   Felix
>>>>
>>>>
>>>>
>>>>>         prop.queue_size = queue_size;
>>>>>       prop.is_interop = false;
>>>
>>
> 

Reply via email to