On Thu, Apr 03, 2025 at 02:22:41PM +0200, Christian König wrote:
> Am 03.04.25 um 12:25 schrieb Danilo Krummrich:
> > On Thu, Apr 03, 2025 at 12:17:29PM +0200, Philipp Stanner wrote:
> >> On Thu, 2025-04-03 at 12:13 +0200, Philipp Stanner wrote:
> >>> -static int
> >>> -nouveau_fence_signal(struct nouveau_fence *fence)
> >>> +static void
> >>> +nouveau_fence_cleanup_cb(struct dma_fence *dfence, struct
> >>> dma_fence_cb *cb)
> >>>  {
> >>> - int drop = 0;
> >>> + struct nouveau_fence_chan *fctx;
> >>> + struct nouveau_fence *fence;
> >>> +
> >>> + fence = container_of(dfence, struct nouveau_fence, base);
> >>> + fctx = nouveau_fctx(fence);
> >>>  
> >>> - dma_fence_signal_locked(&fence->base);
> >>>   list_del(&fence->head);
> >>>   rcu_assign_pointer(fence->channel, NULL);
> >>>  
> >>>   if (test_bit(DMA_FENCE_FLAG_USER_BITS, &fence->base.flags))
> >>> {
> >>> -         struct nouveau_fence_chan *fctx =
> >>> nouveau_fctx(fence);
> >>> -
> >>>           if (!--fctx->notify_ref)
> >>> -                 drop = 1;
> >>> +                 nvif_event_block(&fctx->event);
> >>>   }
> >>>  
> >>>   dma_fence_put(&fence->base);
> >> What I realized while coding this v2 is that we might want to think
> >> about whether we really want the dma_fence_put() in the fence callback?
> >>
> >> It should work fine, since it's exactly identical to the previous
> >> code's behavior – but effectively it means that the driver's reference
> >> will be dropped whenever it signals that fence.
> > Not quite, it's the reference of the fence context's pending list.
> >
> > When the fence is emitted, dma_fence_init() is called, which initializes the
> > reference count to 1. Subsequently, another reference is taken, when the 
> > fence
> > is added to the pending list. Once the fence is signaled and hence removed 
> > from
> > the pending list, we can (and have to) drop this reference.
> 
> The general idea is that the caller must hold the reference until the 
> signaling is completed.
> 
> So for signaling from the interrupt handler it means that you need to call 
> dma_fence_put() for the list reference *after* you called 
> dma_fence_signal_locked().
> 
> For signaling from the .enable_signaling or .signaled callback you need to 
> remove the fence from the linked list and call dma_fence_put() *before* you 
> return (because the caller is holding the potential last reference).
> 
> That's why I'm pretty sure that the approach with installing the callback 
> won't work. As far as I know no other DMA fence implementation is doing that.

I think it works as long as no one calls dma_fence_singnal(), but only
dma_fence_signal_locked() on this fence (which is what nouveau does). For
dma_fence_signal_locked() it doesn't seem to matter if the last reference is
dropped from a callback. There also can't be other callbacks that suffer from
this, because they'd need to have their own reference.

But either way, as mentioned in my other reply, I agree that we should avoid the
callback approach in favor of your proposal, since it has its own footgun.

Reply via email to