rkrishn7 commented on issue #17451:
URL: https://github.com/apache/datafusion/issues/17451#issuecomment-3262694518

   > > Without some synchronization, the behavior is racy and it's not 
guaranteed that that the dynamic filter is built prior to initiating the right 
side's execution plan.
   > 
   > I think that's fine? For a big query it will be a non issue. It could make 
a difference in a small query. My worry is that if you get synchronization 
wrong or misunderstand something you can actually slow things down, end up with 
deadlocks, etc.
   
   Hmm, I think we want the filter to be built prior to any scans being 
initiated on the right side. The underlying datasource implementation may try 
to make use of it immediately (e.g. pruning predicate for row group filtering). 
In my mind, we ideally don't the filter's presence to be a function of how long 
it takes to get the complete bounds information from all partitions on the left 
side of the join.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to