rkrishn7 commented on PR #17632: URL: https://github.com/apache/datafusion/pull/17632#issuecomment-3304995635
One thing to note with this approach is that we run the hash function up to `N` times, until all partitions are complete, which I imagine would have a performance impact. I wonder if we should wait until we can memoize non-volatile function calls with the same arguments within a filter (which I think is tracked by https://github.com/apache/datafusion/issues/17599)?
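To make the concern concrete, here is a rough sketch in plain Rust. It is not DataFusion code. It assumes, purely for illustration, that each partition's predicate checks `hash(key) % N == partition`; the actual filter expression in the PR may differ. The names `hash_key`, `matches_naive`, and `matches_memoized` are hypothetical, and the `calls` counter exists only to show how often the hash is computed.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical stand-in for the hash expression evaluated inside the filter.
/// `calls` counts how many times the hash is actually computed.
fn hash_key(key: u64, calls: &mut usize) -> u64 {
    *calls += 1;
    let mut hasher = DefaultHasher::new();
    key.hash(&mut hasher);
    hasher.finish()
}

/// Naive form: each of the `n` per-partition predicates re-hashes the key.
/// There is no short-circuiting here, so the hash runs exactly `n` times.
fn matches_naive(key: u64, n: u64, calls: &mut usize) -> Vec<bool> {
    (0..n).map(|p| hash_key(key, calls) % n == p).collect()
}

/// Memoized form: hash once, then reuse the value for every predicate.
/// This is only valid because the hash is non-volatile (same input, same output).
fn matches_memoized(key: u64, n: u64, calls: &mut usize) -> Vec<bool> {
    let h = hash_key(key, calls);
    (0..n).map(|p| h % n == p).collect()
}
```

With `n = 4`, the naive form computes the hash four times per key and the memoized form once, while both return the same per-partition results. That gap is roughly what memoizing non-volatile calls within a filter would recover.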