GitHub user pepijnve added a comment to the discussion: Multiple 'group by's, 
one scan

I read through the linked issues in the meantime. I think what we're trying to 
do is closest to the Splitter idea described in the linked document at 
https://github.com/apache/datafusion/pull/8558#issuecomment-1862968970

The part we've been struggling with a bit is how to fit this in the framework 
of DataFusion without breaking assumptions. While it's definitely already 
feasible to create diamond plans in code, my colleague and I got the impression 
that most code has been written under the assumption that nodes only have a 
single parent. One example that comes to mind was projection pushdown where 
each branch would push down only the columns it needed.

If you have any guidance/pointers on how to go about tackling this problem 
without going against the grain of the library, that would be much appreciated. 
Happy to contribute the work upstream if we can get beyond the point of a hack 
to something sufficiently general and usable for others.

GitHub link: 
https://github.com/apache/datafusion/discussions/15982#discussioncomment-13160947

----
This is an automatically sent email for github@datafusion.apache.org.
To unsubscribe, please send an email to: 
github-unsubscr...@datafusion.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to