ozankabak commented on PR #16196:
URL: https://github.com/apache/datafusion/pull/16196#issuecomment-2956513724

   Great, thanks for the patch. We should use it as one of the new test cases 
in the follow-on PRs.
   
   Look, I see that you are trying to help and we do want to take it. I suspect 
we might be facing a "culture" challenge here: Typically, DF community attacks 
large problems by solving them bit by bit and refining a solution iteratively. 
This is unlike some other projects which front-load the effort by going through 
a more comprehensive design process. We also do that for some tasks where this 
iterative approach is not applicable, but it is not very common.
   
   This "bit by bit approach" doesn't always succeed, every now and then it 
happens that we get stuck or go down the wrong path for a while, and then 
change tacks. However, we still typically prefer to "advance the front" and 
make progress in tangible ways as much as we can (if we see a way). This 
necessarily results in imperfect solutions being the "state of the code" in 
some cases, and they survive in the codebase for a while, but we are good at 
driving things to completion in the long run.
   
   Reflecting all this onto the task at hand, this PR (1) solves many cases 
already and (2) introduces some machinery that will be useful as we iterate on 
the full solution. I don't think it is brittle, I think it is imperfect and 
requires refinement. I am optimistic that we will eventually converge on a good 
approach that requires minimal operator cooperation (but we won't be able to 
reduce that to a strict zero) and is close to being optimal in terms of 
yielding overhead. Where we are at is not where we will ultimately be, this is 
just a step in a long process. I hope that helps. Thanks.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to