rluvaton opened a new pull request, #17592:
URL: https://github.com/apache/datafusion/pull/17592

   ## Which issue does this PR close?
   
   N/A
   
   ## Rationale for this change
   
   I want fast grouping when there are a lot of columns to group by and there 
are a lot of unique groups
   
   ## What changes are included in this PR?
   
   This optimization is fairly simple:
   if the row indices to append are continues (i.e. `append_row_indices[i] + 1 
== append_row_indices[i + 1]`) we will call an optimized function for that case
   
   the optimized function should copy all the data in a single pass making it 
very fast as opposed to item by item
   
   ## Are these changes tested?
   
   Existing tests but I want to add more, before I do I wanna see the benchmark 
results (I'm 99% it will be faster but still)
   
   ## Are there any user-facing changes?
   
   Nope
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to