Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]

via GitHub Wed, 17 Dec 2025 04:43:50 -0800


alamb commented on issue #18411:
URL: https://github.com/apache/datafusion/issues/18411#issuecomment-3665182575


   > Thinking about it, I think also Parquet decoding could be made more 
efficient from this info as well (into primitive byte array instead of utf8 
view, no utf8 validation needed).
   
   I 100% agree that if the goal was to write an executor that handled TPCH 
queries and took advantage of the patterns in the data (ahem benchmaxxing) we 
could do much better.
   
   However my goal is to build a system that is useful for real world queries, 
so I try and focus on optimizations that are applicable to real world queries. 
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [I] TPCH q1 with no predicates is 2x slower than duckdb [datafusion]

Reply via email to