alamb commented on issue #18411: URL: https://github.com/apache/datafusion/issues/18411#issuecomment-3665182575
> Thinking about it, I think also Parquet decoding could be made more efficient from this info as well (into primitive byte array instead of utf8 view, no utf8 validation needed). I 100% agree that if the goal was to write an executor that handled TPCH queries and took advantage of the patterns in the data (ahem benchmaxxing) we could do much better. However my goal is to build a system that is useful for real world queries, so I try and focus on optimizations that are applicable to real world queries. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
