alamb commented on issue #13983: URL: https://github.com/apache/datafusion/issues/13983#issuecomment-2612198384
And in this case enabling predicate pushdown results in a 2x speedup:

```sql
set datafusion.execution.parquet.pushdown_filters = false;
SELECT * FROM hits_partitioned WHERE "URL" LIKE '%google%' ORDER BY to_timestamp_seconds("EventTime") LIMIT 10;
```

Elapsed 4.108 seconds.
Elapsed 5.430 seconds.
Elapsed 4.659 seconds.

```sql
set datafusion.execution.parquet.pushdown_filters = true;
SELECT * FROM hits_partitioned WHERE "URL" LIKE '%google%' ORDER BY to_timestamp_seconds("EventTime") LIMIT 10;
```

Elapsed 2.415 seconds.
Elapsed 2.070 seconds.
Elapsed 2.279 seconds.

Here is the flamegraph for no pushdown: (flamegraph image omitted)

It would be cool to test with @XiangpengHao's change to the parquet decoder here:
- https://github.com/apache/arrow-rs/pull/6921
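
As a quick sanity check on the "2x" figure, the three runs for each setting can be averaged and compared. This is only a sketch: the variable names are made up for illustration, and averaging the three runs is an assumption about how to summarize them, not necessarily how the figure above was arrived at.

```python
from statistics import mean

# Elapsed times in seconds, copied from the runs reported above.
no_pushdown = [4.108, 5.430, 4.659]    # pushdown_filters = false
with_pushdown = [2.415, 2.070, 2.279]  # pushdown_filters = true

# Speedup is the ratio of mean elapsed times (an assumed summary statistic).
speedup = mean(no_pushdown) / mean(with_pushdown)

print(f"mean without pushdown: {mean(no_pushdown):.3f}s")
print(f"mean with pushdown:    {mean(with_pushdown):.3f}s")
print(f"speedup:               {speedup:.2f}x")
```

With these six measurements the ratio comes out a little above 2, consistent with the speedup quoted in the comment.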