alamb commented on issue #13983:
URL: https://github.com/apache/datafusion/issues/13983#issuecomment-2612198384

   And in this case enabling  predicate pushdown results in a 2x speedup
   
   ```sql
   set datafusion.execution.parquet.pushdown_filters = false;
   SELECT * FROM hits_partitioned WHERE "URL" LIKE '%google%' ORDER BY 
to_timestamp_seconds("EventTime") LIMIT 10;
   ```
   Elapsed 4.108 seconds.
   Elapsed 5.430 seconds.
   Elapsed 4.659 seconds.
   
   ```sql
   set datafusion.execution.parquet.pushdown_filters = true;
   SELECT * FROM hits_partitioned WHERE "URL" LIKE '%google%' ORDER BY 
to_timestamp_seconds("EventTime") LIMIT 10;
   ```
   
   Elapsed 2.415 seconds.
   Elapsed 2.070 seconds.
   Elapsed 2.279 seconds.
   
   
   Here is the flamegraph for no pushdown:
   
   
![Image](https://github.com/user-attachments/assets/549a5bb8-5018-4e08-af94-3bf57c192997)
   
   It would be cool to test with @XiangpengHao 's change to the parquet decoder 
here:
   - https://github.com/apache/arrow-rs/pull/6921


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to