[GitHub] spark pull request: SPARK-3968 Use parquet-mr filter2 api in spark...

saucam Tue, 28 Oct 2014 23:44:23 -0700

Github user saucam commented on the pull request:

    https://github.com/apache/spark/pull/2841#issuecomment-60880619
  
    Hi @mateiz , thanks for the suggestions, just a few points 
    1. Need to know which strategy to be kept as default (currently we use a 
different one than the default one in  parquet library)
    2. This PR is adding support to use filter2 api from the parquet library 
which supports row group filtering. Do we need to add tests to ensure that ? 
because such test cases already exist in the parquet library : 
    
    
https://github.com/Parquet/parquet-mr/blob/parquet-1.6.0rc3/parquet-hadoop/src/test/java/parquet/filter2/compat/TestRowGroupFilter.java



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request: SPARK-3968 Use parquet-mr filter2 api in spark...

Reply via email to