Github user saucam commented on the pull request:
https://github.com/apache/spark/pull/2841#issuecomment-60880619
Hi @mateiz , thanks for the suggestions, just a few points
1. Need to know which strategy to be kept as default (currently we use a
different one than the default one in parquet library)
2. This PR is adding support to use filter2 api from the parquet library
which supports row group filtering. Do we need to add tests to ensure that ?
because such test cases already exist in the parquet library :
https://github.com/Parquet/parquet-mr/blob/parquet-1.6.0rc3/parquet-hadoop/src/test/java/parquet/filter2/compat/TestRowGroupFilter.java
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]