Re: Parquet cpp status

2019-11-15 Thread Francois Saint-Jacques
The parquet c++ implementation has all the facilities to expose the required information to implement predicate pushdown. The experimental Dataset API does make use of this with parquet. See [1] for an example of the API. Or a real-life usage with the nyc-tlc taxi dataset [2]. The relevant implemen

Re: Parquet cpp status

2019-11-14 Thread Micah Kornfield
#1 if there isn't a JIRA I would guess no-one is working on it (Note I would expect at least the initial work to be in aParquet JIRA item, and this is probably a discussion for that mailing list). #2. There are some open PR to expose the parquet reader through JNI to java [1] #3. Its possible Dremi

Parquet cpp status

2019-11-13 Thread Chang Chen
Hi I am trying to find doc about current parquet-cpp current status. i googled it, but i didn't find any useful information. here are what i concerned about: #1 column indexes (https://issues.apache.org/jira/browse/PARQUET-1201), the corresponding java implementation already supported it last y