Re: Parquet versus ORC

2016-03-06 Thread Marcin Tustin
If you google, you'll find benchmarks showing each to be faster than the other. In so far as there's any reality to which is faster in any given comparison, it seems to be a result of each incorporating ideas from the other, or at least going through development cycles to beat each other. ORC is v

Re: Parquet versus ORC

2016-03-06 Thread Mich Talebzadeh
Hi, Thanks for that link. It appears that the main advantages of Parquet is stated as and I quote: "Parquet is built to be used by anyone. The Hadoop ecosystem is rich with data processing frameworks, and we are not interested in playing favorites. We believe that an efficient, well-implemented

Re: Parquet versus ORC

2016-03-06 Thread Uli Bethke
Curious why you think that Parquet does not have metadat at file, row group or column level. Please refer here to the type of metadata that Parquet supports in the docs http://parquet.apache.org/documentation/latest/ n 06/03/2016 15:26, Mich Talebzadeh wrote: Hi. I have been hearing a fair bi