If you google, you'll find benchmarks showing each to be faster than the
other. In so far as there's any reality to which is faster in any given
comparison, it seems to be a result of each incorporating ideas from the
other, or at least going through development cycles to beat each other.
ORC is v
Hi,
Thanks for that link.
It appears that the main advantages of Parquet is stated as and I quote:
"Parquet is built to be used by anyone. The Hadoop ecosystem is rich with
data processing frameworks, and we are not interested in playing favorites.
We believe that an efficient, well-implemented
Curious why you think that Parquet does not have metadat at file, row
group or column level.
Please refer here to the type of metadata that Parquet supports in the
docs http://parquet.apache.org/documentation/latest/
n 06/03/2016 15:26, Mich Talebzadeh wrote:
Hi.
I have been hearing a fair bi