> Has there been any study of how much compressing Hive Parquet tables with
> snappy reduces storage space or simply the table size in quantitative terms?
http://www.slideshare.net/oom65/file-format-benchmarks-avro-json-orc-parquet/20
Since SNAPPY is just LZ77, I would assume it would be useful
Mich,
Here are the benchmarks that I did using three different types of data:
http://www.slideshare.net/HadoopSummit/file-format-benchmark-avro-json-orc-parquet
I assume you are comparing parquet-snappy vs parquet-none.
.. Owen
On Wed, Jan 25, 2017 at 1:37 PM, Mich Talebzadeh
wrote:
> Hi,