Re: Parquet tables with snappy compression

2017-01-25 Thread Gopal Vijayaraghavan
> Has there been any study of how much compressing Hive Parquet tables with > snappy reduces storage space or simply the table size in quantitative terms? http://www.slideshare.net/oom65/file-format-benchmarks-avro-json-orc-parquet/20 Since SNAPPY is just LZ77, I would assume it would be useful

Re: Parquet tables with snappy compression

2017-01-25 Thread Owen O'Malley
Mich, Here are the benchmarks that I did using three different types of data: http://www.slideshare.net/HadoopSummit/file-format-benchmark-avro-json-orc-parquet I assume you are comparing parquet-snappy vs parquet-none. .. Owen On Wed, Jan 25, 2017 at 1:37 PM, Mich Talebzadeh wrote: > Hi,