Currently there’s no way to cache compressed sequence files directly. Spark SQL uses an in-memory columnar format when caching table rows, so it must read all the raw data and convert it into that columnar format. However, you can enable compression of the in-memory columnar cache by setting spark.sql.inMemoryColumnarStorage.compressed to true. This property is already set to true by default in the master branch and branch-1.2.
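
For reference, a minimal sketch of setting this explicitly on Spark 1.x, where older builds do not yet have it on by default (the table name "events" and the app name are hypothetical):

    import org.apache.spark.{SparkConf, SparkContext}
    import org.apache.spark.sql.hive.HiveContext

    object CacheCompressedExample {
      def main(args: Array[String]): Unit = {
        val sc = new SparkContext(new SparkConf().setAppName("CacheCompressedExample"))
        val sqlContext = new HiveContext(sc)

        // Enable compression of the in-memory columnar cache; this is already
        // the default on master and branch-1.2, but explicit here for 1.1.
        sqlContext.setConf("spark.sql.inMemoryColumnarStorage.compressed", "true")

        // Caching the table materializes it in the (compressed) columnar
        // format; the raw sequence files are still decompressed on read.
        sqlContext.cacheTable("events")
        sqlContext.sql("SELECT COUNT(*) FROM events").collect().foreach(println)
      }
    }

Note that this compresses the columnar cache itself; it does not keep the original sequence-file compression.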

On 11/13/14 7:16 AM, Sadhan Sood wrote:

We noticed that when caching data from our Hive tables, which store data in compressed sequence file format, the data gets uncompressed in memory as it is cached. Is there a way to turn this off and cache the compressed data as is?
