Re: HDFS Block compression

2016-07-05 Thread Ravi Prakash
Thanks for your email, Robert! IMHO compression has other effects (pegging CPUs, needing more memory). If you enable compression on all blocks, you can't provide uncompressed performance (it's arguable whether compression will always be faster or slower). Regardless, users are free to compress at th

Re: HDFS Block compression

2016-07-05 Thread Colin McCabe
We have discussed this in the past. I think the single biggest issue is that HDFS doesn't understand the schema of the data stored in it, so it may not be aware of which compression scheme would be most appropriate for the application and data. While it is true that HDFS doesn't allow ra