Thanks for your email, Robert!
IMHO compression has other effects (pegging CPUs, needing more memory). If
you enable compression on all blocks, you can't provide uncompressed
performance (it's arguable whether compression will always be faster or
slower). Regardless, users are free to compress at th
We have discussed this in the past. I think the single biggest issue is
that HDFS doesn't understand the schema of the data which is stored in
it. So it may not be aware of what compression scheme would be most
appropriate for the application and data.
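
To illustrate why the choice depends on the data, here is a rough Python sketch (not HDFS code). It uses the standard zlib module and two made-up payloads, which are assumptions for illustration only: a repetitive log-like text, and random bytes standing in for data that is already compressed or encrypted.

```python
import os
import zlib


def compression_ratio(data: bytes, level: int = 6) -> float:
    """Return compressed size divided by original size (lower is better)."""
    return len(zlib.compress(data, level)) / len(data)


# Hypothetical sample payloads, chosen only for illustration.
log_like = b"INFO request served in 12 ms\n" * 2500
# Random bytes stand in for already-compressed or encrypted data.
already_dense = os.urandom(len(log_like))

text_ratio = compression_ratio(log_like)
dense_ratio = compression_ratio(already_dense)

print(f"repetitive text:    {text_ratio:.3f}")
print(f"high-entropy bytes: {dense_ratio:.3f}")
```

The repetitive text shrinks to a small fraction of its size, while the high-entropy bytes do not shrink at all, so the CPU spent compressing them is wasted. A storage layer that cannot see what the bytes are has no way to tell these cases apart up front.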
While it is true that HDFS doesn't allow ra