Re: Why does a 3.8 T dataset take up 11.59 Tb on HDFS

ajackson92 Wed, 25 Nov 2015 22:42:11 -0800

Most Hadoop installations use a block replication of 3.  What you're seeing
is your dataset (3.8T) replicated 3 times (11.4TB).




--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Why-does-a-3-8-T-dataset-take-up-11-59-Tb-on-HDFS-tp25471p25488.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Re: Why does a 3.8 T dataset take up 11.59 Tb on HDFS

Reply via email to