Re: Spark using HDFS data [newb]

2014-10-23 Thread Marcelo Vanzin
You assessment is mostly correct. I think the only thing I'd reword is the comment about splitting the data, since Spark itself doesn't do that, but read on. On Thu, Oct 23, 2014 at 6:12 PM, matan wrote: > In case I nailed it, how then does it handle a distributed hdfs file? does > it pull all of

Spark using HDFS data [newb]

2014-10-23 Thread matan
st.1001560.n3.nabble.com/Spark-using-HDFS-data-newb-tp17169.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-ma