Your assessment is mostly correct. I think the only thing I'd reword is
the comment about splitting the data, since Spark itself doesn't do
that, but read on.
On Thu, Oct 23, 2014 at 6:12 PM, matan wrote:
> In case I nailed it, how then does it handle a distributed HDFS file? Does
> it pull all of