That's not work. I don't think it is just slow, It never ends(with 30+ hours, and I killed it).
-- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/parallelize-for-a-large-Seq-is-extreamly-slow-tp4801p4900.html Sent from the Apache Spark User List mailing list archive at Nabble.com.