I actually started without unpersisting, but given the performance I tried
adding the unpersist calls in order to free memory. After your answer I
removed them again, but the execution time did not change...

Looking at the web interface, I can see that the "mapPartitions at
GraphImpl.scala:184" stages that run just after the foreach (so I guess this
is the message computation, even though I would have bet on the innerjoin...)
take 2s -> 6s -> 17s -> 48s -> 15m+ (the last one sometimes ends with
"GC overhead limit exceeded"), with shuffle writes of
9MB -> 30MB -> 73MB -> 162MB -> never completed.
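
As a rough illustration of how fast these figures grow, the sketch below
computes the ratio between consecutive values. It is plain Python, not part of
the Spark job. The names stage_seconds, shuffle_write_mb and growth_factors are
made up for this example. Only the completed measurements quoted above are
used, and the "15m+" and "never completed" entries are left out.

```python
# Figures copied from the timings and shuffle sizes reported above.
# The last, unfinished measurement ("15m+", "never completed") is omitted.
stage_seconds = [2, 6, 17, 48]
shuffle_write_mb = [9, 30, 73, 162]


def growth_factors(values):
    """Return the ratio of each value to the one before it."""
    return [later / earlier for earlier, later in zip(values, values[1:])]


time_growth = growth_factors(stage_seconds)
shuffle_growth = growth_factors(shuffle_write_mb)

for name, factors in (("stage time", time_growth),
                      ("shuffle write", shuffle_growth)):
    print(name, [round(f, 2) for f in factors])
# prints:
# stage time [3.0, 2.83, 2.82]
# shuffle write [3.33, 2.43, 2.22]
```

Both series grow by a factor of roughly 2 to 3 per step, so the cost of this
stage looks geometric rather than constant.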

Is this still related to unpersisting? I'm mostly a mathematician, so I'm
not well acquainted with the internals of Spark...



