On Fri, Jul 18, 2014 at 9:07 PM, ShreyanshB <shreyanshpbh...@gmail.com> wrote:
>
> Does the suggested version with in-memory shuffle affects performance too
> much?

We've observed a 2-3x speedup from it, at least on larger graphs like
twitter-2010 <http://law.di.unimi.it/webdata/twitter-2010/> and
uk-2007-05 <http://law.di.unimi.it/webdata/uk-2007-05/>.

> (according to previously reported numbers, graphx did 10 iterations in 142
> seconds and in latest stats it does it in 68 seconds). Is it just the
> in-memory version which is changed?

If you're referring to previous results vs. the arXiv paper, there were
several improvements, but in-memory shuffle had the largest impact.

Ankur <http://www.ankurdave.com/>