Re: Graphx : Perfomance comparison over cluster

2014-07-30 Thread Ankur Dave
ShreyanshB writes: >> The version with in-memory shuffle is here: >> https://github.com/amplab/graphx2/commits/vldb. > > It'd be great if you can tell me how to configure and invoke this spark > version. Sorry for the delay on this. Assuming you're planning to launch an EC2 cluster, here's how t

Re: Graphx : Perfomance comparison over cluster

2014-07-23 Thread ShreyanshB
t; If you reply to this email, your message will be added to the discussion > below: > > http://apache-spark-user-list.1001560.n3.nabble.com/Graphx-Perfomance-comparison-over-cluster-tp10222p10281.html > To unsubscribe from Graphx : Perfomance

Re: Graphx : Perfomance comparison over cluster

2014-07-20 Thread Ankur Dave
On Fri, Jul 18, 2014 at 9:07 PM, ShreyanshB wrote: > > Does the suggested version with in-memory shuffle affects performance too > much? We've observed a 2-3x speedup from it, at least on larger graphs like twitter-2010 and uk-2007-05

Re: Graphx : Perfomance comparison over cluster

2014-07-18 Thread ShreyanshB
r_memory and partition strategy, Is >> there any other configuration I should do to have the best performance? >> >> I am using following script, >> import org.apache.spark._ >> import org.apache.spark.graphx._ >> import org.apache.spark.rdd.RDD >>

Re: Graphx : Perfomance comparison over cluster

2014-07-18 Thread Ankur Dave
graph = GraphLoader.edgeListFile(sc, "filepath",true,32) > val endgraphloading = System.currentTimeMillis; > > > Thanks in advance :) > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Graphx-Perfomance-comparison-over-cluster-tp10222.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. >

Graphx : Perfomance comparison over cluster

2014-07-18 Thread ShreyanshB
aphx-Perfomance-comparison-over-cluster-tp10222.html Sent from the Apache Spark User List mailing list archive at Nabble.com.