WikipediaPageRank Data Set

2014-03-27 Thread Niko Stahl
Hello, I would like to run the WikipediaPageRankexample, but the Wikipedia dump XML files are no longer available on Freebase. Does anyone

Re: ClassCastException when using saveAsTextFile

2014-03-25 Thread Niko Stahl
AsTextFile("hdfs://" + masterDomain + ":9000/user/root/" + "test_dir") Even this simple mapping give me a java.lang.ClassCastException. Sorry, my knowledge of Scala is very rudimentary. Thanks, Niko On Tue, Mar 25, 2014 at 5:55 PM, Niko Stahl wrote: > Hi, > &

ClassCastException when using saveAsTextFile

2014-03-25 Thread Niko Stahl
Hi, I'm trying to save an RDD to HDFS with the saveAsTextFile method on my ec2 cluster and am encountering the following exception (the app is called GraphTest): Exception failure: java.lang.ClassCastException: cannot assign instance of GraphTest$$anonfun$3 to field org.apache.spark.rdd.MappedRDD

Re: Comparing GraphX and GraphLab

2014-03-24 Thread Niko Stahl
rom disk, while GraphLab uses shared memory >> for same-machine communication. >> >> I've CC'd Joey and Reynold as well. >> >> Ankur <http://www.ankurdave.com/> >> >> On Mar 24, 2014 11:00 AM, "Niko Stahl" wrote: >> >>

Comparing GraphX and GraphLab

2014-03-24 Thread Niko Stahl
Hello, I'm interested in extending the comparison between GraphX and GraphLab presented in Xin et. al (2013). The evaluation presented there is rather limited as it only compares the frameworks for one algorithm (PageRank) on a cluster with a fixed number of nodes. Are there any graph algorithms w