+1
OSX 10.10.5, java version "1.8.0_40", scala 2.10
mvn clean package -DskipTests
[INFO] Spark Project External Kafka ... SUCCESS [ 18.161
s]
[INFO] Spark Project Examples . SUCCESS [01:18
min]
[INFO] Spark Project External Kafka Assembly .
+1
build/mvn clean package -DskipTests -Pyarn -Phadoop-2.6
OK
Basic graph tests
Load graph using edgeListFile...SUCCESS
Run PageRank...SUCCESS
Minimum Spanning Tree Algorithm
Run basic Minimum Spanning Tree algorithm...SUCCESS
Run Minimum Spanning Tree taxonomy creation...SUCCESS
--
Vi
I'm not sure the problem is quite as bad as you state. Both sampleByKey and
sampleByKeyExact are implemented using a function from
StratifiedSamplingUtils which does one of two things depending on whether
the exact implementation is needed. The exact version requires double the
number of lines of c
I would suggest you move this to the Spark User list, this is the development
list for discussion on development of Spark. It would help if you could give
some more information about what you are trying to do e.g. what code you are
running, how you submitted the job (spark-shell, spark-submit) and