Hi all, I tried to run a TeraSort benchmark on my Spark cluster, but I
found it hard to find a standard Spark TeraSort program, apart from a PR
from rxin and a branch from Ewan Higgs:

https://github.com/apache/spark/pull/1242
https://github.com/ehiggs/spark/tree/terasort

The example rxin provided does not include a validation step, so I tried
Higgs's example, but sadly I always get an error during validation:

assertion failed: current partition min < last partition max

It seems to require that the min array of partition 2 be larger than the
max array of partition 1, but the code here is confusing:

        println(s"lastMax" + lastMax.toSeq.map(x => if (x < 0) 256 + x else x))
        println(s"min " + min.toSeq.map(x => if (x < 0) 256 + x else x))
        println(s"max " + max.toSeq.map(x => if (x < 0) 256 + x else x))
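
As far as I can tell, the `if (x < 0) 256 + x else x` mapping only displays each signed JVM byte as an unsigned value in 0..255. Below is a minimal sketch of what an unsigned, byte-by-byte key comparison might look like. The names `UnsignedKeyCompare`, `unsigned` and `compareKeys` are hypothetical and are not taken from either repository, and I am assuming (not certain) that the validation is meant to order keys this way.

```scala
// Hypothetical helpers for illustration only; not from the linked repositories.
object UnsignedKeyCompare {
  // Map a signed JVM Byte (-128..127) to its unsigned value (0..255).
  def unsigned(b: Byte): Int = b & 0xff

  // Lexicographic comparison that treats every byte as unsigned.
  // Returns a negative number, zero, or a positive number.
  def compareKeys(a: Array[Byte], b: Array[Byte]): Int = {
    val n = math.min(a.length, b.length)
    var i = 0
    while (i < n) {
      val diff = unsigned(a(i)) - unsigned(b(i))
      if (diff != 0) return diff
      i += 1
    }
    // All shared bytes are equal: the shorter key sorts first.
    a.length - b.length
  }

  def main(args: Array[String]): Unit = {
    val lastMax = Array[Byte](0x7f, 0x01)
    val min     = Array[Byte](0x80.toByte, 0x00)
    // Unsigned order: 0x7f (127) sorts before 0x80 (128).
    println(compareKeys(lastMax, min) < 0)
    // Signed order disagrees on the first byte: 127 > -128.
    println(lastMax(0) < min(0))
  }
}
```

If the sort and the validation disagree on signed versus unsigned ordering, a check like the one above could fail, though I have not verified that this is the cause of the assertion here.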

Has anyone run the TeraSort example successfully? Or where can I get a
standard TeraSort application?
