Hello,

Thanks Andra for the gaussian sequence generation. It is a little tricky, i just leave this part for future work.

I meet another problem in AffinityPropogation algorithm. I write a few test code for it.
https://github.com/joey001/flink/blob/ap_add/flink-staging/flink-gelly/src/test/java/org/apache/flink/graph/test/example/AffinityPropogationITCase.java
<https://github.com/joey001/flink/blob/ap_add/flink-staging/flink-gelly/src/test/java/org/apache/flink/graph/test/example/AffinityPropogationITCase.java>It passes the COLLECTION while failed when execution mode = CLUSTER.
I not very clear about the differences and the reason.

Does anyone give me a clue?

Thanks,
Best Regards.

ZHOU Yi

On 08/05/2015 23:17, Andra Lungu wrote:
Hi Yi,

To my knowledge, there is no simple way to generate this kind of
DataSet(i.e. there is no env.generateGaussianSequence()).
However, if you look in flink-perf, Till used something like this there:
https://github.com/project-flink/flink-perf/blob/master/flink-jobs/src/main/scala/com/github/projectflink/als/ALSDataGeneration.scala
Maybe he can give you some tips.

You can also call random.nextGaussian() in Java.
http://docs.oracle.com/javase/7/docs/api/java/util/Random.html#nextGaussian%28%29

Not sure if this helps, but there is a paper on generating this kind of
distribution.
http://ifisc.uib-csic.es/raul/publications/P/P44_tc93.pdf

Best of luck,
Andra


On Fri, May 8, 2015 at 9:45 PM, Yi ZHOU <zhouyi0...@hotmail.com> wrote:

Hello, all

when I tested AP algorithm, I had a little question :
  how to generate a DataSet in gaussian distribution? Is there a
implemented funtion?

Does any one has a solution? Thank you,

ZHOU Yi


Reply via email to