Hi Yi, To my knowledge, there is no simple way to generate this kind of DataSet(i.e. there is no env.generateGaussianSequence()). However, if you look in flink-perf, Till used something like this there: https://github.com/project-flink/flink-perf/blob/master/flink-jobs/src/main/scala/com/github/projectflink/als/ALSDataGeneration.scala Maybe he can give you some tips.
You can also call random.nextGaussian() in Java. http://docs.oracle.com/javase/7/docs/api/java/util/Random.html#nextGaussian%28%29 Not sure if this helps, but there is a paper on generating this kind of distribution. http://ifisc.uib-csic.es/raul/publications/P/P44_tc93.pdf Best of luck, Andra On Fri, May 8, 2015 at 9:45 PM, Yi ZHOU <zhouyi0...@hotmail.com> wrote: > Hello, all > > when I tested AP algorithm, I had a little question : > how to generate a DataSet in gaussian distribution? Is there a > implemented funtion? > > Does any one has a solution? Thank you, > > ZHOU Yi >