Dear all,

I'm trying to run Flink ALS against the Yahoo-R2 data set [1] on HDFS. The program runs without showing any errors, but it never finishes. The operators that keep running indefinitely are:

    CoGroup (CoGroup at org.apache.flink.ml.recommendation.ALS$.updateFactors(ALS.scala:606))(11/240)
    Join(Join at org.apache.flink.ml.recommendation.ALS$.updateFactors(ALS.scala:576))(15/240)

I ran the job with the following parameters:

    val als = ALS().setIterations(10).setNumFactors(10).setBlocks(100)

I did not set the HDFS temporary path.

Can someone tell me which parameters to set in order to run ALS on a data set this large? And why do these operators run indefinitely?

[1] https://webscope.sandbox.yahoo.com/catalog.php?datatype=r

Best,
Ziyad
