Re: Flink ML recommender system API

Gábor Hermann Tue, 04 Oct 2016 09:03:56 -0700

Thank you both for your detailed replies.

I think we all agree on extending the evaluation framework to handlerecommendation models, and choosing the scalable form of ranking, sowe'll do it that way. For now we will work upon Theodore's PR.

Thanks for giving me the reasons behind the design decision about nothaving a separate object for the trained model. I haven't thought aboutthe implications of pipelines, so I think we should keep the currentdesign and align our new algorithms to it. Of course, we can bring up adiscussion later and reconsider this design, but I see that it's aseparate issue.

I think it would be good to implement a ScoreMatrixFactorizationRecommender
and a RankingMatrixFactorizationRecommender which both work on a
MatrixFactorizationModel. This model can then either be computed by ALS or
DSGD. This could be controlled by a configuration parameter of the
recommenders.

Do you mean having two different predictors, i.e.Predictor[ScoreMatrixFactorizationRecommender] andPredictor[RankingMatrixFactorizationRecommender]?If I understand right, there should be one common classMatrixFactorizationModel instead of distinct ALS and DSGD classes, andit should be a configuration parameter which one to use for training?

I like this idea, as both trainers would require almost the sameconfiguration. AFAIK there would be an additional 'LearningRate'parameter for DSGD, but apart from that the two configs are the same.

What do you mean with more "typesafe"? I don't see how returning the
trained model from the fit method gives you more type safety.

I probably used the wrong word here. I simply meant that using aseparate type for the trained model, the type ensures that the trainedmodel cannot be trained again, while an untrained model cannot be usedfor prediction.

Regarding the DSGD algorithm, I think it uses another samplingmechanism, and we cannot reuse the simple SGD solver. However, we willmake sure not to write duplicate code for the same problem. We've alsonoticed, independently from DSGD, that the SGD solver is a GD solver inreality, but I have not found the related issues and discussion, sopointing me to them was really useful, thanks!


Cheers,
Gabor

Re: Flink ML recommender system API

Reply via email to