I am asking the basic reproducibility question. If I run twice on the same dataset, with the same hardware setup, will I always get the same resuts? Or is there any chance that on two different runs, the same user will get slightly different suggestions? I am mostly revolving in the space of numerical libraries, where reproducibility is, sort of, a big deal. Maybe it's not much of a concern in machine learning. I am just curious.
On Sun, Mar 17, 2013 at 8:46 AM, Sean Owen <[email protected]> wrote: > What's your question? ALS has a random starting point which changes the > results a bit. Not sure about KNN though. > > > On Sun, Mar 17, 2013 at 3:03 AM, Koobas <[email protected]> wrote: > > > Can anybody shed any light on the issue of reproducibility in Mahout, > > with and without Hadoop, specifically in the context of kNN and ALS > > recommenders? > > >
