This all describes how the implementation operates, logically. The
matrix P is never formed, for sure, certainly not by the caller.

The implementation actually extends to handle negative values in R too
but it's all taken care of by the implementation.

On Thu, Feb 12, 2015 at 11:29 PM, Crystal Xing <crystalxin...@gmail.com> wrote:
> HI Sean,
>
> I am reading the paper of implicit training.
>
> Collaborative Filtering for Implicit Feedback Datasets
>
> It mentioned
>
> "To this end, let us introduce
> a set of binary variables p_ui, which indicates the preference of user u to
> item i. The p_ui values are derived by
> binarizing the r_ui values:
> p_ui = 1 if  r_ui > 0
> and
>
> p_ui=0 if  r_ui = 0
>
> "
>
>
> If for user_item without interactions, I do not include it in the training
> data.  All the r_ui will >0 and all the p_ui is always 1?
> Or the Mllib's implementation automatically takes care of those no
> interaction user_product pairs ?
>
>
> On Thu, Feb 12, 2015 at 3:13 PM, Sean Owen <so...@cloudera.com> wrote:
>>
>> Where there is no user-item interaction, you provide no interaction,
>> not an interaction with strength 0. Otherwise your input is fully
>> dense.
>>
>> On Thu, Feb 12, 2015 at 11:09 PM, Crystal Xing <crystalxin...@gmail.com>
>> wrote:
>> > Hi,
>> >
>> > I have some implicit rating data, such as the purchasing data.  I read
>> > the
>> > paper about the implicit training algorithm used in spark and it
>> > mentioned
>> > the for user-prodct pairs which do not have implicit rating data, such
>> > as no
>> > purchase, we need to provide the value as 0.
>> >
>> > This is different from explicit training where when we provide training
>> > data, for user-product pair without a rating, we just do not have them
>> > in
>> > the training data instead of adding a user-product pair with rating 0.
>> >
>> > Am I understand this correctly?
>> >
>> >  Or for implicit training implementation in spark, the missing data will
>> > be
>> > automatically filled out as zero and we do not need to add them in the
>> > training data set?
>> >
>> > Thanks,
>> >
>> > Crystal.
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to