Some terminology changes will help. We’re changing “cross-indicator” to “cross-cooccurrence indicator”. Both are indicators that predict your primary action.
Cross-cooccurrence finds the secondary interactions that are indicators or predictors of the primary action. In your case AddToCart is what you want to recommend and is therefore your primary action. Liked is “scrubbed” with the cross-cooccurrence calculation to yield additional indicators.

The query will be:

user-history-of-AddToCart -> recommendation field
user-history-of-Liked -> a new field, call it "liked-indicator"

You need to have a table with row ID = item-id and two multivalued fields, one containing the indicator-matrix values and one containing the cross-indicator-matrix values (soon to be renamed "similarity-matrix" and "cross-similarity-matrix").

On Mar 2, 2015, at 9:13 PM, Kevin Zhang <zhangyongji...@yahoo.com.INVALID> wrote:

Hi Pat,

Thanks for the spark-itemsimilarity. I just ran it against my csv file, which looks like the sample below. My two actions are Liked and AddToCart. AddToCart is the main action.

usr000d3ca6655-c132-11e4-ac0a-0cc47a03334d Liked prd0000b5bc44b-fca8-4462-ba19-b9b2b823beb6
usr000e615c7ff-c132-11e4-ac0a-0cc47a03334d Liked prd0002da21ddf-6004-4611-9ef5-4fe96bbb969f
usr000d4821d99-c132-11e4-ac0a-0cc47a03334d AddToCart prd000075a5710-4890-4ec7-8faa-d9ffb0e94e6d
usr000cfd955f3-c132-11e4-ac0a-0cc47a03334d Liked prd0000b5bc44b-fca8-4462-ba19-b9b2b823beb6
usr0000dcd0443-c133-11e4-ac0a-0cc47a03334d Liked prd000e8149493-f3a1-491c-abe1-e7886e0286b9
usr000dfc1e861-c132-11e4-ac0a-0cc47a03334d Liked prd0009d6ff4b1-b139-46de-8d4c-425dbee5b4ca
usr000d2db7271-c132-11e4-ac0a-0cc47a03334d Liked prd000db1268fa-7fcb-4c84-8877-3b75f2748fda
usr000d274251f-c132-11e4-ac0a-0cc47a03334d Liked prd00078f2d7a1-9d6b-4175-96b5-d50ee35ab6db
usr000fb647da9-c132-11e4-ac0a-0cc47a03334d AddToCart prd000613c24dd-b838-433f-b1bc-e8e4dddae4d7
usr000d34d4713-c132-11e4-ac0a-0cc47a03334d Liked prd0007ca32be4-2443-4bba-83b4-f01382968c07

I can see there are two output files. One is under cross-indicator-matrix and the other under indicator-matrix. I use Solr.
I created a field called recommendation. It's an indexed, multivalued string field. I loaded indicator-matrix/part-00000 into this field.

My question is: what is cross-indicator-matrix for? What's the difference between indicator-matrix and cross-indicator-matrix? Do I need to create another Solr field for the cross-indicator-matrix? When should I use it? Is indicator-matrix for AddToCart and cross-indicator-matrix for Liked?

Thanks a lot,
Kevin
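
As a rough illustration of the two-field query described in the reply at the top of this thread, the Python sketch below builds a Solr query string from a user's two action histories. It is only a sketch under assumptions: the helper name build_query is hypothetical, "recommendation" is the field name from Kevin's message, the second field is spelled liked_indicator (with an underscore rather than the hyphen suggested in the reply) purely as an illustrative choice, and joining terms with OR is one plausible way to let every matching item contribute to the score.

```python
def build_query(add_to_cart_history, liked_history,
                indicator_field="recommendation",
                cross_field="liked_indicator"):
    """Build a Solr query string matching each history against its own field.

    Field names and the OR-joining strategy are assumptions for illustration.
    """
    clauses = []
    for field, history in ((indicator_field, add_to_cart_history),
                           (cross_field, liked_history)):
        if history:
            # Quote each item ID so it is treated as a single literal term.
            terms = " OR ".join('"%s"' % item for item in history)
            clauses.append("%s:(%s)" % (field, terms))
    # A match in either field should count, so the clauses are OR-ed together.
    return " OR ".join(clauses)


if __name__ == "__main__":
    # Hypothetical histories; real ones would hold the user's prd... item IDs.
    print(build_query(["prdA"], ["prdB", "prdC"]))
```

The AddToCart history is matched against the field loaded from indicator-matrix, and the Liked history against the field loaded from cross-indicator-matrix; an empty history simply drops its clause.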