Hi Pat, Thanks for the spark-itemsimilarity. I just ran it against my csv file which looks like below. My two actions are Liked and AddToCart. AddToCart is the main action.
usr000d3ca6655-c132-11e4-ac0a-0cc47a03334d Liked prd0000b5bc44b-fca8-4462-ba19-b9b2b823beb6 usr000e615c7ff-c132-11e4-ac0a-0cc47a03334d Liked prd0002da21ddf-6004-4611-9ef5-4fe96bbb969f usr000d4821d99-c132-11e4-ac0a-0cc47a03334d AddToCart prd000075a5710-4890-4ec7-8faa-d9ffb0e94e6d usr000cfd955f3-c132-11e4-ac0a-0cc47a03334d Liked prd0000b5bc44b-fca8-4462-ba19-b9b2b823beb6 usr0000dcd0443-c133-11e4-ac0a-0cc47a03334d Liked prd000e8149493-f3a1-491c-abe1-e7886e0286b9 usr000dfc1e861-c132-11e4-ac0a-0cc47a03334d Liked prd0009d6ff4b1-b139-46de-8d4c-425dbee5b4ca usr000d2db7271-c132-11e4-ac0a-0cc47a03334d Liked prd000db1268fa-7fcb-4c84-8877-3b75f2748fda usr000d274251f-c132-11e4-ac0a-0cc47a03334d Liked prd00078f2d7a1-9d6b-4175-96b5-d50ee35ab6db usr000fb647da9-c132-11e4-ac0a-0cc47a03334d AddToCart prd000613c24dd-b838-433f-b1bc-e8e4dddae4d7 usr000d34d4713-c132-11e4-ac0a-0cc47a03334d Liked prd0007ca32be4-2443-4bba-83b4-f01382968c07 I can see there are two output files. One is under cross-indicator-matrix and the other under indicator-matrix. I use Solr. I created a field called recommendation. It's indexed multivalues string field. I loaded indicator-matrix/part-00000 into this filed. My question is what is cross-indicator-matrix for? What's the difference between indicator-matrix and cross-indicator-matrix? Do I need to create another Solr field for this cross-indicator-matrix? When I should use it? Is indicator-matrix for AddToCart and cross-indicator-matrix for Liked? Thanks a lot, Kevin