MLlib ALS MatrixFactorizationModel.save fails consistently

2016-04-07 Thread Colin Woodbury
Hi all, I've implemented most of a content recommendation system for a client. However, whenever I attempt to save a MatrixFactorizationModel I've trained, I see one of four outcomes: 1. Despite "save" being wrapped in a "try" block, I see a massive stack trace quoting some java.io classes. The M

[MLlib - ALS] Merging two Models?

2016-03-10 Thread Colin Woodbury
Hi there, I'm wondering if it's possible (or feasible) to combine the feature matrices of two MatrixFactorizationModels that share a user and product set. Specifically, one model would be the "on-going" model, and the other is one trained only on the most recent aggregation of some event data. My

Compiling only MLlib?

2016-01-15 Thread Colin Woodbury
Hi, I'm very much interested in using Spark's MLlib in standalone programs. I've never used Hadoop, and don't intend to deploy on massive clusters. Building Spark has been an honest nightmare, and I've been at it on and off for weeks. The build always runs out of RAM on my laptop (4 GB of RAM, Arch Lin