Hi Mathieu, I'm CCing the Spark user list since this will be of general interest to the forum. Unfortunately, there is not a way to begin LDA training with an existing model currently. Some MLlib models have been augmented to support specifying an "initialModel" argument, but LDA does not have this yet. Please feel free to make a feature request JIRA for it!
Thanks, Joseph On Thu, Mar 23, 2017 at 4:54 PM, Mathieu DESPRIEE <mdespr...@bluedme.com> wrote: > Hello Joseph, > > I saw your contribution to online LDA in Spark (SPARK-5563). Please allow > me a very quick question : > > I'm very much interested in training an LDA model incrementally with new > batches of documents. This online algorithm seems to fit, but from what I > understand of the current ml API, this is not possible to update a trained > model with new documents. > Is it ? > > Is there any way to get around the API and do that ? > > Thanks in advance for your insight. > > Mathieu > > -- Joseph Bradley Software Engineer - Machine Learning Databricks, Inc. [image: http://databricks.com] <http://databricks.com/>