We have an open-sourced Random Forest at Alpine Data Labs with the Apache license. We're also trying to have it merged into Spark MLlib now.
https://github.com/AlpineNow/alpineml It's been tested a lot, and the accuracy and training time benchmark is great. There could be some bugs here and there, so we're looking forward to your feedback, and please let us know what you think. We'll continue to improve it and we'll be adding Gradient Boosting in the near future as well. Thanks. Sincerely, DB Tsai ------------------------------------------------------- My Blog: https://www.dbtsai.com LinkedIn: https://www.linkedin.com/in/dbtsai On Mon, Aug 11, 2014 at 10:52 AM, Sameer Tilak <ssti...@live.com> wrote: > Hi All, > I read on the mailing list that random forest implementation was on the > roadmap. I wanted to check about its status? We are currently using Weka > and would like to move over to MLib for performance. >