Setting the vote rate in a Random Forest in MLlib

2015-12-16 Thread Young, Matthew T
One of our data scientists is interested in using Spark to improve performance in some random forest binary classifications, but with the available parameters she isn't getting results from MLlib's random forest implementation that are as good as those from R's randomForest library. She suggested th

Re: Random Forest in MLLib

2015-07-06 Thread Feynman Liang
Not yet, though work on this feature has begun (SPARK-5133).

On Mon, Jul 6, 2015 at 4:46 PM, Sourav Mazumder wrote:
> Hi,
>
> Is there a way to get variable importance for RandomForest model created
> using MLLib ? This way one can know among mul

Random Forest in MLLib

2015-07-06 Thread Sourav Mazumder
Hi, Is there a way to get variable importance for a RandomForest model created using MLlib? That way one can know which of the multiple features contribute the most to the dependent variable. Regards, Sourav