Hi,
how i can use this option in Random Forest .
when i transform my vector (100 features ) i have 20 categoriel feature
include.
if i understand categorielFeatureinfo , i should past the position of my 20
categoriels feature inside of the vector containing 100 with map{
positionof feature insid
or when i call random forest algorithm with
>
> model = RandomForest.trainClassifier(rdf,
> numClasses=2,
> categoricalFeaturesInfo=d,
>
Hi,
how i can keep type of my variable like int
because i get this error when i call random forest algorithm with
model = RandomForest.trainClassifier(rdf,
numClasses=2,
categoricalFeaturesInfo=d
Hi,
i am pyspark user and i want test the Randoforest algrithmes.
i found this parmeters categoricalFeaturesInfo how i can use it from list
of categoriels variables .
thanks.
On Wed, May 20, 2015 at 2:44 PM, Don Drake wrote:
>>
>>> I'm running Spark v1.3.1 and when I run the following against my dataset:
>>>
>>> model = GradientBoostedTrees.trainRegressor(trainingData,
>>> categoricalFeaturesInfo=catFeatu
>>> res,
2:44 PM, Don Drake wrote:
>
>> I'm running Spark v1.3.1 and when I run the following against my dataset:
>>
>> model = GradientBoostedTrees.trainRegressor(trainingData,
>> categoricalFeaturesInfo=catFeatu
>> res, maxDepth=6, numIterations=3)
>>
>&
:
> I'm running Spark v1.3.1 and when I run the following against my dataset:
>
> model = GradientBoostedTrees.trainRegressor(trainingData,
> categoricalFeaturesInfo=catFeatu
> res, maxDepth=6, numIterations=3)
>
> The job will fail with the following message:
> Tra
I'm running Spark v1.3.1 and when I run the following against my dataset:
model = GradientBoostedTrees.trainRegressor(trainingData,
categoricalFeaturesInfo=catFeatu
res, maxDepth=6, numIterations=3)
The job will fail with the following message:
Traceback (most recent call last):
File &q