Thank you very much. I'll jump in to the data and verify the
consistency between the training and testing variables and their
levels.

On Fri, Mar 7, 2008 at 5:14 PM,  <[EMAIL PROTECTED]> wrote:
> The error message is pretty clear, really.  To spell it out a bit more,
>  what you have done is as follows.
>
>  Your training set has factor variables in it.  Suppose one of them is
>  "f".  In the training set it has 5 levels, say.
>
>  Your test set also has a factor "f", as it must, but it appears that in
>  the test set it has 6 levels, or more, or levels that do not agree with
>  those for "f" in the training set.
>
>  This mismatch measn that the predict method for randomForest cannot use
>  this test set.
>
>  What you have to do is make sure that the factor levels agree for every
>  factor in both test and training set. One way to do this is to put the
>  test and training set together with rbind(...) say, and then separate
>  them again.  But even this will still have a problem for you.  Because
>  you training set will have some factor levels empty, which are not empty
>  in the test set.  The error will most likely be more subtle, though.
>
>  You really need to sort this out yourself.  It is not particularly an R
>  problem, but a confusion over data.  To be useful, your training set
>  need to cover the field for all levels of every factor.  Think about it.
>
>
>
>  -----Original Message-----
>  From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
>  On Behalf Of Nagu
>  Sent: Saturday, 8 March 2008 5:37 AM
>  To: r-help@r-project.org; [EMAIL PROTECTED]
>  Subject: [R] error in random forest
>
>  Hi,
>
>  I get the following error when I try to predict the probabilities of a
>  test sample:
>
>  Error in predict.randomForest(fit.EBA.OM.rf.50, x.OM, type = "prob") :
>   New factor levels not present in the training data
>
>  I have about 630 predictor variables in the dataset x.OM (25 factor
>  variables and the remaining are continuous variables). Any ideas on
>  how to trace it?
>
>  Thank you,
>  Nagu
>
>  ______________________________________________
>  R-help@r-project.org mailing list
>  https://stat.ethz.ch/mailman/listinfo/r-help
>  PLEASE do read the posting guide
>  http://www.R-project.org/posting-guide.html
>  and provide commented, minimal, self-contained, reproducible code.
>
>
>

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to