The implementation accepts only an RDD of LabeledPoint, so you can't feed in strings from a text file directly. A LabeledPoint wraps a double label and a vector of double features, not strings. How were you trying to create the input, then?
No, it only accepts numeric values. You can, however, encode categorical values as 0, 1, 2, ... and tell the implementation which features are categorical so that it treats them as such.

On Fri, Jan 16, 2015 at 9:25 PM, Asaf Lahav <asaf.la...@gmail.com> wrote:
> Hi,
>
> I have been playing around with the new version of Spark MLlib Random forest
> implementation, and while in the process, tried it with a file with String
> Features.
> While training, it fails with:
> java.lang.NumberFormatException: For input string.
>
>
> Is MLlib Random forest adapted to run on top of numeric data only?
>
> Thanks
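For what it's worth, here is a rough sketch of the encoding suggested in the reply: map each distinct string in a categorical column to 0, 1, 2, ... and record how many categories each such column has. It is plain Python with no Spark dependency; the sample rows, column layout, and helper names (build_category_index, encode_row) are made up for illustration and are not part of MLlib.

```python
def build_category_index(rows, categorical_columns):
    """Assign 0, 1, 2, ... to each distinct string, per categorical column."""
    index = {col: {} for col in categorical_columns}
    for row in rows:
        for col in categorical_columns:
            mapping = index[col]
            if row[col] not in mapping:
                mapping[row[col]] = len(mapping)
    return index


def encode_row(row, index):
    """Turn one row of strings into a list of floats."""
    return [
        float(index[col][value]) if col in index else float(value)
        for col, value in enumerate(row)
    ]


# Hypothetical input: column 0 is categorical, column 1 is already numeric.
rows = [["red", "1.5"], ["blue", "2.0"], ["red", "0.5"]]

index = build_category_index(rows, categorical_columns=[0])
encoded = [encode_row(row, index) for row in rows]

# Feature index -> number of categories for that feature.
categorical_features_info = {col: len(m) for col, m in index.items()}

# With PySpark, each encoded row would then be wrapped as
# LabeledPoint(label, encoded_row), and categorical_features_info passed as
# the categoricalFeaturesInfo argument of RandomForest.trainClassifier.
```

On a real data set you would build the index in a first pass over the data (or from a known list of categories) before encoding. Also note that MLlib expects maxBins to be at least as large as the largest number of categories in any categorical feature.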