Re: Spark random forest - string data

2015-01-16 Thread Nick Allen
> > > > Is MBLib Random forest adapted to run on top of numeric data only? > > > > Thanks > > - > To unsubscribe, e-mail: user-unsubscr...@spark.apache.org > For additional commands, e-mail: user-h...@spark.apache.org > > -- Nick Allen

Re: How to 'Pipe' Binary Data in Apache Spark

2015-01-16 Thread Nick Allen
ne project from RIPE-NCC ( https://github.com/RIPE-NCC/hadoop-pcap) that does this. Unfortunately, it appears to only support a limited set of network protocols. On Fri, Jan 16, 2015 at 10:40 AM, Nick Allen wrote: > Per your last comment, it appears I need something like this: > > http

Re: How to 'Pipe' Binary Data in Apache Spark

2015-01-16 Thread Nick Allen
> to put it into an RDD). You can encode this as a SequenceFile and read > it with objectFile. > > You could also write a custom InputFormat that knows how to parse pcap > records directly. > > On Fri, Jan 16, 2015 at 3:09 PM, Nick Allen wrote: > > I have an RDD containing bi

How to 'Pipe' Binary Data in Apache Spark

2015-01-16 Thread Nick Allen
xtFile("text-data.csv") Specifically, I am trying to use Spark to transform pcap (packet capture) data to text/csv so that I can perform an analysis on it. Thanks! -- Nick Allen