This is because of the RDD's lazy evaluation! Unless you force a transformed (mapped/filtered/etc.) RDD to give you back some data through an action, such as RDD.count(), or to output the data, such as RDD.saveAsTextFile(), Spark will not actually do anything.
So after the eventsData.map(...), if you call take(10) and then print the result, the map will actually execute, as in the sketch below.
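Here is a minimal, self-contained sketch of that idea (the LazyEvalDemo class name and the sample data are just illustrative, not from your code): the map is only recorded in the RDD lineage, and nothing runs until the take(10) action.

import java.util.Arrays;
import java.util.List;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.function.Function;

public class LazyEvalDemo {
    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("LazyEvalDemo").setMaster("local[*]");
        JavaSparkContext sc = new JavaSparkContext(conf);

        JavaRDD<Integer> numbers = sc.parallelize(Arrays.asList(1, 2, 3, 4, 5));

        // This map is only recorded in the lineage; nothing executes yet.
        JavaRDD<String> mapped = numbers.map(new Function<Integer, String>() {
            @Override
            public String call(Integer n) {
                return "value-" + n;
            }
        });

        // take(10) is an action: it forces the map above to actually run.
        List<String> result = mapped.take(10);
        for (String s : result) {
            System.out.println(s);
        }

        sc.stop();
    }
}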
I tried to map an RDD of SparkFlumeEvents to an RDD of Strings as shown below, but the map and its call method are never executed. I might be doing this the wrong way; any help would be appreciated.
flumeStream.foreach(new Function<JavaRDD<SparkFlumeEvent>, Void>() {
    @Override
    public Void call(JavaRDD<SparkFlumeEvent> eventsData)
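For what it's worth, here is a hedged sketch of how this snippet could be completed so the map actually runs. It assumes the old JavaDStream.foreach(Function<JavaRDD<T>, Void>) API (newer Spark releases name this foreachRDD), that the Flume event body is UTF-8 text, and placeholder values for the host, port, and batch interval:

import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.List;

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.function.Function;
import org.apache.spark.streaming.Duration;
import org.apache.spark.streaming.api.java.JavaDStream;
import org.apache.spark.streaming.api.java.JavaStreamingContext;
import org.apache.spark.streaming.flume.FlumeUtils;
import org.apache.spark.streaming.flume.SparkFlumeEvent;

public class FlumeEventPrinter {
    public static void main(String[] args) throws Exception {
        SparkConf conf = new SparkConf().setAppName("FlumeEventPrinter");
        JavaStreamingContext jssc = new JavaStreamingContext(conf, new Duration(2000));

        // Placeholder host/port for the Flume sink.
        JavaDStream<SparkFlumeEvent> flumeStream =
                FlumeUtils.createStream(jssc, "localhost", 41414);

        flumeStream.foreach(new Function<JavaRDD<SparkFlumeEvent>, Void>() {
            @Override
            public Void call(JavaRDD<SparkFlumeEvent> eventsData) throws Exception {
                // The map alone is lazy; it only records the transformation.
                JavaRDD<String> lines = eventsData.map(new Function<SparkFlumeEvent, String>() {
                    @Override
                    public String call(SparkFlumeEvent event) {
                        // Assumes the Avro event body is UTF-8 encoded text.
                        ByteBuffer bodyBuf = event.event().getBody();
                        byte[] body = new byte[bodyBuf.remaining()];
                        bodyBuf.get(body);
                        return new String(body, StandardCharsets.UTF_8);
                    }
                });
                // take(10) is an action, so it finally forces the map to execute.
                List<String> first = lines.take(10);
                for (String line : first) {
                    System.out.println(line);
                }
                return null;
            }
        });

        jssc.start();
        jssc.awaitTermination();
    }
}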