Re: "overloaded method value updateStateByKey ... cannot be applied to ..." when Key is a Tuple2

2014-11-12 Thread Steve Reinhardt
I'm missing something simpler (I think). That is, why do I need a Some instead of Tuple2? Because a Some might or might not be there, but a Tuple2 must be there? Or something like that? From: Adrian Mocanu mailto:amoc...@verticalscope.com>> You are correct; the filtering I’m talking about i

Re: how to blend a DStream and a broadcast variable?

2014-11-06 Thread Steve Reinhardt
Excellent. Is there an example of this somewhere? Sent from my iPhone > On Nov 6, 2014, at 1:43 AM, Sean Owen wrote: > > Broadcast vars should work fine in Spark streaming. Broadcast vars are > immutable however. If you have some info to cache which might change > from batch to batch, you shoul

Re: Streaming: which code is (not) executed at every batch interval?

2014-11-04 Thread Steve Reinhardt
From: Sean Owen >Maybe you are looking for updateStateByKey? >http://spark.apache.org/docs/latest/streaming-programming-guide.html#trans >formations-on-dstreams > >You can use broadcast to efficiently send info to all the workers, if >you have some other data that's immutable, like in a local fil

Re: Streaming: which code is (not) executed at every batch interval?

2014-11-04 Thread Steve Reinhardt
-Original Message- From: Sean Owen >On Tue, Nov 4, 2014 at 8:02 PM, spr wrote: >> To state this another way, it seems like there's no way to straddle the >> streaming world and the non-streaming world; to get input from both a >> (vanilla, Linux) file and a stream. Is that true? >> >>

Re: with SparkStreeaming spark-submit, don't see output after ssc.start()

2014-11-03 Thread Steve Reinhardt
From: Tobias Pfeiffer mailto:t...@preferred.jp>> Am I right that you are actually executing two different classes here? Yes, I realized after I posted that I was calling 2 different classes, though they are in the same JAR. I went back and tried it again with the same class in both cases, an