Re: Spark Streaming updateStateByKey Implementation

2015-11-09 Thread Hien Luu
Thanks Zoltan. I will take a look at StateDStream.scala On Sun, Nov 8, 2015 at 2:42 AM, Zoltán Zvara wrote: > It is implemented with cogroup. Basically it stores states in a separate > RDD and cogroups the target RDD with the state RDD, which is then hidden > from you. See StateDStream.scala, t

Re: Spark Streaming updateStateByKey Implementation

2015-11-08 Thread Zoltán Zvara
It is implemented with cogroup. Basically it stores states in a separate RDD and cogroups the target RDD with the state RDD, which is then hidden from you. See StateDStream.scala, there is everything you need to know. On Fri, Nov 6, 2015 at 6:25 PM Hien Luu wrote: > Hi, > > I am interested in le

Spark Streaming updateStateByKey Implementation

2015-11-06 Thread Hien Luu
Hi, I am interested in learning about the implementation of updateStateByKey. Does anyone know of a jira or design doc I read? I did a quick search and couldn't find much info. on the implementation. Thanks in advance, Hien