Re: How to know whether I'm in the first batch of spark streaming

2016-04-21 Thread Praveen Devarao
Sometimes courage is the quiet voice at the end of the day saying I will try again"
From: Yu Xie
To: Praveen Devarao/India/IBM@IBMIN
Cc: user@spark.apache.org
Date: 21/04/2016 01:40 pm
Subject: Re: How to know whether I'm in the first batch of spark streaming

Re: How to know whether I'm in the first batch of spark streaming

2016-04-21 Thread Yu Xie
Thank you, Praveen. In our Spark Streaming job, we write the data to an HDFS directory and use the batch time in MMDDHHHmm00 format as the directory name. So, when we stop the streaming job and start it again (we do not use checkpointing), in the init of the first batch, we will write d
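
One possible way to tell the first batch apart after a restart is a driver-side flag that flips once the first batch has been handled. The sketch below is plain Python with no Spark dependency, so it only illustrates the bookkeeping. `FirstBatchTracker`, `handle`, `on_first_batch`, and the `/data/out` path are hypothetical names, and the `strftime` pattern is an assumed reading of the directory format described above (month, day, hour, minute, then a literal `00`), not something confirmed in the thread.

```python
from datetime import datetime


class FirstBatchTracker:
    """Driver-side flag that is true only for the first batch after a (re)start."""

    def __init__(self, base_dir):
        self.base_dir = base_dir
        self.seen_first_batch = False
        self.first_dir = None

    def output_dir(self, batch_time):
        # Assumption: month, day, hour, minute, followed by a literal "00".
        return f"{self.base_dir}/{batch_time.strftime('%m%d%H%M')}00"

    def on_first_batch(self, target):
        # Hypothetical hook: one-time initialization would go here.
        self.first_dir = target

    def handle(self, batch_time, rdd):
        # The flag is read and flipped in one place, so only the first
        # call after construction sees is_first == True.
        is_first = not self.seen_first_batch
        self.seen_first_batch = True
        target = self.output_dir(batch_time)
        if is_first:
            self.on_first_batch(target)
        # A real job would write the batch here, e.g. rdd.saveAsTextFile(target).
        return is_first, target


tracker = FirstBatchTracker("/data/out")
first = tracker.handle(datetime(2016, 4, 21, 13, 40), None)
second = tracker.handle(datetime(2016, 4, 21, 13, 41), None)
print(first)   # (True, '/data/out/0421134000')
print(second)  # (False, '/data/out/0421134100')
```

In PySpark, something like `stream.foreachRDD(lambda t, rdd: tracker.handle(t, rdd))` should be enough to wire this in, since the function given to `foreachRDD` runs on the driver and can take `(time, rdd)`. Because no checkpoint is used, the flag starts as false again on every restart, which matches the "first batch after start" case described here.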

Re: How to know whether I'm in the first batch of spark streaming

2016-04-21 Thread Praveen Devarao
Hi Yu, could you provide more details on what you are trying to initialize and how? Is this initialization part of the code block in an action of the DStream? Say the second batch finishes before the first batch: wouldn't your results be affected, as init would have not take