Re: Spark checkpoint restore failure due to s3 consistency issue

2015-10-09 Thread Tathagata Das
That wont really. What we need to see is the lifecycle of the file before the failure, so we need to the log4j logs. On Fri, Oct 9, 2015 at 2:34 PM, Spark Newbie wrote: > Unfortunately I don't have the before stop logs anymore since the log was > overwritten in my next run. > > I created a rdd-_

Re: Spark checkpoint restore failure due to s3 consistency issue

2015-10-09 Thread Spark Newbie
Unfortunately I don't have the before stop logs anymore since the log was overwritten in my next run. I created a rdd-_$folder$ file in S3 which was missing compared to the other rdd- checkpointed. The app started without the IllegalArgumentException. Do you still need to after restart log4j logs?

Re: Spark checkpoint restore failure due to s3 consistency issue

2015-10-09 Thread Tathagata Das
Can you provide the before stop and after restart log4j logs for this? On Fri, Oct 9, 2015 at 2:13 PM, Spark Newbie wrote: > Hi Spark Users, > > I'm seeing checkpoint restore failures causing the application startup to > fail with the below exception. When I do "ls" on the s3 path I see the key