Re: CoordinatorStream errors

2015-12-21 Thread Navina Ramesh
Ok. Sounds good. Thanks!

On Mon, Dec 21, 2015 at 11:38 AM, Rick Mangi wrote:
> Hi Navina,
>
> It stopped happening once I deleted an old checkpoint topic. I think in
> the rapid development cycle my checkpoints became invalid. If it happens
> again I will save the logs.
>
> Thanks!
>
> On De

Re: CoordinatorStream errors

2015-12-21 Thread Rick Mangi
Hi Navina,

It stopped happening once I deleted an old checkpoint topic. I think in the rapid development cycle my checkpoints became invalid. If it happens again I will save the logs.

Thanks!

> On Dec 21, 2015, at 2:14 PM, Navina Ramesh wrote:
>
> Hi Rick,
> Can you share the entire log

Re: CoordinatorStream errors

2015-12-21 Thread Navina Ramesh
Hi Rick,

Can you share the entire log for this issue? I suspect the concurrent access happens on the bootstrappedSet (a LinkedHashSet, which is not thread-safe) between the Job Coordinator and SamzaAppMaster. When a container fails, the AM tries to read the locality information. If some other container req
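
To illustrate the kind of race suspected above, here is a minimal Java sketch of guarding a LinkedHashSet that is shared between threads. It is not Samza's code: the class and method names (BootstrappedSetSketch, newGuardedSet, fillConcurrently, countSnapshot) are hypothetical, and the string entries only stand in for whatever the real bootstrappedSet holds. It assumes the simplest fix, wrapping the set with Collections.synchronizedSet and locking on it while iterating.

```java
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.Set;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class BootstrappedSetSketch {

    // Hypothetical stand-in for the shared set: a LinkedHashSet wrapped so
    // that individual calls such as add() and contains() are synchronized.
    static Set<String> newGuardedSet() {
        return Collections.synchronizedSet(new LinkedHashSet<>());
    }

    // Several writer threads add distinct entries at the same time and the
    // final size is returned. With a bare LinkedHashSet this can lose
    // entries or corrupt the set's internal state.
    static int fillConcurrently(Set<String> set, int writers, int perWriter) {
        ExecutorService pool = Executors.newFixedThreadPool(writers);
        for (int w = 0; w < writers; w++) {
            final int id = w;
            pool.execute(() -> {
                for (int i = 0; i < perWriter; i++) {
                    set.add("writer" + id + "-entry" + i);
                }
            });
        }
        pool.shutdown();
        try {
            if (!pool.awaitTermination(30, TimeUnit.SECONDS)) {
                throw new IllegalStateException("writers did not finish in time");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting", e);
        }
        return set.size();
    }

    // Iteration is not covered by the wrapper: the caller has to hold the
    // set's own lock, otherwise a concurrent add() can still trigger a
    // ConcurrentModificationException.
    static int countSnapshot(Set<String> set) {
        synchronized (set) {
            int n = 0;
            for (String ignored : set) {
                n++;
            }
            return n;
        }
    }

    public static void main(String[] args) {
        Set<String> guarded = newGuardedSet();
        System.out.println("entries added: " + fillConcurrently(guarded, 4, 1000));
        System.out.println("entries iterated: " + countSnapshot(guarded));
    }
}
```

A concurrent collection or a single lock around all accesses would work as well; the wrapper is used here only because it keeps the LinkedHashSet's insertion order with the smallest change.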