Re: spark lacks fault tolerance with dynamic partition overwrite

2020-04-07 Thread Koert Kuipers
ah ok i was not aware of that jira issue. i will follow the progress there. thanks for letting me known On Tue, Apr 7, 2020 at 11:20 AM wuyi wrote: > Hi, Koert, > > The community is back to this issue to recently and there's already a fix > https://github.com/apache/spark/pull/26339 for it. > >

Re: spark lacks fault tolerance with dynamic partition overwrite

2020-04-07 Thread wuyi
Hi, Koert, The community is back to this issue to recently and there's already a fix https://github.com/apache/spark/pull/26339 for it. You can track and review it there. Best, Yi Wu -- Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/ -

spark lacks fault tolerance with dynamic partition overwrite

2020-04-02 Thread Koert Kuipers
i wanted to highlight here the issue we are facing with dynamic partition overwrite. it seems that any tasks that writes to disk using this feature and that need to be retried fails upon retry, leading to a failure for the entire job. we have seen this issue show up with preemption (task gets kil