mridulm commented on PR #50033: URL: https://github.com/apache/spark/pull/50033#issuecomment-2803443037
I just checked the JDBC integration. The issue is that we always commit as soon as all rows of a task have been written. Instead, we should commit the transaction only as part of the commit protocol, that is, once the driver lets the task know it can commit. This is a problem even without any determinism issue: with speculative execution, for example, two attempts of the same task can race to commit.
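
To make the suggestion concrete, here is a minimal sketch of the idea, not Spark's actual JDBC writer. It uses Python's built-in `sqlite3` in place of a JDBC connection, and `CommitCoordinator`, `write_partition`, `run_demo`, and the `target` table are hypothetical names invented for illustration. The coordinator stands in for the driver: the first attempt that asks for a partition is allowed to commit, and any later attempt is denied and rolls back. The sketch assumes the attempts run one after another and ignores the case where the authorized attempt fails after being granted permission.

```python
import os
import sqlite3
import tempfile


class CommitCoordinator:
    """Hypothetical stand-in for the driver: the first attempt to ask for a
    partition wins; every other attempt for that partition is denied."""

    def __init__(self):
        self._winners = {}  # partition id -> attempt id allowed to commit

    def can_commit(self, partition, attempt):
        return self._winners.setdefault(partition, attempt) == attempt


def write_partition(db_path, coordinator, partition, attempt, rows):
    """Insert all rows in one transaction; commit only if the coordinator agrees."""
    conn = sqlite3.connect(db_path)
    try:
        conn.executemany(
            "INSERT INTO target(partition_id, value) VALUES (?, ?)",
            [(partition, value) for value in rows],
        )
        # All rows are written, but the transaction is still open here.
        if coordinator.can_commit(partition, attempt):
            conn.commit()
            return True
        conn.rollback()  # a losing (e.g. speculative) attempt leaves no rows
        return False
    except Exception:
        conn.rollback()
        raise
    finally:
        conn.close()


def run_demo():
    """Run two attempts of the same partition and count the committed rows."""
    fd, db_path = tempfile.mkstemp(suffix=".db")
    os.close(fd)
    try:
        setup = sqlite3.connect(db_path)
        setup.execute("CREATE TABLE target(partition_id INTEGER, value TEXT)")
        setup.commit()
        setup.close()

        coordinator = CommitCoordinator()
        rows = ["a", "b", "c"]
        first = write_partition(db_path, coordinator, 0, 0, rows)
        speculative = write_partition(db_path, coordinator, 0, 1, rows)

        check = sqlite3.connect(db_path)
        count = check.execute("SELECT COUNT(*) FROM target").fetchone()[0]
        check.close()
        return first, speculative, count
    finally:
        os.remove(db_path)


if __name__ == "__main__":
    print(run_demo())
```

In this sketch `run_demo()` returns `(True, False, 3)`: the second attempt writes the same three rows but rolls back, so the table holds one copy. If each attempt committed as soon as its rows were written, the table would hold six rows.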