Re: [SS]Kafka EOS transaction timeout solution

2019-09-03 Thread wenxuan Guan
Thanks for your reply. 1. About submitting tasks to commit the Kafka transaction: committing a transaction in the Kafka producer is a lightweight action compared with sending messages to Kafka, so there is little chance that the commit fails. However, to implement EOS, we must consider commit transaction fail, and we

Re: [SS]Kafka EOS transaction timeout solution

2019-09-02 Thread Genmao Yu
Increasing `transaction.timeout.ms` on both the Kafka client and server side may not be the best solution, but it is a workable one. In the design, Spark will submit tasks to all executors to do the Kafka commit (the second phase in 2PC). This will increase the possibility of commit failure. Besides, users may be clear t
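
To make the client/server relationship concrete, here is a minimal sketch in plain Python (no Kafka client involved). It assumes the documented Kafka defaults: the producer's `transaction.timeout.ms` defaults to 60000 and the broker caps it with `transaction.max.timeout.ms`, which defaults to 900000. The helper name `check_transaction_timeout` and the `transactional.id` value are hypothetical, for illustration only.

```python
# Defaults as documented by Kafka (assumed here, not read from a cluster).
PRODUCER_DEFAULT_TXN_TIMEOUT_MS = 60_000     # producer `transaction.timeout.ms`
BROKER_DEFAULT_MAX_TXN_TIMEOUT_MS = 900_000  # broker `transaction.max.timeout.ms`


def check_transaction_timeout(producer_conf, broker_conf):
    """Hypothetical helper: return the producer's requested transaction timeout,
    or raise ValueError if it exceeds the broker-side maximum (in which case the
    broker would reject the producer's initialization request)."""
    requested = int(producer_conf.get("transaction.timeout.ms",
                                      PRODUCER_DEFAULT_TXN_TIMEOUT_MS))
    allowed = int(broker_conf.get("transaction.max.timeout.ms",
                                  BROKER_DEFAULT_MAX_TXN_TIMEOUT_MS))
    if requested > allowed:
        raise ValueError(
            f"transaction.timeout.ms={requested} exceeds broker "
            f"transaction.max.timeout.ms={allowed}"
        )
    return requested


# Example producer settings for a transactional sink (values are illustrative).
producer_conf = {
    "transactional.id": "ss-kafka-sink-example",  # hypothetical id
    "enable.idempotence": "true",
    "transaction.timeout.ms": "600000",           # raised from the 60 s default
}
broker_conf = {"transaction.max.timeout.ms": "900000"}

effective_timeout_ms = check_transaction_timeout(producer_conf, broker_conf)
```

If the producer value needs to go above the broker cap, the broker setting has to be raised first; raising only the client value is not enough.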

[SS]Kafka EOS transaction timeout solution

2019-08-31 Thread wenxuan Guan
Hi all, I have implemented Structured Streaming Kafka sink EOS with the Kafka transactional producer. The design sketch is in SPARK-28908 (https://issues.apache.org/jira/browse/SPARK-28908) and the PR is 25618 (https://github.com/apache/spark/pull/25618). But now I have met a problem, as below. When producer faile