Bartosz Mikulski created FLINK-29094:
----------------------------------------

             Summary: Cannot stop Python job with savepoint when using Kafka Consumer
                 Key: FLINK-29094
                 URL: https://issues.apache.org/jira/browse/FLINK-29094
             Project: Flink
          Issue Type: Bug
          Components: API / Python
    Affects Versions: 1.15.1
         Environment: Local Environment
 * OS: Ubuntu 20.04
 * Java: openjdk version "11.0.12" 2021-07-20
 * Flink: 1.15.1 (clean installation)
 * Kafka: kafka_2.13-2.8.1
            Reporter: Bartosz Mikulski


We want to gracefully stop a Python job that reads from Kafka. However, running 
`flink stop --savepointPath` throws an exception and the job fails. The 
exception is in this 
[Gist|https://gist.github.com/BartMiki/6eab470a49d8a935ed3e7e544c3f278f#file-exception-txt].
 The savepoint type does not matter: both canonical and native fail. The 
workaround from this Stack Overflow [answer|https://stackoverflow.com/a/68280329] 
does not work either.
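
For reference, the stop-with-savepoint invocation described above can be sketched as a small Python helper that only assembles the command line. This is an illustration, not the reporter's actual script: the helper name `build_stop_command`, the default `./bin/flink` path, the `/tmp/flink-savepoints` directory and the `<jobId>` placeholder are all assumptions, and the `--type` flag is assumed to select between native and canonical savepoints as in the Flink 1.15 CLI.

```python
import shlex
from typing import List, Optional


def build_stop_command(
    job_id: str,
    savepoint_dir: str,
    savepoint_type: Optional[str] = None,
    flink_bin: str = "./bin/flink",  # assumed location of the Flink CLI
) -> List[str]:
    """Assemble the argv for `flink stop` with a savepoint (hypothetical helper)."""
    if savepoint_type not in (None, "native", "canonical"):
        raise ValueError("savepoint_type must be 'native', 'canonical' or None")

    cmd = [flink_bin, "stop"]
    if savepoint_type is not None:
        # Assumed flag for choosing the savepoint format.
        cmd += ["--type", savepoint_type]
    # The flag named in this report, followed by the target directory and job ID.
    cmd += ["--savepointPath", savepoint_dir, job_id]
    return cmd


if __name__ == "__main__":
    # "<jobId>" is a placeholder; substitute the ID shown by `flink list`.
    argv = build_stop_command("<jobId>", "/tmp/flink-savepoints", "canonical")
    print(shlex.join(argv))
```

The returned list can be passed to `subprocess.run` to reproduce the failure with either savepoint type.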

Checkpoints, however, work fine. The result is an odd situation: when you try 
to stop a Python job, the stop fails with an exception and the job is 
restarted from the latest checkpoint.

The setup was tested on a fresh default Flink 1.15.1 installation (we also 
tried 1.15.0 and got the same error). The error occurs regardless of the 
amount of data processed from Kafka, even when none has been processed. [Gist 
with minimal code required to trigger the 
exception|https://gist.github.com/BartMiki/6eab470a49d8a935ed3e7e544c3f278f].



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
